Preface



The major expectation from the fourth generation (4G) of wireless communication networks
is the ability to handle much higher data rates, in the range of 1 Gb/s in
the WLAN environment and 100 Mb/s in cellular networks. A highly mobile user
will access the network and will be able to reconnect seamlessly to different
networks, even within the same session. Spectrum allocation is expected to be more
flexible, and even flexible spectrum sharing among the different subnetworks is anticipated.
In such a 'composite radio environment' (CRE), there will be a need for more adaptive and
reconfigurable solutions on all layers in the network. For this reason the first part of the book
deals with adaptive link, MAC, network and TCP layers, including a chapter on cross-layer
optimization. This is followed by chapters on mobility management and adaptive radio
resource management. The composite radio environment will include WLAN,
cellular mobile networks, digital video broadcasting, satellite, mobile ad hoc and sensor
networks.

Two additional chapters on ad hoc and sensor networks should help the reader understand 
the main problems and available solutions in these fields. The above chapters are followed 
by a chapter on security, which is a very important segment of wireless networks. 

Within the more advanced solutions, the chapter on active networks covers topics like
programmable networks, reference models, evolution to 4G wireless networks, 4G mobile
network architecture, cognitive packet networks, random neural network-based algorithms,
game theory models in cognitive radio networks, cognitive radio networks as a game
and biologically inspired networks, including bionet architecture.

Among other topics, the chapter on network management includes self-organization
in 4G networks, mobile agent-based network management, mobile agent platform, mobile 
agents in multioperator networks, integration of routing algorithm and mobile agents and 
ad hoc network management. 

Network information theory has become an important segment of the research, and the 
chapter covering this topic includes effective capacity of advanced cellular network, capacity 
of ad hoc networks, information theory and network architectures, cooperative transmission 
in wireless multihop ad hoc networks, network coding, capacity of wireless networks using 
MIMO technology and capacity of sensor networks with many-to-one transmissions. Two
additional chapters, energy efficient wireless networks and QoS management, are also 
included in the book. 

As an extra resource a significant amount of material is available on the book's com- 
panion website at www.wiley.com/go/glisic in the form of three comprehensive appendices: 
Appendix A provides a review of the protocol stacks for the most important existing wire- 
less networks, Appendix B presents a comprehensive review of results for the MAC layer 
and Appendix C provides an introduction to queueing theory. 

The material included in this book is a result of the collective effort of researchers across 
the globe. Wherever appropriate, reference is made to the original work, measurement results
or diagrams. The list of references includes approximately 2000 titles.

Discussions and cooperation with Professor P. R. Kumar, of the Coordinated Science
Laboratory, University of Illinois at Urbana-Champaign, had a significant impact, especially
on the network information theory material presented in the book. Professor Imrich
Chlamtac, of the University of Texas at Dallas, helped a great deal with the material regarding
bioinspired nets. Professor Carlos Pomalaza-Raez, of Indiana-Purdue University, USA,
inspired the presentation on ad hoc and sensor networks. Professor Kaveh Pahlavan of
Worcester Polytechnic Institute, Massachusetts, inspired the presentations of the WLAN
technology. Dr. Moe Win of Massachusetts Institute of Technology provided a set of original
diagrams on Ultra Wide Band Channel measurements.

The author would also like to thank Professor P. Leppanen, J. P. Makela, P. Nissinaho
and Z. Nikolic for their help with the graphics.



Savo G. Glisic 
Oulu 
Fundamentals



1.1 4G NETWORKS AND COMPOSITE RADIO ENVIRONMENT

In the wireless communications community we are increasingly witnessing the emergence
of the composite radio environment (CRE) and, as a consequence, the need for reconfigurability
concepts. The CRE assumes that different radio networks can be cooperating components
in a heterogeneous wireless access infrastructure, through which network providers
can more efficiently achieve the required capacity and quality of service (QoS) levels. Re- 
configurability enables terminals and network elements to dynamically select and adapt 
to the most appropriate radio access technologies for handling conditions encountered in 
specific service area regions and time zones of the day. Both concepts pose new require- 
ments on the management of wireless systems. Nowadays, a multiplicity of radio access 
technology (RAT) standards are used in wireless communications. As shown in Figure 1.1, 
these technologies can be roughly categorized into four sets: 

• Cellular networks that include second-generation (2G) mobile systems, such as Global
System for Mobile Communications (GSM) [1], and their evolutions, often called 2.5G
systems, such as Enhanced Data rates for GSM Evolution (EDGE), General Packet Radio Service
(GPRS) [2] and IS-136 in the USA. These systems are based on TDMA technology.
Third-generation (3G) mobile networks, known as Universal Mobile Telecommunications
Systems (UMTS; WCDMA and cdma2000) [3], are based on CDMA technology that
provides up to 2 Mbit/s. In these networks 4G solutions are expected to provide up to
100 Mbit/s. The solutions will be based on a combination of multicarrier and space-time
signal formats. The network architectures include macro-, micro- and picocellular
networks and home (HAN) and personal area networks (PAN).

• Broadband radio access networks (BRANs) [4], or wireless local area networks (WLANs)
[5], which are expected to provide up to 1 Gb/s in 4G. These technologies are based on 
orthogonal frequency division multiple access (OFDMA) and space-time coding. 




Figure 1.1 Composite radio environment in 4G networks. 



• Digital video broadcasting (DVB) [6] and satellite communications. 

• Ad hoc and sensor networks with emerging applications. 

Although 4G is open for new multiple access schemes, the CRE concept remains attrac- 
tive for increasing the service provision efficiency and the exploitation possibilities of the 
available RATs. The main assumption is that the different radio networks (GPRS, UMTS,
BRAN/WLAN, DVB, and so on) can be components of a heterogeneous wireless access
infrastructure. A network provider (NP) can own several components of the CR infras- 
tructure (in other words, can own licenses for deploying and operating different RATs), 
and can also cooperate with affiliated NPs. In any case, an NP can rely on several alterna- 
tive radio networks and technologies to achieve the required capacity and QoS levels, in a 
cost-efficient manner. Users are directed to the most appropriate radio networks and tech- 
nologies, at different service area regions and time zones of the day, based on profile 
requirements and network performance criteria. The various RATs are thus used in a 
complementary manner rather than competing with each other. Even nowadays a mobile
handset can make a handoff between different RATs. The deployment of CRE systems can 
be facilitated by the reconfigurability concept, which is an evolution of software-defined 
radio [7, 8]. CRE requires terminals that are able to work with different RATs and the 
existence of multiple radio networks, offering alternative wireless access capabilities to 
service area regions. Reconfigurability supports the CRE concept by providing essential 
technologies that enable terminals and network elements to dynamically (transparently and 
securely) select and adapt to the set of RATs that is most appropriate for the conditions
encountered in specific service area regions and time zones of the day. According to the 
reconfigurability concept, RAT selection is not restricted to those technologies pre-installed 
in the network element. In fact, the required software components can be dynamically down- 
loaded, installed and validated. This makes it different from the static paradigm regarding 
the capabilities of terminals and network elements. 

The networks provide wireless access to IP-based applications, and service continuity in 
light of intrasystem mobility. Integration of the network segments in the CR infrastructure is 
achieved through the management system for CRE (MS-CRE) component attached to each
network. The management system in each network manages a specific radio technology; 
however, the platforms can cooperate. The fixed (core and backbone) network will consist of 
public and private segments built on IPv4- and IPv6-based infrastructure. Mobile IP (MIP)
will enable the maintenance of IP-level connectivity regardless of the likely changes in the 
underlying radio technologies used that will be imposed by the CRE concept. Figures 1.2 
and 1.3 depict the architecture of a terminal that is capable of operating in a CRE context. 
The terminals include software and hardware components (layer 1 and 2 functionalities) 
for operating with different systems. The higher protocol layers, in accordance with their 
peer entities in the network, support continuous access to IP-based applications. Different 
protocol boosters can further enhance the efficiency of the protocol stack. There is a need
to provide the best possible IP performance over wireless links, including legacy systems. 




Figure 1.2 Architecture of a terminal that operates in a composite radio environment.
Figure 1.3 (a) Architecture of a terminal that operates in the reconfigurability context.
(b) Fixed spectrum allocation compared to contiguous and fragmented DSA.
(c) DSA operation configurations: (1) static (current spectrum allocations);
(2) continuous DSA operations; (3) discrete DSA operations.
Within the performance implications of link characteristics (PILC) IETF group, the concept
of a performance-enhancing proxy (PEP) [9-12] has been chosen to refer to a set of
methods used to improve the performance of Internet protocols on network paths where 
native TCP/IP performance is degraded due to the characteristics of a link. Different types 
of PEPs, depending on their basic functioning, are also distinguished. Some of them try to 
compensate for the poor performance by modifying the protocols themselves. In contrast, 
a symmetric/asymmetric boosting approach, transparent to the upper layers, is often both 
more efficient and flexible. A common framework to house a number of different protocol 
boosters provides high flexibility, as it may adapt to both the characteristics of the traffic 
being delivered and the particular conditions of the links. In this sense, a control plane for 
easing the required information sharing (cross-layer communication and configurability) 
is needed. Furthermore, another requirement comes from the appearance of multihop com- 
munications as PEPs have traditionally been used over the last hop, so they should be 
adapted to the multihop scenario. Most communications networks are subject to time and 
regional variations in traffic demands, which lead to variations in the degree to which the 
spectrum is utilized. Therefore, a service's radio spectrum can be underused at certain 
times or geographical areas, while another service may experience a shortage at the same 
time/place. Given the high economic value placed on the radio spectrum and the importance 
of spectrum efficiency, it is clear that wastage of radio spectrum must be avoided. These 
issues provide the motivation for a scheme called dynamic spectrum allocation (DSA), 
which aims to manage the spectrum utilized by a converged radio system and share it 
between participating radio networks over space and time to increase overall spectrum
efficiency, as shown in Figure 1.3(b, c). 
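
As a rough numerical illustration of the motivation for DSA, the following Python sketch compares a fixed, equal split of a shared band with an idealized dynamic split in which spare channels of one network can be used by the other. The band size, the demand profiles and the function names are assumptions chosen for illustration only; they do not represent any particular DSA scheme.

```python
def unmet_fixed(demand_a, demand_b, total):
    """Unserved demand when each network permanently owns half of the band."""
    share = total // 2
    return sum(max(0, a - share) + max(0, b - share)
               for a, b in zip(demand_a, demand_b))


def unmet_dynamic(demand_a, demand_b, total):
    """Unserved demand when the whole band is shared slot by slot (idealized)."""
    return sum(max(0, a + b - total) for a, b in zip(demand_a, demand_b))


# Hypothetical channel demand of two networks over four time slots.
TOTAL_CHANNELS = 10
demand_a = [8, 6, 2, 1]   # busy early, quiet late
demand_b = [1, 3, 7, 9]   # quiet early, busy late

fixed_loss = unmet_fixed(demand_a, demand_b, TOTAL_CHANNELS)
dynamic_loss = unmet_dynamic(demand_a, demand_b, TOTAL_CHANNELS)
print(fixed_loss, dynamic_loss)
```

With these assumed numbers the fixed split leaves 10 channel-slots of demand unserved, while the idealized dynamic split serves all of it, because the two demand peaks occur at different times.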

Composite radio systems and reconfigurability, discussed above, are potential enablers 
of DSA systems. Composite radio systems allow seamless delivery of services through the 
most appropriate access network, and close network cooperation can facilitate the sharing 
not only of services, but also of spectrum. Reconfigurability is also a very important issue, 
since with a DSA system a radio access network could potentially be allocated any frequency 
at any time in any location. It should be noted that the application layer is enhanced with the 
means to synchronize various information streams of the same application, which could be 
transported simultaneously over different RATs. The terminal management system (TMS) 
is essential for providing functionality that exploits the CRE. On the user/terminal side, 
the main focus is on the determination of the networks that provide, in a cost-efficient 
manner, the best QoS levels for the set of active applications. A first requirement is that 
the MS-CRE should exploit the capabilities of the CR infrastructure. This can be done in a 
reactive or proactive manner. Reactively, the MS-CRE reacts to new service area conditions, 
such as the unexpected emergence of hot spots. Proactively, the management system can 
anticipate changes in the demand pattern. Such situations can be alleviated by using alternate 
components of the CR infrastructure to achieve the required capacity and QoS levels. The 
second requirement is that the MS-CRE should provide resource brokerage functionality 
to enable the cooperation of the networks of the CR infrastructure. Finally, parts of the 
MS-CRE should be capable of directing users to the most appropriate networks of the 
CR infrastructure, where they will obtain services efficiently in terms of cost and QoS. To
achieve the above requirements, an MS architecture such as that shown in Figure 1.4 is
required.



Figure 1.4 Architecture of the MS-CRE (labels visible in the diagram: short-term operation, mid-term operation, user and control plane interface).
Figure 1.5 MS-CRE operation scenario (step labels visible in the diagram include 3c, offer request, and 4a, optimization request).

The architecture consists of three main logical entities: 

• monitoring, service-level information and resource brokerage (MSRB); 

• resource management strategies (RMS); 

• session managers (SMs). 

The MSRB entity identifies the triggers (events) that should be handled by the MS-CRE 
and provides corresponding auxiliary (supporting) functionality. The RMS entity provides 
the necessary optimization functionality. The SM entity is in charge of interacting with 
the active subscribed users/terminals. The operation steps and cooperation of the RMS 
components are shown in Figures 1.5 and 1.6, respectively. In order to get an insight into 
the scope and range of possible reconfigurations, we review in Appendix A (please go to 
www.wiley.com/go/glisic) the network and protocol stack architectures [1-58] of the basic 
CRE components as indicated in Figure 1.1. 

Figure 1.6 Cooperation of the RMS components.

1.2 PROTOCOL BOOSTERS

As pointed out in Figure 1.2, one element of reconfiguration in 4G networks is the protocol
booster. A protocol booster is a software or hardware module that transparently improves
protocol performance. The booster can reside anywhere in the network or end systems, and
may operate independently (one-element booster), or in cooperation with other protocol
boosters (multielement booster). Protocol boosters provide an architectural alternative to
existing protocol adaptation techniques, such as protocol conversion. A protocol booster is a
supporting agent that by itself is not a protocol. It may add, delete or delay protocol messages,
but never originates, terminates or converts that protocol. The elements of a multielement
protocol booster may define new protocol messages to exchange among themselves, but these
messages are originated and terminated by protocol booster elements, and are not visible or
meaningful external to the booster. Figure 1.7 shows the information flow in a generic
two-element booster. A protocol booster is transparent to the protocol being boosted. Thus,
the elimination of a protocol booster will not prevent end-to-end communication, as would,
for example, the removal of one end of a conversion [e.g. a transport control protocol/Internet
protocol (TCP/IP) header compression unit [13]]. In what follows we present examples of
protocol boosters.

Figure 1.7 Two-element booster (the diagram distinguishes protocol messages from booster messages).
1.2.1 One-element error detection booster for UDP

UDP has an optional 16-bit checksum field in the header. If it contains the value zero, it 
means that the checksum was not computed by the source. Computing this checksum may 
be wasteful on a reliable LAN. On the other hand, if errors are possible, the checksum 
greatly improves data integrity. A transmitter sending data does not compute a checksum 
for either local or remote destinations. For reliable local communication, this saves the 
checksum computation (at the source and destination). For wide-area communication, the 
single-element error detection booster computes the checksum and puts it into the UDP 
header. The booster could be located either in the source host (below the level of UDP) or 
in a gateway machine. 
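
A minimal Python sketch of such a booster is given below. It assumes IPv4 and the standard UDP header layout (source port, destination port, length, checksum, each 16 bits) with the usual pseudo-header of source address, destination address, a zero byte, protocol number 17 and UDP length. The function names are hypothetical and error handling is omitted.

```python
import struct


def ones_complement_sum(data: bytes) -> int:
    """16-bit one's complement sum with end-around carry."""
    if len(data) % 2:
        data += b"\x00"                      # pad odd-length data with a zero byte
    total = sum(word for (word,) in struct.iter_unpack("!H", data))
    while total >> 16:
        total = (total & 0xFFFF) + (total >> 16)
    return total


def udp_checksum(src_ip: bytes, dst_ip: bytes, segment: bytes) -> int:
    """Checksum over pseudo-header and UDP segment (checksum field must be zero)."""
    pseudo = src_ip + dst_ip + struct.pack("!BBH", 0, 17, len(segment))
    csum = ~ones_complement_sum(pseudo + segment) & 0xFFFF
    return csum or 0xFFFF                    # zero is reserved for "not computed"


def boost_udp(src_ip: bytes, dst_ip: bytes, segment: bytes) -> bytes:
    """Fill in the checksum only if the source left it at zero."""
    if segment[6:8] != b"\x00\x00":
        return segment                       # already protected, pass through
    csum = udp_checksum(src_ip, dst_ip, segment)
    return segment[:6] + struct.pack("!H", csum) + segment[8:]
```

Because the booster only fills in a field that the source left empty, removing it leaves the datagram valid, which is the transparency property required of a protocol booster.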

1.2.2 One-element ACK compression booster for TCP

On a system with asymmetric channel speeds, such as broadcast satellite, the forward (data) 
channel may be considerably faster than the return (acknowledgment, ACK) channel. On 
such a system, many TCP ACKs may build up in a queue, increasing round-trip time, and 
thus reducing the transmission rate for a given TCP window size. The nature of TCP's 
cumulative ACKs means that any ACK acknowledges at least as many bytes of data as any 
earlier ACK. Consequently, if several ACKs are in a queue, it is necessary to keep only the 
ACK that has arrived most recently. A simple ACK compression booster could ensure that
only a single ACK exists in the queue for each TCP connection. (A more sophisticated ACK 
compression booster allows some duplicate ACKs to pass, allowing the TCP transmitter to 
get a better picture of network congestion.) The booster increases the protocol performance 
because it reduces the ACK latency, and allows faster transmission for a given window size. 
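
The simple variant can be sketched as a queue that holds at most one ACK per connection. The sketch below is only an illustration: the connection identifier and the class name are assumptions, ACK numbers are plain integers, and sequence number wraparound is ignored.

```python
from collections import OrderedDict


class AckCompressionQueue:
    """Return-channel queue that keeps only the latest ACK of each connection."""

    def __init__(self):
        self._pending = OrderedDict()        # conn_id -> most recent ACK number

    def enqueue(self, conn_id, ack_no):
        # Overwriting keeps the connection's place in the queue but drops the
        # older ACK, which the newer cumulative ACK makes redundant.
        self._pending[conn_id] = ack_no

    def dequeue(self):
        """Pop the oldest queued connection as a (conn_id, ack_no) pair."""
        return self._pending.popitem(last=False)

    def __len__(self):
        return len(self._pending)
```

For example, if ACKs 100 and 300 for connection "a" arrive around a single ACK for connection "b", only two entries remain queued, and the entry for "a" carries ACK 300.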

1.2.3 One-element congestion control booster for TCP 

Congestion control reduces buffer overflow loss by reducing the transmission rate at the 
source when the network is congested. A TCP transmitter deduces information about net- 
work congestion by examining ACKs sent by the TCP receiver. If the transmitter sees 
several ACKs with the same sequence number, then it assumes that network congestion 
has caused a loss of data messages. If congestion is noted in a subnet, then a congestion 
control booster could artificially produce duplicate ACKs. The TCP transmitter would then
conclude that data messages had been lost because of congestion, and would reduce its window
size, thus reducing the amount of data it injects into the network.

1.2.4 One-element ARQ booster for TCP

TCP uses ARQ to retransmit data unacknowledged by the receiver when a packet loss is 
suspected, such as after a retransmission time-out expires. If we assume the network of 
Figure 1.7 (except that booster B does not exist), then an ARQ booster for TCP: (1) will 
cache packets from host Y; (2) if it sees a duplicate acknowledgment arrive from host X 
and it has the next packet in the cache, then deletes the acknowledgment and retransmits 
the next packet (because a packet must have been lost between the booster and host X); and 
(3) will delete packets retransmitted from host Y that have been acknowledged by host X. 
The ARQ booster improves performance by shortening the retransmission path. A typical 
application would be if host X were on a wireless network and the booster were on the
interface between the wireless and wireline networks. 
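
The three rules above can be sketched in Python as follows. This is a simplified model, not an implementation of TCP: packets are numbered individually rather than by byte offset, an ACK value n is taken to mean that host X expects packet n next, and the class and method names are hypothetical.

```python
class ArqBooster:
    """One-element ARQ booster between host Y (sender) and host X (receiver)."""

    def __init__(self):
        self.cache = {}       # seq -> payload, seen from Y, not yet acked by X
        self.last_ack = -1    # highest cumulative ACK seen from X (-1: none yet)

    def on_data_from_y(self, seq, payload):
        """Return the packet to forward to X, or None if it is deleted."""
        if seq < self.last_ack:
            return None                          # rule 3: already acknowledged
        self.cache[seq] = payload                # rule 1: cache packets from Y
        return (seq, payload)

    def on_ack_from_x(self, ack):
        """Return ('ack', n) to forward to Y, or ('data', seq, payload) for X."""
        if ack == self.last_ack and ack in self.cache:
            # rule 2: duplicate ACK and the wanted packet is cached, so the
            # ACK is deleted and the packet is retransmitted locally.
            return ("data", ack, self.cache[ack])
        if ack > self.last_ack:
            for seq in [s for s in self.cache if s < ack]:
                del self.cache[seq]              # acknowledged, no longer needed
            self.last_ack = ack
        return ("ack", ack)
```

A duplicate ACK is thus answered from the cache at the booster, so the loss is repaired over the short path between the booster and host X instead of the full path back to host Y.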

1.2.5 A forward erasure correction booster for IP or TCP

For many real-time and multicast applications, forward error correction coding is desirable. 
The two-element forward error correcting (FEC) booster uses a packet forward error cor- 
rection code and erasure decoding. The FEC booster at the transmitter side of the network 
adds parity packets. The FEC booster at the receiver side removes the parity packets and 
regenerates missing data packets. The FEC booster can be applied between any two points 
in a network (including the end systems). If applied to IP, then a sequence number booster 
adds sequence number information to the data packets before the first FEC booster. If ap- 
plied to TCP (or any protocol with sequence number information), then the FEC booster 
can be more efficient because: (1) it does not need to add sequence numbers; and (2) it 
could add new parity information on TCP retransmissions (rather than repeating the same 
parities). At the receiver side, the FEC booster could combine information from multiple 
TCP retransmissions for FEC decoding. 
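
The simplest packet-level erasure code is a single XOR parity packet per group, which can regenerate any one missing packet of that group. The Python sketch below uses this code purely as an illustration; the text does not specify which code the booster uses. It assumes equal-length packets and that sequence numbers tell the receiver which position in the group is missing. The function names are hypothetical.

```python
from functools import reduce


def xor_bytes(a: bytes, b: bytes) -> bytes:
    return bytes(x ^ y for x, y in zip(a, b))


def fec_encode(packets):
    """Transmitter-side element: append one XOR parity packet to the group."""
    return list(packets) + [reduce(xor_bytes, packets)]


def fec_decode(received):
    """Receiver-side element.

    `received` is the coded group with a lost packet replaced by None.
    Returns the data packets with the parity packet removed.
    """
    missing = [i for i, p in enumerate(received) if p is None]
    if len(missing) > 1:
        raise ValueError("a single parity packet repairs only one erasure")
    repaired = list(received)
    if missing:
        present = [p for p in received if p is not None]
        repaired[missing[0]] = reduce(xor_bytes, present)   # XOR of the rest
    return repaired[:-1]
```

The XOR of all surviving packets, parity included, equals the missing packet, so the receiver-side element can hand the complete group to the protocol above it without any retransmission.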

1.2.6 Two-element jitter control booster for IP

For real-time communication, we may be interested in bounding the amount of jitter that 
occurs in the network. A jitter control booster can be used to reduce jitter at the expense 
of increased latency. At the first booster element, timestamps are generated for each data 
message that passes. These timestamps are transmitted to the second booster element, which 
delays messages and attempts to reproduce the intermessage interval that was measured by 
the first booster element. 
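
The behaviour of the second element can be sketched as a release-time computation. The Python sketch below assumes integer timestamps in milliseconds and a fixed playout delay added to the first message; both the delay parameter and the function name are illustrative assumptions.

```python
def release_times(stamps, arrivals, playout_delay):
    """Times at which the second booster element releases each message.

    stamps   -- timestamps taken by the first element (ms)
    arrivals -- arrival times at the second element (ms)
    """
    base = arrivals[0] + playout_delay
    out = []
    for stamp, arrival in zip(stamps, arrivals):
        target = base + (stamp - stamps[0])   # reproduce the original spacing
        out.append(max(target, arrival))      # a message cannot leave early
    return out


stamps = [0, 10, 20, 30]        # messages left the first element every 10 ms
arrivals = [5, 22, 26, 41]      # network delay varied between 5 and 12 ms
print(release_times(stamps, arrivals, playout_delay=10))
print(release_times(stamps, arrivals, playout_delay=0))
```

With a 10 ms playout delay the messages are released at 15, 25, 35 and 45 ms, restoring the 10 ms spacing. With no added delay the late messages cannot be held back far enough and the release times equal the arrival times, which illustrates the trade of latency for jitter.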

1.2.7 Two-element selective ARQ booster for IP or TCP

For links with significant error rates, using a selective automatic repeat request (ARQ) 
protocol (with selective acknowledgment and selective retransmission) can significantly 
improve the efficiency compared with using TCP's ARQ (with cumulative acknowledgment
and possibly go-back-N retransmission). The two-element ARQ booster uses a selective
ARQ booster to supplement TCP by: (1) caching packets in the upstream booster;
(2) sending negative acknowledgments when gaps are detected in the downstream booster; 
and (3) selectively retransmitting the packets requested in the negative acknowledgments 
(if they are in the cache). 
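
A minimal Python sketch of the two elements follows. It assumes per-packet sequence numbers starting at zero and an unbounded cache; the class names and the representation of a negative acknowledgment as a list of sequence numbers are hypothetical.

```python
class UpstreamBooster:
    """Caches packets and retransmits the ones named in a NACK."""

    def __init__(self):
        self.cache = {}                         # seq -> payload

    def on_send(self, seq, payload):
        self.cache[seq] = payload               # step 1: cache on the way out

    def on_nack(self, seqs):
        # step 3: retransmit only the requested packets that are still cached
        return [(s, self.cache[s]) for s in seqs if s in self.cache]


class DownstreamBooster:
    """Detects gaps in the sequence numbers and requests the missing packets."""

    def __init__(self):
        self.expected = 0                       # next in-order sequence number

    def on_packet(self, seq):
        """Return the sequence numbers to NACK (empty if there is no new gap)."""
        nacks = list(range(self.expected, seq)) if seq > self.expected else []
        self.expected = max(self.expected, seq + 1)   # step 2: gap detected above
        return nacks
```

If packets 0, 1 and 4 arrive downstream, the arrival of packet 4 produces a NACK for packets 2 and 3, and the upstream element resends exactly those two packets rather than everything from packet 2 onward.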

1.3 HYBRID 4G WIRELESS NETWORK PROTOCOLS

As indicated in Appendix A (please go to www.wiley.com/go/glisic), there are two basic 
types of structure for WLAN: 

(1) Infrastructure WLAN - BS-oriented network. Single-hop (or cellular) networks that 
require fixed base stations (BS) interconnected by a wired backbone. 
(2) Noninfrastructure WLAN - ad hoc WLAN. Unlike the BS-oriented network, which
has BSs providing coverage for mobile hosts (MHs), ad hoc networks do not have 
any centralized administration or standard support services regularly available on 
the network to which the hosts may normally be connected. MHs depend on each 
other for communication. 



The BS-oriented network is more reliable and has better performance. However, the ad 
hoc network topology is more desirable because of its low cost, plug-and-play property, 
flexibility, minimal human interaction requirements, and especially battery power efficiency. 
It is suitable for communication in a closed area - for example, on a campus or in a 
building. 

To combine their strengths, possible 4G concepts might prefer to add BSs to an ad hoc
network. To save access bandwidth and battery power and have fast connection, the MHs 
could use an ad hoc wireless network when communicating with each other in a small area. 
When the MHs move out of the transmitting range, the BS could participate at this time and 
serve as an intermediate node. The proposed method also solves some problems, such as 
a BS failure or weak connection under ad hoc networks. The MHs can communicate with 
one another in a flexible way and freely move anywhere with seamless handoff. 

There have been many techniques or concepts proposed for supporting a WLAN with
and without infrastructure, such as IEEE 802.11 [14], HIPERLAN [15], and ad hoc WATM
LAN [16]. The standardization activities in IEEE 802.11 and HIPERLAN have recognized
the usefulness of the ad hoc networking mode. IEEE 802.11 adds the ad hoc function to
the MH. HIPERLAN combines the functions of two infrastructures into the MH. Contrary
to IEEE 802.11 and HIPERLAN, the ad hoc WATM LAN concept is based on the same
centralized wireless control framework as the BS-oriented system, but ensures that an MH
designed for the BS-oriented system can also participate in ad hoc networking. Both the
BS-oriented and ad hoc networks have some drawbacks. In the BS-oriented networks, the BS
manages all the MHs within the cell area and controls handoff procedures. It plays a very 
important role for WLAN. If it does not work, the communication of MHs in this area will 
be disrupted. Under this situation, some MHs could still transmit messages to each other 
without BS. Therefore, to increase the reliability and efficiency of the BS-oriented network, 
MH-to-MH direct transmission capability can be added. However, this is restricted to at 
most two hops such that this new enhancement will not increase the protocol complexity 
too much. 

In the ad hoc networks, it is not easy to rebuild or maintain a connection. When the 
connection is built, it will be disrupted any time one MH moves out of the connection 
range. So, as a compromise, the MHs could communicate with each other over the wireless 
media, without any support from the infrastructure network components within the signal 
transmission range. Yet when the transmission range is less than the distance between the 
two MHs, the MHs could change back to the BS-oriented systems. MH would be able to 
operate in both ad hoc and BS-oriented WLAN environments. 

Two different methods - one-hop and two-hop direct transmission within the BS-oriented 
concept - will be considered. The first method is simple and controlled by the signal strength. 
The second method should include the data forwarding and implementation of routing 
tables. 
1.3.1 Control messages and state transition diagrams

To integrate the BS-oriented method and the direct transmission method, we define some 
control messages: 

(1) ACK/ACCEPT/REJECT - used to indicate the acknowledgment, acceptance, or
denial of connection or handoff request; 

(2) CHANGE - used by MH to inform the sender to initiate the handoff procedure; 

(3) DIRECT - used by MH to inform BS that the transmission is in direct transmission 
mode; 

(4) SEARCH - used to find the destination; each MH receiving this message must check 
the destination address for a match; 

(5) SETUP - used to establish a new connection; 

(6) TEARDOWN - used for switching from BS-oriented handoff to direct transmission;
it will let BS release the channel and buffer; 

(7) AGENT - used by an MH whose BS has failed, to accept another MH as a surrogate
for transmission;

(8) BELONG - used by a surrogate MH to accept another MH's WHOSE-BS-ALIVE 
request; 

(9) WHOSE-BS-ALIVE - used by the MH whose BS failed to find a surrogate MH. 

Since a mobile host may be in BS-oriented, one-hop direct-transmission mode or two- 
hop direct-transmission mode, it is important that we understand the timing for mode 
transition. 

Figure 1.8 shows the state transition diagram. The meaning and timing of each transition 
are explained below: 

(1) The receiver can receive the sender's signal directly. 

(2) The receiver is a neighbor of a neighbor of the sender. 

(3) Neither case 1 nor 2. 

(4) The receiver can no longer hear the sender's signal; however, a neighbor of the sender 
can communicate with the receiver directly. 

(5) The receiver discovers that it can hear the sender's signal directly. 

(6) The receiver can no longer hear the sender's signal, and none of the sender's neigh- 
bors can communicate directly with the receiver. 

(7) The receiver discovers that it can hear the sender's signal directly. 

(8) No neighbors of the sender can communicate with the receiver directly. 

(9) The sender's original relay neighbor fails. However, the sender can find another 
neighbor that can communicate with the receiver directly. 







Figure 1.8 Transition diagram for transmission mode. 

(10) The handoff from one BS to another occurs.

In Figure 1.8, we note the following two points: (a) when a mobile host starts
communication, it could be in any mode depending on the position of the receiving
mobile host; and (b) the transition from the BS-oriented to two-hop direct-transmission
mode is not possible because the communicating party cannot know that a third mobile
host exists and is within range.
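
The ten transitions can be written as a small table. The Python sketch below is our reading of the list above, since the diagram itself is not reproduced here: the source and target state of each numbered transition are inferred from its description, and the assignment of the identically worded transitions (5) and (7) to the two-hop and BS-oriented modes, respectively, is an assumption. The state names and the function name are hypothetical.

```python
START, ONE_HOP, TWO_HOP, BS = "start", "one-hop", "two-hop", "BS-oriented"

# transition number -> (mode before, mode after); inferred, see the note above
TRANSITIONS = {
    1: (START, ONE_HOP),
    2: (START, TWO_HOP),
    3: (START, BS),
    4: (ONE_HOP, TWO_HOP),
    5: (TWO_HOP, ONE_HOP),
    6: (ONE_HOP, BS),
    7: (BS, ONE_HOP),
    8: (TWO_HOP, BS),
    9: (TWO_HOP, TWO_HOP),   # relay neighbor replaced by another neighbor
    10: (BS, BS),            # handoff from one BS to another
}


def next_mode(mode, transition):
    """Apply a numbered transition, rejecting it if it does not start in `mode`."""
    source, target = TRANSITIONS[transition]
    if source != mode:
        raise ValueError(f"transition {transition} does not apply in mode {mode}")
    return target
```

The table contains no entry from the BS-oriented mode to the two-hop mode, which corresponds to point (b) above.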



1.3.2 Direct transmission 

Direct transmission defines the situation where two mobile hosts communicate directly or 
use a third mobile host as a relay without the help of base stations. This section considers 
the location management and handoff procedures when the MH moves around. These func- 
tions are almost the same as the traditional ones. However, the system must decide whether 
one-hop direct transmission, two-hop direct transmission or BS-oriented transmission 
method should be used. When the sender broadcasts the connection request message, both 
the BS and the MH within the sender's signal covering area receive this message. Each 
MH receiving the message checks the destination ID. If the destination ID matches itself, 
the transmission uses the one-hop direct-transmission method. (If we allow two-hop direct 
transmission, each receiving mobile host must check its neighbor database to see if the 
destination is currently a neighbor of itself.) Otherwise, the BS is used for connection. 
When the destination moves out of the covering area, the BS has to take over. On the other 
hand, when the MH moves into the covering range, the receiver has the option to stop going
through the BS and change to one-hop direct transmission.
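
The decision described in this section can be summarized in a few lines of Python. In the sketch below the set of hosts within the sender's range and the neighbor database are passed in as plain data structures; these representations, the function name and the rule of trying relays in sorted order are assumptions made for illustration.

```python
def choose_mode(destination, in_range, neighbor_db, allow_two_hop=True):
    """Return (mode, relay) for a new connection request.

    in_range    -- set of MH identifiers that hear the sender's broadcast
    neighbor_db -- maps each MH identifier to the set of its own neighbors
    """
    if destination in in_range:
        return ("one-hop", None)                 # destination ID matched
    if allow_two_hop:
        for relay in sorted(in_range):
            if destination in neighbor_db.get(relay, set()):
                return ("two-hop", relay)        # a neighbor can relay
    return ("BS-oriented", None)                 # otherwise the BS is used
```

For instance, with hosts B and C in range and host D known only to C, a request for B is served in one-hop mode, a request for D is relayed through C, and a request for an unknown host E falls back to the BS.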






Figure 1.9 Using one-hop direct-transmission mode (legend: direct transmission message, BS-oriented message).

1.3.3 The protocol for one-hop direct transmission

For one-hop direct-transmission mode, each case of protocol operations is described in 
more detail as follows: 

1.3.3.1 One-hop direct-transmission mode

(1) The sender broadcasts a SEARCH message. Every node in the signal covering range 
(including the BS) receives the message, as shown in Figure 1.9. 

(2) If the receiver is within the range, it receives the message and finds out the destination 
is itself. It responds with the message ACK back to the sender. 

(3) At the same time, the BS also receives the SEARCH message. It locates the MH and
sends the SETUP message to the destination. For direct transmission, the destination
receives the SETUP message and sends the DIRECT message to the BS. Otherwise, it
sends an ACCEPT message to the BS, and the communication proceeds in BS-oriented
mode.

(4) The sender continues transmitting directly until the MH moves out of the covering 
area. 

1.3.3.2 BS-oriented mode

(1) The sender broadcasts a SEARCH message. If the receiver is out of the covering 
range, it will not receive the message, as in Figure 1.10. (It is possible that two-hop 
direct-transmission mode can be used. This will be explained later.) 

(2) However, the BS of the sender always receives the SEARCH message. It queries the 
receiver's position and sends the SETUP message to the destination. 

(3) When the destination receives the SETUP message, it sends an ACCEPT message
to the BS.

(4) The communication continues in BS-oriented mode until the two MHs are close
enough and the receiver wants to change to direct-transmission mode.

Figure 1.10 Using the BS-oriented mode.

1.3.3.3 Handoff - out of direct transmission range 

In direct transmission mode, when the destination detects that the strength of the signal is 
less than an acceptable value, handoff should be executed. The procedures are described as 
follows: 

(1) The destination sends the CHANGE message to its sender. 

(2) When the sender receives the CHANGE request, it sends out the SEARCH message
again. The connection then continues in BS-oriented mode or in two-hop
direct-transmission mode, as explained later.

1.3.3.4 Handoff - BS-oriented mode to one-hop direct transmission

If the receiver detects that it is within the covering region of the sender's signal and the 
signal is strong enough, it has the option of switching from the BS-oriented mode to one-hop 
direct-transmission mode. Each step is described below and presented in Figure 1.11. 

(1) The sender sends out the SEARCH message, and the one-hop direct-transmission
connection is established.

(2) After the sender receives the ACCEPT message from the receiver, it sends a 
TEARDOWN message to the BS and breaks the connection along the path. 

Figure 1.11 BS-oriented handoff to one-hop direct transmission.

1.3.4 Protocols for two-hop direct-transmission mode

Two-hop direct-transmission mode covers a wider area than one-hop direct-transmission
mode. It allows two mobile hosts to communicate through a third mobile host acting as a
relay. Therefore, each mobile host must implement a neighbor database to record its current
neighbors. (A neighbor of a mobile host is another mobile host that it can reach directly
by radio, without the help of base stations.) Furthermore, we must handle the case when the
two-hop direct-transmission connection is disrupted because of mobility, for example, when
the relay MH or the destination moves out of range. With one-hop or two-hop direct
transmission, the system reliability is increased since some mobile hosts can still
communicate with others even if their base stations fail. However, we still limit the
number of hops in direct transmission to two for the following reasons:

(1) Routing becomes complicated in direct transmission over three or more hops. In
multihop direct communications, handling many routing paths wastes bandwidth on
exchanging routing information and time stamps, avoiding routing update loops, and
so on.

(2) With at most two hops, the problems of ad hoc networks, such as routing and
connection maintenance, are more manageable.

(3) If we allow three-hop direct transmission, the number of links in the air (at least
three), not counting the exchange of routing information, will be larger than in the
BS-oriented mode (always two). In the long run, the battery power consumption will
be higher than in the BS-oriented mode.

1.3.4.1 The neighbor database

In two-hop direct-transmission mode, each MH maintains a simple database (see Table 1.1) 
to store the information of neighboring MHs within its radio covering area. Each mobile 
host must broadcast periodically to inform the neighboring MHs of its related information. 
For example, the neighbor database of MH1 in Figure 1.12 is shown in Table 1.1. In the 
table, the BS-down field indicates whether or not the neighboring mobile host can detect 
a nearby base station. A value True means that the mobile host cannot connect to a base 
station.

Table 1.1 The neighbor database for MH1 in Figure 1.13

Neighbor ID    BS-down
MH2            False
MH4            False
MH5            False
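As an illustration, the neighbor database can be sketched as a small Python class that is refreshed by the periodic broadcasts. The class name `NeighborDatabase`, its method names and the `lifetime` parameter are assumptions made for this sketch only; the text states that each MH broadcasts periodically but does not specify when a silent neighbor is removed.

```python
class NeighborDatabase:
    """Neighbor table of one MH, refreshed by periodic broadcasts (sketch)."""

    def __init__(self, lifetime=3.0):
        # Assumed rule: an entry expires if no broadcast is heard for `lifetime` seconds.
        self.lifetime = lifetime
        self._entries = {}  # neighbor ID -> (bs_down, time of last broadcast)

    def on_beacon(self, neighbor_id, bs_down, now):
        """Record or refresh a neighbor when its periodic broadcast is received."""
        self._entries[neighbor_id] = (bs_down, now)

    def purge(self, now):
        """Drop neighbors that have not been heard within the lifetime."""
        stale = [n for n, (_, t) in self._entries.items() if now - t > self.lifetime]
        for n in stale:
            del self._entries[n]

    def is_neighbor(self, neighbor_id):
        return neighbor_id in self._entries

    def rows(self):
        """Return the table as sorted (Neighbor ID, BS-down) pairs."""
        return sorted((n, bs_down) for n, (bs_down, _) in self._entries.items())


db = NeighborDatabase()
for mh in ("MH2", "MH4", "MH5"):
    db.on_beacon(mh, bs_down=False, now=0.0)
print(db.rows())
```

The `is_neighbor` lookup is the check a receiving MH performs on the destination ID when two-hop direct transmission is allowed.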




Figure 1.12 Two-hop direct-transmission zone. 
1.3.4.2 Two-hop direct-transmission mode 

This situation applies when the sender and destination are both within an intermediate's 
coverage area. The sender transmits the data to the destination through an intermediate MH. 
The connection setup procedures are as follows (see Figure 1.13): 

(1) If an MH is within the transmission range, it receives the sender's message. There are
two cases: (a) if the receiving MH finds that it is the destination, it sends the ACCEPT
message back to the sender, and the connection is a one-hop direct transmission;
(b) if it is not the destination, the mobile host checks its neighbor database.
If the destination is in the database, it forwards the SETUP message to the destination
to set up the connection. After the destination accepts the connection setup, it sends
ACCEPT back to the sender.

(2) If the destination receives many copies of SETUP, it only accepts the first one. The 
redundant messages are discarded. The other candidate intermediate nodes will give 
up after timeout. 

(3) At the same time, the BS receives the SEARCH message. It queries the MH and 
sends the SETUP message to the destination. For direct transmission, the destination 
receives the SETUP message and sends the DIRECT message to its BS. Otherwise, 
the destination accepts the connection from the base station. 



(4) The communication continues directly until the transmission path is broken.

Figure 1.13 Two-hop direct-transmission mode.



1.3.4.3 Handoff - two-hop direct-transmission mode to two-hop
direct-transmission mode, one-hop direct-transmission mode or
BS-oriented mode

If the destination or the intermediate node finds that the strength of the signal is less than 
a critical value, the handoff procedure is requested and executed at this time. The system 
will try to find another direct transmission path. If a direct transmission path is not found, 
the BS-oriented mode will take over. The handoff procedures are as follows: 

(1) The destination or intermediate node sends the CHANGE message to the sender for 
changing connection. 

(2) When the sender receives the CHANGE request, it reinitiates the connection by
sending out SEARCH. The next several steps are the same as in the initial connection
setup.



1.3.4.4 How to solve the problem of BS failure 

The method is also robust against BS failure in the middle of a connection. In Figure 1.14, 
when MH3 finds that its BS (BS2) failed, it performs the following steps until its BS is alive 
again (see Figure 1.15 for message flow). 

(1) MH3 broadcasts the WHOSE-BS-ALIVE message to the neighbors. 

(2) If one MH, say MH2, receives the message and its BS is still alive, it records the
sender ID and sends the BELONG message back to the sender.

(3) MH3 receives the BELONG message and records MH2's ID. Then it sends the
AGENT message to MH2.

(4) After the agent (MH2) receives the AGENT message, it registers MH3's location
with its own BS (BS1) on behalf of MH3.

(5) MH2 now relays the information to and from MH3. When MH2 or MH3 leaves the
other's coverage area, MH3 gives up the current agent and repeats steps 1-4 to find
another agent. MH2 then removes the registration of MH3 from BS1.

Figure 1.14 The BS failure problem.

Figure 1.15 MH whose BS failed uses neighbor as an agent.
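The agent search in steps (1)-(4) can be traced with the following Python sketch. It is a simplified illustration, not the protocol definition: the function `find_agent` and its data structures are hypothetical, the message name REGISTER is invented here because the text does not name the registration message, and the first neighbor with a live BS is taken as the agent.

```python
def find_agent(mh, neighbors, bs_alive, serving_bs):
    """Trace the agent search for an MH whose BS has failed (illustrative only).

    mh         -- ID of the MH whose BS failed
    neighbors  -- IDs of the MHs that hear its broadcast, in order of response
    bs_alive   -- dict: BS ID -> True if that BS is working
    serving_bs -- dict: MH ID -> ID of the BS serving that MH
    Returns (agent, bs, log); agent and bs are None if no neighbor can help.
    """
    log = [(mh, "broadcast", "WHOSE-BS-ALIVE")]           # step (1)
    for candidate in neighbors:
        bs = serving_bs[candidate]
        if bs_alive.get(bs, False):
            log.append((candidate, mh, "BELONG"))         # step (2)
            log.append((mh, candidate, "AGENT"))          # step (3)
            # Step (4): the agent registers the MH with its own BS.
            # "REGISTER" is a hypothetical message name.
            log.append((candidate, bs, "REGISTER " + mh))
            return candidate, bs, log
    return None, None, log


agent, bs, log = find_agent(
    "MH3", ["MH4", "MH2"],
    bs_alive={"BS1": True, "BS2": False},
    serving_bs={"MH4": "BS2", "MH2": "BS1"},
)
for sender, receiver, message in log:
    print(sender, "->", receiver, ":", message)
```

When MH2 or MH3 leaves the other's coverage area, the same function would simply be called again with the updated neighbor list, which corresponds to repeating steps 1-4.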
1.4 GREEN WIRELESS NETWORKS

4G wireless networks might use spatial notching (angle α) to completely suppress
antenna radiation towards the user, as illustrated in Figures 1.16 and 1.17. These solutions
will be referred to as 'green wireless networks' for obvious reasons. In a mobile environment,
in the periods when the notch coincides with the direction of the base station (access point),
the multihop protocol discussed in the previous section can be used. In addition, to reduce
the overall transmit power, cooperative transmit diversity, discussed in Section 19.4, and
an adaptive MAC protocol, discussed in Appendix B (please go to www.wiley.com/go/glisic),
can be used.




Figure 1.16 Three-dimensional amplitude patterns of a two-element uniform amplitude
array for $d = 2\lambda$, directed towards (a) $\theta_0 = 0°$, (b) $\theta_0 = 60°$.

Figure 1.17 Three-dimensional amplitude patterns of a 10-element uniform amplitude
array for $d = \lambda/4$, directed towards (a) $\theta_0 = 0°$, (b) $\theta_0 = 30°$,
(c) $\theta_0 = 60°$, (d) $\theta_0 = 90°$.






REFERENCES 

[1] M. Mouly and M.-B. Pautet. The GSM System for Mobile Communications. Palaiseau,
France, 1992.

[2] R. Kalden, I. Meirick and M. Meyer. Wireless internet access based on GPRS. IEEE
Pers. Commun., vol. 7, no. 2, 2000, pp. 8-18.

[3] 3rd Generation Partnership Project (3GPP), www.3gpp.org

[4] J. Khun-Jush, P. Schramm, G. Malmgren and J. Torsner. HiperLAN2: broadband
wireless communications at 5 GHz. IEEE Commun. Mag., vol. 40, no. 6, 2002,
pp. 130-137.

[5] U. Varshney. The status and future of 802.11-based WLANs. IEEE Comput., vol. 36,
no. 6, 2003, pp. 102-105.

[6] Digital Video Broadcasting (DVB), www.dvb.org, January 2002.

[7] S. Glisic. Advanced Wireless Communications: 4G Technology. John Wiley & Sons:
Chichester, 2004.

[8] J. Mitola III and G. Maguire Jr. Cognitive radio: making software radios more personal.
IEEE Pers. Commun., vol. 6, no. 4, 1999, pp. 13-18.

[9] J. Border et al. Performance enhancing proxies intended to mitigate link-related
degradations. RFC 3135, June 2001.

[10] D.C. Feldmeier, A.J. McAuley, J.M. Smith, D.S. Bakin, W.S. Marcus and T.M. Raleigh.
Protocol boosters. IEEE JSAC, vol. 16, no. 3, 1998, pp. 437-444.

[11] M. Garcia et al. An experimental study of Snoop TCP performance over the IEEE
802.11b WLAN. 5th Int. Symp. Wireless Personal Multimedia Commun., Honolulu,
HI, vol. III, October 2002, pp. 1068-1072.

[12] L. Munoz, M. Garcia, J. Choque, R. Aguero and P. Mahonen. Optimizing internet flows
over IEEE 802.11b wireless local area networks: a performance enhancing proxy based
on forward error correction. IEEE Commun. Mag., vol. 39, no. 12, 2001, pp. 60-67.

[13] V. Jacobson. TCP/IP compression for low-speed serial links. RFC 1144, February
1990.

[14] Wireless LAN. IEEE Draft Standard P802.11, January 1996.

[15] Radio equipment and systems (RES); High performance radio local area network
(HIPERLAN); Functional specification. ETSI, France, Draft prETS 300 652, 1995.

[16] D. Evans, Y. Du, C. Herrmann, S.N. Hulyalkar and P. May. Wireless ATM LAN with
and without infrastructure. In 2nd IEEE Int. Workshop Broadband Switching Systems,
Taipei, Taiwan, 1997, pp. 120-128.

[17] 3GPP Technical Specification 25.401 UTRAN Overall Description.

[18] 3GPP Technical Specification 25.410 UTRAN Iu Interface: General Aspects and
Principles.

[19] 3GPP Technical Specification 25.411 UTRAN Iu Interface: Layer 1.

[20] 3GPP Technical Specification 25.412 UTRAN Iu Interface: Signalling Transport.

[21] 3GPP Technical Specification 25.413 UTRAN Iu Interface: RANAP Signalling.

[22] 3GPP Technical Specification 25.414 UTRAN Iu Interface: Data Transport and
Transport Signalling.

[23] 3GPP Technical Specification 25.415 UTRAN Iu Interface: CN-RAN User Plane
Protocol.

[24] 3GPP Technical Specification 25.420 UTRAN Iur Interface: General Aspects and
Principles.

[25] 3GPP Technical Specification 25.421 UTRAN Iur Interface: Layer 1.

[26] 3GPP Technical Specification 25.422 UTRAN Iur Interface: Signalling Transport.

[27] 3GPP Technical Specification 25.423 UTRAN Iur Interface: RNSAP Signalling.

[28] 3GPP Technical Specification 25.424 UTRAN Iur Interface: Data Transport and
Transport Signalling for CCH Data Streams.

[29] 3GPP Technical Specification 25.425 UTRAN Iur Interface: User Plane Protocols for
CCH Data Streams.

[30] 3GPP Technical Specification 25.426 UTRAN Iur and Iub Interface Data Transport
and Transport Signalling for DCH Data Streams.

[31] 3GPP Technical Specification 25.427 UTRAN Iur and Iub Interface User Plane
Protocols for DCH Data Streams.

[32] 3GPP Technical Specification 25.430 UTRAN Iub Interface: General Aspects and
Principles.

[33] 3GPP Technical Specification 25.431 UTRAN Iub Interface: Layer 1.

[34] 3GPP Technical Specification 25.432 UTRAN Iub Interface: Signalling Transport.

[35] 3GPP Technical Specification 25.433 UTRAN Iub Interface: NBAP Signalling.

[36] 3GPP Technical Specification 25.434 UTRAN Iub Interface: Data Transport and
Transport Signalling for CCH Data Streams.

[37] 3GPP Technical Specification 25.435 UTRAN Iub Interface: User Plane Protocols for
CCH Data Streams.

[38] 3G TS 25.301 Radio Interface Protocol Architecture.

[39] 3G TS 25.302 Services Provided by the Physical Layer.

[40] 3G TS 25.303 UE Functions and Interlayer Procedures in Connected Mode.

[41] 3G TS 25.304 UE Procedures in Idle Mode.

[42] 3G TS 25.321 MAC Protocol Specification.

[43] 3G TS 25.322 RLC Protocol Specification.

[44] 3G TS 25.323 PDCP Protocol Specification.

[45] 3G TS 25.324 Broadcast/Multicast Control Protocol (BMC) Specification.

[46] 3G TS 25.331 RRC Protocol Specification.

[47] 3G TS 24.008 Mobile Radio Interface Layer 3 Specification, Core Network
Protocols - Stage 3.

[48] 3G TS 33.102 3G Security; Security Architecture.

[49] GSM 04.18 Digital Cellular Telecommunications System (Phase 2+); Mobile Radio
Interface Layer 3 Specification, Radio Resource Control Protocol.

[50] IETF RFC 2507 IP Header Compression.

[51] 3G TS 25.305 Stage 2 Functional Specification of Location Services in UTRAN.

[52] 3G TS 33.105 3G Security; Cryptographic Algorithm Requirements.

[53] G. Armitage and K. Adams. Packet reassembly during cell loss. IEEE Network,
vol. 7, no. 5, 1995, pp. 26-34.

[54] U. Black. ATM Volume I: Foundation for Broadband Networks. Prentice-Hall: Upper
Saddle River, NJ, 1992.

[55] M. Garrett. A service architecture for ATM: from applications to scheduling. IEEE
Network, vol. 10, no. 3, 1996, pp. 6-14.

[56] D. McDysan and D. Spohn. ATM: Theory and Application. McGraw-Hill: New York,
1999.

[57] K. Sato, S. Ohta and I. Tokizawa. Broad-band ATM network architecture based on
virtual paths. IEEE Trans. Commun., vol. 38, no. 8, 1990, pp. 1212-1222.

[58] T. Suzuki. ATM adaptation layer protocol. IEEE Commun. Mag., vol. 32, no. 4, 1994,
pp. 80-83.



2

Physical Layer and Multiple Access

In this chapter we will briefly summarize the signal formats used in the existing wireless 
systems and point out possible ways of evolution towards the 4G system. The focus will be 
on ATDMA, WCDMA, OFDMA, MC CDMA and UWB signals [1-54]. 

2.1 ADVANCED TIME DIVISION MULTIPLE ACCESS (ATDMA)

In a TDMA system each user is assigned a dedicated time slot within a TDMA frame, as
shown in Figure 2.1 for GSM and in Figure 2.2 for ADC (American Digital Cellular system).
Additional data about the signal format and system capacity are given in Tables 2.1 and
2.2. The evolution of the ADC system resulted in the TIA (Telecommunications Industry
Association) universal wireless communications (UWC) standard 136. The basic system
parameters are summarized in Table 2.3. The evolution of GSM resulted in a system known
as EDGE, with parameters also summarized in Table 2.3.

If TDMA is chosen for 4G, the signal formats are further enhanced by using
multidimensional trellis (space-time-frequency) coding and advanced signal processing [54].
This is also combined with the orthogonal frequency division multiplexing (OFDM) and
multicarrier code division multiple access (MC CDMA) signal formats described below.

2.2 CODE DIVISION MULTIPLE ACCESS 

The code division multiple access (CDMA) technique is based on spreading the spectrum of
the relatively narrow information signal $S_n$ by a code $c$ generated at a much higher clock
(chip) rate. Different users are separated by using different uncorrelated codes. As an
example, the narrowband signal in this case can be a PSK signal of the form

$$S_n = b(t, T_m) \cos \omega t \qquad (2.1)$$

where $1/T_m$ is the bit rate and $b = \pm 1$ is the information. The baseband equivalent of
Equation (2.1) is

$$S_n^b = b(t, T_m) \qquad (2.1a)$$

Figure 2.1 Digital cellular TDMA systems: GSM slot and frame structure showing 156.25
bits/time slot (0.577 ms), eight time slots/TDMA frame (full rate) and 13 TDMA
frames/multiframe (TB = tail bits, GP = guard period, SF = stealing flag).

Figure 2.2 ADC slot and frame structure for down- and uplink with 324 bits/time slot
(6.67 ms) and 3(6) time slots/TDMA frame for full-rate (half-rate) (G = guard
time, R = ramp-up time, RSVD = reserved bits).
Table 2.1 TDMA system parameters

                             Europe (ETSI)    North America (TIA)   Japan (MPT)
Access method                TDMA             TDMA                  TDMA
Carrier spacing              200 kHz          30 kHz                25 kHz
Users per carrier            8 (16)           3 (6)                 3 (tbd)
Modulation                   GMSK             π/4 DPSK              π/4 DPSK
Voice codec                  RPE 13 kb/s      VSELP 8 kb/s          tbd
Voice frame                  20 ms            20 ms                 20 ms
Channel code                 Convolutional    Convolutional         Convolutional
Coded bit rate               22.8 kb/s        13 kb/s               11.2 kb/s
TDMA frame duration          4.6 ms           20 ms                 20 ms
Interleaving                 ~40 ms           27 ms                 27 ms
Associated control channel   Extra slot       In slot               In slot
Handoff method               MAHO             MAHO                  MAHO

ETSI, European Telecommunications Standards Institute; MPT, Mobile portable terminal;
TDMA, time division multiple access.






Table 2.2 Approximate capacity in Erlang per km², assuming a cell radius of 1 km (site
distance of 3 km) in all cases and three sectors per site. The Lee-Merit is the number
of channels per site assuming an optimal reuse plan

                            Analog    GSM           GSM          ADC           ADC          JDC           JDC
                            FM        pessimistic   optimistic   pessimistic   optimistic   pessimistic   optimistic
Bandwidth                   25 MHz    25 MHz                     25 MHz                     25 MHz
Number of voice channels    833       1000                       2500                       3000
Reuse plan                  7         4             3            7             4            7             4
Channels/site               119       250           333          357           625          429           750
Erlang/km²                  11.9      27.7          40.0         41.0          74.8         50.0          90.8
Capacity gain               1.0       2.3           3.4          3.5           6.3          4.2           7.6
(Lee-Merit gain)            (1.0)     (2.7)         (3.4)        (3.8)         (6.0)        (4.0)         (7.2)
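The 'Channels/site' row of Table 2.2 follows directly from the number of voice channels and the reuse plan. The short Python sketch below reproduces that row; the function name `channels_per_site` is introduced here for illustration, and the analog reuse factor of 7 is inferred from 833/119 rather than read from a clean table entry.

```python
def channels_per_site(voice_channels, reuse):
    """Voice channels available at one site for a given frequency reuse factor."""
    return round(voice_channels / reuse)


# (system, number of voice channels, reuse factor) taken from Table 2.2
cases = [
    ("Analog FM", 833, 7),
    ("GSM pessimistic", 1000, 4),
    ("GSM optimistic", 1000, 3),
    ("ADC pessimistic", 2500, 7),
    ("ADC optimistic", 2500, 4),
    ("JDC pessimistic", 3000, 7),
    ("JDC optimistic", 3000, 4),
]

for name, channels, reuse in cases:
    print(f"{name:16s} {channels_per_site(channels, reuse):4d} channels/site")
```

The Erlang/km² row additionally depends on the blocking probability and the site area, which are not given in the table, so it is not recomputed here.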



The spreading operation, represented symbolically by the operator $\varepsilon(\,)$, is
obtained if we multiply the narrowband signal by a pseudonoise (PN) sequence (code)
$c(t, T_c) = \pm 1$. The bits of the sequence are called chips, and the chip rate
$1/T_c \gg 1/T_m$. The wideband signal can be represented as

$$S_w = \varepsilon(S_n) = c S_n = c(t, T_c)\, b(t, T_m) \cos \omega t \qquad (2.2)$$

The baseband equivalent of Equation (2.2) is

$$S_w^b = c(t, T_c)\, b(t, T_m) \qquad (2.2a)$$
Table 2.3 Parameters of UWC-136 and EDGE signal

Key characteristic               TIA UWC-136                               GSM radio interface (for reference only)
Multiple access                  TDMA                                      TDMA
Bandwidth                        30/200/1600 kHz                           200 kHz
Bit rate                         48.6 kb/s, 72.9 kb/s, 270.8 kb/s,         270.8 kb/s (for EDGE 812.5 kb/s)
                                 361.1 kb/s, 722.2 kb/s, 2.6 Mb/s,
                                 5.2 Mb/s
Duplexing                        FDD/TDD                                   FDD
Carrier spacing                  30/200/1600 kHz                           200 kHz
Inter BS timing                  Asynchronous (synchronization possible)   Asynchronous (synchronization possible)
Inter-cell synchronization       Not required                              Not required
Base station synchronization     Not required                              Not required
Cell search scheme               L1 power-based, L2 parameter-based,       L1 power-based, L2 parameter-based,
                                 L3 service/network/operator-based         L3 service/network/operator-based
Frame length                     40/40/4.6/4.6 ms                          4.6 ms
HO                               HHO                                       HHO
DL data modulation               π/4 DPSK, π/4 coherent QPSK, 8PSK,        GMSK, 8PSK
                                 GMSK, Q-O-QAM, B-O-QAM
DL power control                 Per slot and per carrier                  Per slot
DL variable rate accommodation   Slot aggregation                          Slot aggregation
UL data modulation               π/4 DPSK, π/4 coherent QPSK, 8PSK,        GMSK, 8PSK
                                 GMSK, Q-O-QAM, B-O-QAM
UL power control                 Per slot and per carrier                  BS-directed MS power control
UL variable rate accommodation   Slot aggregation                          Slot aggregation
Channel coding                   Punctured convolutional code              Convolutional coding; rate dependent
                                 (R = 1/2, 2/3, 3/4, 1/1); soft or hard    on service
                                 decision coding
Interleaving periods             0/20/40/140/240 ms                        Dependent on service
Rate detection                   Via L3 signaling                          Via stealing flags
Other features                   Space and frequency diversity;            MRC
                                 MRC/'MRC-like'; support for
                                 hierarchical structures
Random access mechanism          Random access with shared control         Random
                                 feedback (SCF), also reserved access
Power control steps              4 dB                                      2 dB
Super frame length               720/640 ms (hyperframe is 1280 ms)        720 ms
Slots/frame                      6 per 30 kHz carrier, 8 per 200 kHz       8
                                 carrier, 16-64 per 1.6 MHz carrier
Focus of backward compatibility  AMPS/IS54/136/GSM                         GSM

HHO, hard handoff; DL, downlink; UL, uplink.



Despreading, represented by the operator $D(\,)$, is performed if we apply $\varepsilon(\,)$
once again, followed by bandpass filtering with a bandwidth proportional to $2/T_m$,
represented by the operator $\mathrm{BPF}(\,)$, resulting in

$$D(S_w) = \mathrm{BPF}[\varepsilon(S_w)] = \mathrm{BPF}(c\,c\,b \cos \omega t) = \mathrm{BPF}(c^2 b \cos \omega t) = b \cos \omega t \qquad (2.3)$$

The baseband equivalent of Equation (2.3) is

$$D(S_w^b) = \mathrm{LPF}[\varepsilon(S_w^b)] = \mathrm{LPF}[c(t, T_c)\, c(t, T_c)\, b(t, T_m)] = \mathrm{LPF}[b(t, T_m)] = b(t, T_m) \qquad (2.3a)$$

where $\mathrm{LPF}(\,)$ stands for lowpass filtering. This approximates the operation of
correlating the input signal with the locally generated replica of the code,
$\mathrm{Cor}(c, S_w)$. Nonsynchronized despreading would result in

$$D_\tau(\,);\ \mathrm{Cor}(c_\tau, S_w) = \mathrm{BPF}[\varepsilon_\tau(S_w)] = \mathrm{BPF}(c_\tau c\, b \cos \omega t) = \rho(\tau)\, b \cos \omega t \qquad (2.4)$$

In Equation (2.4) the BPF would average out the signal envelope $c_\tau c$, resulting in
$E(c_\tau c) = \rho(\tau)$. The baseband equivalent of Equation (2.4) is



$$D_\tau(\,);\ \mathrm{Cor}(c_\tau, S_w^b) = \int_0^{T_m} c_\tau S_w^b\, dt = b(t, T_m) \int_0^{T_m} c_\tau c\, dt = b\rho(\tau) \qquad (2.4a)$$

This operation extracts the useful signal $b$ as long as $\tau \cong 0$; otherwise the signal
will be suppressed because $\rho(\tau) \cong 0$ for $\tau \ge T_c$. Separation of multipath
components in a RAKE receiver is based on this effect. In other words, if the received signal
consists of two delayed replicas of the form

$$r = S_w^b(t) + S_w^b(t - \tau)$$

the despreading process defined by Equation (2.4a) would result in

$$D_\tau(\,);\ \mathrm{Cor}(c, r) = \int_0^{T_m} c\, r\, dt = b(t, T_m) \int_0^{T_m} c\,(c + c_\tau)\, dt = b\rho(0) + b\rho(\tau)$$

Now, if $\rho(\tau) \cong 0$ for $\tau \ge T_c$, all multipath components reaching the receiver
with a delay larger than the chip interval will be suppressed. If the signal transmitted by
user $y$ is despread in receiver $x$, the result is

$$D_{xy}(\,);\ \mathrm{BPF}[\varepsilon_{xy}(S_w)] = \mathrm{BPF}(c_x c_y b_y \cos \omega t) = \rho_{xy}(t)\, b_y \cos \omega t \qquad (2.5)$$

So, in order to suppress the signals belonging to other users (multiple access interference,
MAI), the cross-correlation functions should be low. In other words, if the received signal
consists of the useful signal plus the interfering signal from another user,

$$r = S_{wx}^b(t) + S_{wy}^b(t) = b_x c_x + b_y c_y \qquad (2.6)$$

the despreading process at the receiver of user $x$ would produce

$$D_{xy}(\,);\ \mathrm{Cor}(c_x, r) = \int_0^{T_m} c_x r\, dt = b_x \int_0^{T_m} c_x c_x\, dt + b_y \int_0^{T_m} c_x c_y\, dt = b_x \rho_x(0) + b_y \rho_{xy}(0) \qquad (2.7)$$

When the system is properly synchronized $\rho_x(0) = 1$, and if $\rho_{xy}(0) = 0$ the second
component, representing MAI, will be suppressed. This simple principle is elaborated in the
WCDMA standard, resulting in a collection of transport and control channels. The system is
based on a 3.84 Mchip/s chip rate and data rates of up to 2 Mb/s. In a special downlink high
data rate shared channel the data rate and signal format are adaptive. There will be
mandatory support for QPSK and 16 QAM and optional support for 64 QAM based on UE
capability, which will proportionally increase the data rate. For details see www.3gpp.com.
CDMA is discussed in detail in Glisic [54, 55].
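The baseband relations (2.2a), (2.6) and (2.7) can be checked numerically with a short Python sketch. The following is an illustration only, under stated assumptions: the two length-8 codes are arbitrary orthogonal ±1 sequences chosen for the example, the integrals are replaced by sums over the chips of one bit, and the correlation is normalized by the code length so that $\rho_x(0) = 1$.

```python
def spread(bits, code):
    """Baseband spreading: every bit b = +/-1 multiplies the whole code."""
    return [b * c for b in bits for c in code]


def despread(chips, code):
    """Correlate the received chips with the local code, one bit interval at a time."""
    g = len(code)  # chips per bit (processing gain)
    out = []
    for i in range(0, len(chips), g):
        # Discrete counterpart of the integral over T_m, normalized by the code length.
        out.append(sum(r * c for r, c in zip(chips[i:i + g], code)) / g)
    return out


# Two orthogonal +/-1 codes (assumed for illustration), so rho_xy(0) = 0.
c_x = [1, 1, 1, 1, -1, -1, -1, -1]
c_y = [1, -1, 1, -1, 1, -1, 1, -1]

b_x = [1, -1, 1]
b_y = [-1, -1, 1]

# Received signal of Equation (2.6): sum of the two users' wideband signals.
r = [sx + sy for sx, sy in zip(spread(b_x, c_x), spread(b_y, c_y))]

print(despread(r, c_x))  # recovers b_x
print(despread(r, c_y))  # recovers b_y
```

Because the codes are exactly orthogonal and the users are chip-synchronous, the MAI term vanishes completely; with codes that are only approximately uncorrelated a small residual term proportional to $\rho_{xy}(0)$ would remain.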



2.3 ORTHOGONAL FREQUENCY DIVISION MULTIPLEXING 



In wireless communications, the channel imposes the limit on data rates in the system.
One way to increase the overall data rate is to split the data stream into a number of
parallel channels and use a different subcarrier for each channel. The concept is presented
in Figure 2.3 and represents the basic idea of the orthogonal frequency division multiplexing
(OFDM) system. The overall signal can be represented as

$$x(t) = \sum_{n=0}^{N-1} D_n \exp\left(j 2\pi \frac{n}{N} f_s t\right); \qquad -\frac{k_1}{f_s} < t < \frac{N + k_2}{f_s} \qquad (2.8)$$

In other words, complex data symbols $[D_0, D_1, \ldots, D_{N-1}]$ are mapped into OFDM symbols
$[d_0, d_1, \ldots, d_{N-1}]$ such that

$$d_k = \sum_{n=0}^{N-1} D_n e^{j 2\pi k n / N} \qquad (2.9)$$

Figure 2.4 Spectrum overlap in OFDM.

Figure 2.5 Basic OFDM system.

The output of the FFT block at the receiver produces data per channel. This can be
represented as

$$\hat{D}_m = \frac{1}{N} \sum_{k=0}^{N-1} r_k e^{-j 2\pi m k / N} = \begin{cases} H_n D_n + N(n), & n = m \\ N(n), & n \ne m \end{cases} \qquad (2.10)$$



The system block diagram is given in Figure 2.6. In order to eliminate residual
intersymbol interference, a guard interval after each symbol is used, as shown in Figure 2.7.
An example of an OFDM signal specified by the IEEE 802.11a standard is shown in Figure 2.8.
The signal parameters are: 64-point FFT, 48 data subcarriers, four pilots, 12 virtual
subcarriers, DC component 0, guard interval 800 ns. A discussion of OFDM and an extensive
list of references on the topic are included in Glisic [54].
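The mapping of data symbols into OFDM samples and their recovery at the receiver can be sketched with a direct (non-fast) DFT in plain Python. This is a minimal illustration under stated assumptions: the channel is ideal (all subcarrier gains equal to one, no noise), the guard interval is implemented as a cyclic prefix, and the function names are introduced here for the example only.

```python
import cmath


def idft_symbols(D):
    """Map data symbols D_n to time samples d_k = sum_n D_n exp(j 2 pi n k / N)."""
    N = len(D)
    return [sum(D[n] * cmath.exp(2j * cmath.pi * n * k / N) for n in range(N))
            for k in range(N)]


def dft_receive(r):
    """Recover D_m = (1/N) sum_k r_k exp(-j 2 pi m k / N) from received samples."""
    N = len(r)
    return [sum(r[k] * cmath.exp(-2j * cmath.pi * m * k / N) for k in range(N)) / N
            for m in range(N)]


def add_guard(d, guard_len):
    """Prepend the last guard_len samples as a cyclic prefix (assumed guard format)."""
    return d[-guard_len:] + d


def remove_guard(x, guard_len):
    """Discard the guard samples before the DFT."""
    return x[guard_len:]


D = [1 + 1j, 1 - 1j, -1 + 1j, -1 - 1j]   # four QPSK symbols, N = 4 subcarriers
tx = add_guard(idft_symbols(D), 1)        # transmitted samples with guard interval
D_hat = dft_receive(remove_guard(tx, 1))  # ideal channel: received = transmitted
print([complex(round(z.real, 6), round(z.imag, 6)) for z in D_hat])
```

A practical transmitter would use an FFT instead of the direct sums; the direct form is kept here because it mirrors the summations in the text term by term.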



2.4 MULTICARRIER CDMA 

Good performance and the flexibility to accommodate multimedia traffic are incorporated in
multicarrier (MC) CDMA, which is obtained by combining the CDMA and OFDM signal
formats. Figure 2.9 shows the DS-CDMA transmitter of the jth user for the binary
phase shift keying/coherent detection (CBPSK) scheme and the power spectrum of the
transmitted signal, where $G_{DS} = T_m/T_c$ denotes the processing gain and
$C^j(t) = [C_1^j\ C_2^j \cdots C_{G_{DS}}^j]$ the spreading code of the jth user. Figure 2.10
shows the MC-CDMA transmitter of the jth user for the CBPSK scheme and the power spectrum
of the transmitted signal, where $G_{MC}$ denotes the processing gain, $N_C$ the number
of subcarriers, and $C^j(t) = [C_1^j\ C_2^j \cdots C_{G_{MC}}^j]$ the spreading code of the
jth user. The MC-CDMA scheme is discussed assuming that the number of subcarriers and the
processing gain are the same.

However, we do not have to choose $N_C = G_{MC}$. In fact, if the original symbol
rate is high enough for the signal to become subject to frequency selective fading, the
signal needs to be S/P-converted first, before spreading over the frequency domain. This is
because it is crucial for multicarrier transmission to have frequency nonselective fading
over each subcarrier.

Figure 2.11 shows the modification to ensure frequency nonselective fading, where $T_s$
denotes the original symbol duration. The original data sequence is first converted into
$P$ parallel sequences, and then each sequence is mapped onto $G_{MC}$ subcarriers
($N_C = P \times G_{MC}$).
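The subcarrier mapping $N_C = P \times G_{MC}$ can be illustrated with a small Python sketch. It is a simplified model under stated assumptions: the function names are hypothetical, the chips of one symbol are placed on adjacent subcarriers (a practical system may interleave them across the band), and the channel is ideal, so that despreading reduces to an equal-weight sum over the $G_{MC}$ subcarriers of each parallel stream.

```python
def mc_cdma_map(symbols, code):
    """S/P-convert P symbols and spread each over len(code) subcarriers.

    Returns N_C = P * G_MC subcarrier values, one chip of one symbol per subcarrier.
    """
    return [d * c for d in symbols for c in code]


def mc_cdma_demap(subcarriers, code):
    """Despread in the frequency domain: combine the G_MC subcarriers of each stream."""
    G = len(code)
    if len(subcarriers) % G != 0:
        raise ValueError("number of subcarriers must be a multiple of the code length")
    P = len(subcarriers) // G
    return [sum(subcarriers[p * G + g] * code[g] for g in range(G)) / G
            for p in range(P)]


code = [1, -1, 1, -1]           # spreading code of one user, G_MC = 4 (assumed)
symbols = [1, -1]               # P = 2 parallel BPSK symbols
tx = mc_cdma_map(symbols, code)
print(len(tx), tx)              # N_C = P * G_MC = 8 subcarrier values
print(mc_cdma_demap(tx, code))  # recovered symbols
```

In a full transmitter the list returned by `mc_cdma_map` would feed the IFFT of the OFDM modulator, one value per subcarrier.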
Figure 2.10 MC-CDMA scheme.

Figure 2.11 Modification of MC-CDMA scheme: spectrum of its transmitted signal.



The multicarrier DS-CDMA transmitter spreads the S/P-converted data streams using a
given spreading code in the time domain so that the resulting spectrum of each subcarrier can
satisfy the orthogonality condition with the minimum frequency separation. This scheme
was originally proposed for an uplink communication channel, because the introduction
of OFDM signaling into the DS-CDMA scheme is effective for the establishment of a
quasi-synchronous channel.

Figure 2.12 shows the multicarrier DS-CDMA transmitter of the jth user and the power
spectrum of the transmitted signal, where $G_{MD}$ denotes the processing gain,
$N_C$ the number of subcarriers, and $C^j(t) = [C_1^j \cdots C_{G_{MD}}^j]$ the spreading
code of the jth user.

The multitone MT-CDMA transmitter spreads the S/P-converted data streams using a
given spreading code in the time domain so that the spectrum of each subcarrier prior to
the spreading operation can satisfy the orthogonality condition with the minimum frequency
separation. Therefore, the resulting spectrum of each subcarrier no longer satisfies the
orthogonality condition. The MT-CDMA scheme uses longer spreading codes, in proportion
to the number of subcarriers, than a normal (single carrier) DS-CDMA scheme;
therefore, the system can accommodate more users than the DS-CDMA scheme.

Figure 2.13 shows the MT-CDMA transmitter of the jth user for the CBPSK scheme
and the power spectrum of the transmitted signal, where $G_{MT}$ denotes the
processing gain, $N_C$ the number of subcarriers, and
$C^j(t) = [C_1^j\ C_2^j \cdots C_{G_{MT}}^j]$ the spreading code of the jth user. All these
schemes are discussed in detail in Glisic [54].

Figure 2.12 Multicarrier DS-CDMA scheme.

Figure 2.13 MT-CDMA scheme.



2.5 ULTRAWIDE BAND SIGNAL 

For multipath resolution in indoor environments, a chip interval on the order of a few
nanoseconds is needed. This results in a spread spectrum signal with a bandwidth on the
order of a few GHz. Such a signal can also be used with no carrier, resulting in what is called
impulse radio (IR) or ultrawide band (UWB) radio. A typical form of the signal used in this
case is shown in Figure 2.14. A collection of pulses received at different locations within
the indoor environment is shown in Figure 2.16. UWB radio is discussed in detail in Glisic
[54]. In this section we will define only a possible signal format.





Figure 2.15 Modified Hermite pulse with Gram-Schmidt orthogonalization. (a) Generated
pulses; (b) transmit pulses.

Figure 2.16 A collection of received pulses in different locations. (Reproduced by
permission of IEEE [53].)

Figure 2.17 A collection of channel delay profiles. (Reproduced by permission of IEEE
[52].)

A typical time-hopping format used in this case can be represented as

$$s_{tr}^{(k)}\left(t^{(k)}\right) = \sum_{j=-\infty}^{\infty} w_{tr}\left(t^{(k)} - jT_f - c_j^{(k)}T_c - \delta d_{\lfloor j/N_s \rfloor}^{(k)}\right) \tag{2.11}$$

where $t^{(k)}$ is the $k$th transmitter's clock time and $T_f$ is the pulse repetition time. The
transmitted pulse waveform $w_{tr}$ is referred to as a monocycle. To eliminate collisions due to
multiple access, each user (indexed by $k$) is assigned a distinctive time-shift pattern $\{c_j^{(k)}\}$
called a time-hopping sequence. This provides an additional time shift of $c_j^{(k)}T_c$ seconds
to the $j$th monocycle in the pulse train, where $T_c$ is the duration of addressable time delay
bins. For a fixed $T_f$, the symbol rate $R_s$ determines the number $N_s$ of monocycles that
are modulated by a given binary symbol as $R_s = 1/(N_s T_f)$ s$^{-1}$. The modulation index $\delta$ is
chosen to optimize performance. For performance prediction purposes, the data sequence
$\{d_j^{(k)}\}_{j=-\infty}^{\infty}$ is usually modeled as a wide-sense stationary random process composed
of equally likely symbols. Pulse position modulation is used for the data.
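The pulse timing implied by Equation (2.11) can be illustrated with a short sketch. It only computes the start time of each monocycle for one user. The numerical values of the frame time, hop bin, modulation index and hopping sequence below are hypothetical and chosen purely for illustration; they are not taken from the text.

```python
def th_pulse_times(bits, hop_codes, t_f, t_c, delta, n_s):
    """Start times of the monocycles of one user, following Equation (2.11).

    Pulse j starts at j*T_f + c_j*T_c + delta*d, where d is the bit carried by
    frame j. Each bit is repeated over n_s consecutive frames.
    """
    times = []
    for j in range(len(bits) * n_s):
        c_j = hop_codes[j % len(hop_codes)]  # periodic time-hopping sequence
        d = bits[j // n_s]                   # data bit modulating this frame
        times.append(j * t_f + c_j * t_c + delta * d)
    return times


# Hypothetical values: 100 ns frames, 10 ns hop bins, 0.2 ns PPM shift.
T_F, T_C, DELTA, N_S = 100e-9, 10e-9, 0.2e-9, 4
times = th_pulse_times(bits=[0, 1], hop_codes=[3, 0, 7, 2],
                       t_f=T_F, t_c=T_C, delta=DELTA, n_s=N_S)
symbol_rate = 1.0 / (N_S * T_F)  # R_s = 1/(N_s T_f)
```

With these assumed numbers, the only difference between the frames of the first bit and the corresponding frames of the second bit is the extra shift `delta` applied when the bit is 1, in addition to the whole number of frame times separating them.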

When $K$ users are active in the multiple-access system, the composite received signal at
the output of the receiver's antenna is modeled as

$$r(t) = \sum_{k=1}^{K} A_k s_{rec}^{(k)}(t - \tau_k) + n(t) \tag{2.12}$$

The antenna/propagation system modifies the shape of the transmitted monocycle $w_{tr}(t)$
to $w_{rec}(t)$ at its output. An idealized received monocycle shape $w_{rec}(t)$ for a free-space
channel model with no fading is shown in Figure 2.14.

The optimum receiver for a single bit of a binary modulated impulse radio signal in
additive white Gaussian noise (AWGN) is a correlation receiver:

$$\text{decide } d_0^{(1)} = 0 \text{ if } \underbrace{\sum_{j=0}^{N_s-1} \underbrace{\int_{\tau_1 + jT_f}^{\tau_1 + (j+1)T_f} r(u,t)\, v\left(t - \tau_1 - jT_f - c_j^{(1)}T_c\right) dt}_{\text{pulse correlator output } = \alpha_j(u)}}_{\text{test statistic } = \alpha(u)} > 0 \tag{2.13}$$

where $v(t) = w_{rec}(t) - w_{rec}(t - \delta)$.
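The decision rule (2.13) can be tried out numerically with the following sketch, which is only an illustration under stated assumptions. The monocycle shape (a second-derivative Gaussian with width parameter `TAU_M`), the sampling step and all timing values are hypothetical. Only a single noise-free user is simulated, and the integral is approximated by a sum over the whole record rather than frame by frame, which gives the same result here because the template is short compared with the frame.

```python
import math

TAU_M = 0.2877e-9  # assumed monocycle width parameter (s), not from the text


def monocycle(t):
    """Assumed received monocycle w_rec(t): second derivative of a Gaussian."""
    x = (t / TAU_M) ** 2
    return (1.0 - 4.0 * math.pi * x) * math.exp(-2.0 * math.pi * x)


def received_signal(bit, n_samples, dt, hops, t_f, t_c, delta, tau_1):
    """Noise-free single-user signal for one bit sent over len(hops) frames."""
    shifts = [tau_1 + j * t_f + c * t_c + delta * bit for j, c in enumerate(hops)]
    return [sum(monocycle(n * dt - s) for s in shifts) for n in range(n_samples)]


def detect_bit(r, dt, hops, t_f, t_c, delta, tau_1):
    """Sign test of (2.13) using the template v(t) = w_rec(t) - w_rec(t - delta)."""
    statistic = 0.0
    for j, c in enumerate(hops):
        shift = tau_1 + j * t_f + c * t_c
        for n, sample in enumerate(r):
            t = n * dt - shift
            statistic += sample * (monocycle(t) - monocycle(t - delta)) * dt
    return 0 if statistic > 0 else 1


# Hypothetical parameters: 4 frames of 20 ns, 2 ns hop bins, 0.156 ns PPM shift.
PARAMS = dict(dt=0.05e-9, hops=[3, 0, 5, 2], t_f=20e-9, t_c=2e-9,
              delta=0.156e-9, tau_1=1e-9)
decisions = [detect_bit(received_signal(b, 1600, **PARAMS), **PARAMS)
             for b in (0, 1)]
```

The sign of the statistic follows from the autocorrelation of the pulse: for a transmitted 0 the correlation with $w_{rec}(t)$ dominates, while for a transmitted 1 the correlation with the delayed pulse $w_{rec}(t - \delta)$ dominates.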

The spectra of a signal using TH are shown in Figure 2.18. If a DS signal is used instead
of TH, the signal spectra are shown in Figure 2.19(a) for a pseudorandom code and
Figure 2.19(b) for a random code. The FCC (Federal Communications Commission) mask for
indoor communications is shown in Figure 2.20. Possible options for UWB signal spectra
are given in Figures 2.21 and 2.22 for single-band and Figure 2.23 for multiband signal
formats. For more details see www.uwb.org and www.uwbmultiband.org.




Figure 2.18 Spectra of a TH signal. 
Figure 2.20 FCC frequency mask.



The optimal detection in a multiuser environment, with knowledge of all time-hopping 
sequences, leads to complex parallel receiver designs [2]. However, if the number of users 
is large and no such multiuser detector is feasible, then it is reasonable to approximate 
the combined effect of the other users' dehopped interfering signals as a Gaussian random 
process. All details regarding the system performance can be found in Glisic [54]. 
Figure 2.21 FCC mask and possible UWB signal spectra.
Figure 2.22 Single-band UWB signal.



2.6 MIMO CHANNELS AND SPACE TIME CODING 

In order to increase the capacity, 4G networks use the previously described signal formats in 
MIMO (multiple input multiple output) channels combined with space-time coding. More 
details on these technologies can be found in Glisic [54]. 
Figure 2.23 Multiband UWB signal.

REFERENCES 

[1] F. Adachi, Evolution towards broadband wireless systems, in 5th Int. Symp. Wireless 
Personal Multimedia Communications, Honolulu, HI, vol. 1, 27-30 October 2002, 
pp. 19-26. 

[2] Jun-Zhao Sun, J. Sauvola and D. Howie, Features in future: 4G visions from a technical 
perspective, IEEE Global Telecommunications Conference, GLOBECOM '01, vol. 6, 
25-29 November 2001, pp. 3533-3537. 

[3] D.I. Axiotis, F.I. Lazarakis, C. Vlahodimitropoulos and A. Chatzikonstantinou, 4G
system level simulation parameters for evaluating the interoperability of MTMR in
UMTS and HIPERLAN/2, in 4th Int. Workshop on Mobile and Wireless Communications
Network, 9-11 September 2002, pp. 559-563.

[4] A. Mihovska, C. Wijting, R. Prasad, S. Ponnekanti, Y. Awad, and M. Nakamura, 
A novel flexible technology for intelligent base station architecture support for 4G 
systems, in 5th Int. Symp. Wireless Personal Multimedia Communications, vol. 2, 
27-30 October 2002, pp. 601-605. 

[5] D. Kitazawa, L. Chen, H. Kayama and N. Umeda, Downlink packet-scheduling considering
transmission power and QoS in CDMA packet cellular systems, in 4th Int.
Workshop on Mobile and Wireless Communications Network, 9-11 September 2002,
pp. 183-187.

[6] L. Dell'Uomo and E. Scarrone, An all-IP solution for QoS mobility management and 
AAA in the 4G mobile networks, in 5th Int. Symp. Wireless Personal Multimedia 
Communications, vol. 2, 27-30 October 2002, pp. 591-595. 

[7] E.R. Wallenius, End-to-end in-band protocol based service quality and transport QoS 
control framework for wireless 3/4G services, in 5th Int. Symp. Wireless Personal 
Multimedia Communications, vol. 2, 27-30 October 2002, pp. 531-533. 

[8] M. Benzaid, P. Minet and K. Al Agha, Integrating fast mobility in the OLSR routing 
protocol, in 4th Int. Workshop on Mobile and Wireless Communications Network, 9-11 
September 2002, pp. 217-221. 

[9] G. Kambourakis, A. Rouskas and S. Gritzalis, Using SSL/TLS in authentication and 
key agreement procedures of future mobile networks, in 4th Int. Workshop on Mobile 
and Wireless Communications Network, 9-11 September 2002, pp. 152-156. 






[10] M. van der Schaar and J. Meehan, Robust transmission of MPEG-4 scalable video 
over 4G wireless networks, in Proc. Int. Conf. Image Processing, 24-28 June 2002, 
pp. 757-760. 

[11] Qing-Hui Zeng, Jian-Ping Wu, Yi-Lin Zeng, Ji-Long Wang and Rong-Hua Qin, Re- 
search on controlling congestion in wireless mobile Internet via satellite based on 
multi-information and fuzzy identification technologies, in Proc. 2002 Int. Conf. Ma- 
chine Learning and Cybernetics, vol. 4, 4-5 November 2002, pp. 1697-1701. 

[12] Proc. 5th Int. Symp. Wireless Personal Multimedia Communications, (catalog no. 
02EX568), vol. 1, 27-30 October 2002. 

[13] T. Sukuvaara, P. Mahonen and T. Saarinen, Wireless Internet and multimedia services 
support through two-layer LMDS system, in IEEE Int. Workshop on Mobile Multime- 
dia Communications (MoMuC '99), 15-17 November 1999, pp. 202-207. 

[14] C.C. Martin, J.H. Winters and N.R. Sollenberger, Multiple-input multiple-output
(MIMO) radio channel measurements, in Proc. 2000 IEEE Sensor Array and Multichannel
Signal Processing Workshop, 16-17 March 2000, pp. 45-46.

[15] J.M. Pereira, Fourth generation: now, it is personal!, in 11th IEEE Int. Symp. Personal, 
Indoor and Mobile Radio Communications, PIMRC 2000, vol. 2, 18-21 September 
2000, pp. 1009-1016. 

[16] T. Otsu, N. Umeda and Y. Yamao, System architecture for mobile communications 
systems beyond IMT-2000, in IEEE Global Telecommunications Conference, GLOBE- 
COM '01, vol. 1, 25-29 November 2001, pp. 538-542. 

[17] Yi Han Zhang, D. Makrakis, S. Primak and Yun Bo Huang, Dynamic support of service 
differentiation in wireless networks 2002, in IEEE CCECE 2002. Canadian Conf. on 
Electrical and Computer Engineering, vol. 3, 12-15 May 2002, pp. 1325-1330. 

[18] Jun-Zhao Sun and J. Sauvola, Mobility and mobility management: a conceptual framework,
in 10th IEEE Int. Conf. Networks, ICON 2002, 27-30 August 2002, pp. 205-210.

[19] V. Vassiliou, H.L. Owen, D.A. Barlow, J. Grimminger, H.-P. Huth and J. Sokol, A 
radio access network for next generation wireless networks based on multi-protocol 
label switching and hierarchical mobile IP, in Proc. IEEE 56th Vehicular Technology 
Conference, VTC 2002, vol. 2, 24-28 September 2002, pp. 782-786. 

[20] P. Nicopolitidis, G.I. Papadimitriou, M.S. Obaidat and A.S. Pomportsis, 3G wireless 
systems and beyond: a review, in 9th Int. Conf. Electronics, Circuits and Systems, vol. 
3, 15-18 September 2002, pp. 1047-1050. 

[21] Proc. IEEE Wireless Communications and Networking Conference, WCNC 2002 (catalog
no. 02TH8609), vol. 1, 17-21 March 2002.

[22] J. Borras-Chia, Video services over 4G wireless networks: not necessarily streaming, 
in IEEE Wireless Communications and Networking Conf, WCNC2002, vol. 1, 17-21 
March 2002, pp. 18-22. 

[23] B.G. Evans and K. Baughan, Visions of 4G, Electron. Commun. Eng. J., vol. 12, no. 
6, 2000, pp. 293-303. 

[24] J. Kim and A. Jamalipour, Traffic management and QoS provisioning in future wireless 
IP networks, IEEE Person. Commun. (see also IEEE Wireless Commun.), vol. 8, no. 
5, 2001, pp. 46-55. 

[25] A.H. Aghvami, T.H. Le and N. Olaziregi, Mode switching and QoS issues in software
radio, IEEE Person. Commun. (see also IEEE Wireless Commun.), vol. 8, no. 5, 2001,
pp. 38-44.






[26] T. Kanter, An open service architecture for adaptive personal mobile communica- 
tion, IEEE Person. Commun. (see also IEEE Wireless Commun.), vol. 8, no. 6, 2001, 
pp. 8-17. 

[27] H. Sampath, S. Talwar, J. Tellado, V. Erceg and A. Paulraj, A fourth-generation
MIMO-OFDM broadband wireless system: design, performance, and field trial results, IEEE
Commun. Mag., vol. 40, no. 9, 2002, pp. 143-149.

[28] W. Kellerer and H.-J. Vogel, A communication gateway for infrastructure-independent
4G wireless access, IEEE Commun. Mag., vol. 40, no. 3, 2002, pp. 126-131.

[29] V. Huang and Weihua Zhuang, QoS-oriented access control for 4G mobile multimedia
CDMA communications, IEEE Commun. Mag., vol. 40, no. 3, 2002, pp. 118-125.

[30] P. Smulders, Exploiting the 60 GHz band for local wireless multimedia access:
prospects and future directions, IEEE Commun. Mag., vol. 40, no. 1, 2002, pp. 140-147.

[31] Y. Raivio, 4G-hype or reality, in Second Int. Conf. 3G Mobile Communication Tech- 
nologies. (Conference Publication no. 477), 26-28 March 2001, pp. 346-350. 

[32] L. Becchetti, F. Delli Priscoli, T. Inzerilli, P. Mahonen and L. Munoz, Enhancing
IP service provision over heterogeneous wireless networks: a path toward 4G, IEEE
Commun. Mag., vol. 39, no. 8, 2001, pp. 74-81.

[33] T. Abe, H. Fujii and S. Tomisato, A hybrid MIMO system using spatial correlation, in 
5th Int. Symp. Wireless Personal Multimedia Communications, vol. 3, 27-30 October 
2002, pp. 1346-1350. 

[34] S. Lincke-Salecker and C.S. Hood, A supernet: engineering traffic across network
boundaries, in 36th Annual Simulation Symp., 30 March-2 April 2003, pp. 117-124.

[35] Y. Yamao, H. Suda, N. Umeda and N. Nakajima, Radio access network design concept 
for the fourth generation mobile communication system, in Proc. IEEE 51st Vehicular 
Technology Conf., VTC 2000, Tokyo, vol. 3, 15-18 May 2000, pp. 2285-2289. 

[36] E. Ozturk and G.E. Atkin, Multi-scale DS-CDMA for 4G wireless systems, IEEE 
Global Telecommunications Conf., GLOBECOM '01, vol. 6, 25-29 November 2001, 
pp. 3353-3357. 

[37] L. Dell'Uomo and E. Scarrone, The mobility management and
authentication/authorization mechanisms in mobile networks beyond 3G, in 12th IEEE Int. Symp.
Personal, Indoor and Mobile Radio Communications, vol. 1, 30 September-3 October
2001, pp. C-44-C-48.

[38] K.J. Kumar, B.S. Manoj and C.S.R. Murthy, On the use of multiple hops in next 
generation cellular architectures, in 10th IEEE Int. Conf. Networks, ICON 2002, 27- 
30 August 2002, pp. 283-288. 

[39] S.S. Wang, M. Green and M. Malkawi, Mobile positioning and location services, in
IEEE Radio and Wireless Conf. RAWCON, 11-14 August 2002, pp. 9-12.

[40] M. Motegi, H. Kayama and N. Umeda, Adaptive battery conservation management
using packet QoS classifications for multimedia mobile packet communications, in
Proc. 56th IEEE Vehicular Technology Conf. VTC 2002, vol. 2, 24-28 September
2002, pp. 834-838.

[41] Ying Li, Shibua Zhu, Pinyi Ren and Gang Hu, Path toward next generation wire- 
less internet-cellular mobile 4G, WLAN/WPAN and IPv6 backbone, in Proc. 2002 
IEEE Region 10 Conf. Computers, Communications, Control and Power Engineering 
TENCOM '02, vol. 2, 28-31 October 2002, pp. 1146-1149. 






[42] R.C. Qiu, Wenwu Zhu and Ya-Qin Zhang, Third-generation and beyond (3.5G) wire- 
less networks and its applications, in IEEE Int. Symp. Circuits and Systems, ISCAS 
2002, vol. 1, 26-29 May 2002, pp. I-41-I-44. 

[43] C. Bornholdt, B. Sartorius, J. Slovak, M. Mohrle, R. Eggemann, D. Rohde and G. 
Grosskopf, 60 GHz millimeter-wave broadband wireless access demonstrator for the 
next-generation mobile internet, in Optical Fiber Communication Conf. and Exhibit, 
OFC, 17-22 March 2002, pp. 148-149. 

[44] Jianhua He, Zongkai Yang, Daiqin Yang, Zuoyin Tang and Chun Tung Chou, Inves- 
tigation of JPEG2000 image transmission over next generation wireless networks, in 
5th IEEE Int. Conf. High Speed Networks and Multimedia Communications, 3-5 July 
2002, pp. 71-77. 

[45] E. Baccarelli and M. Biagi, Error resistant space-time coding for emerging 4G- 
WLANs, in IEEE Wireless Communications and Networking, WCNC 2003, vol. 1, 
16-20 March 2003, pp. 72-77. 

[46] W. Mohr, WWRF - the Wireless World Research Forum, Electron. Commun. Eng. J., 
vol. 14, no. 6, 2002, pp. 283-291. 

[47] T. Otsu, I. Okajima, N. Umeda and Y. Yamao, Network architecture for mobile
communications systems beyond IMT-2000, IEEE Person. Commun. (see also IEEE Wireless
Commun.), vol. 8, no. 5, October 2001, pp. 31-37.

[48] A. Bria, F. Gessler, O. Queseth, R. Stridh, M. Unbehaun, Jiang Wu, J. Zander and
M. Flament, 4th-Generation wireless infrastructures: scenarios and research challenges,
IEEE Person. Commun. (see also IEEE Wireless Commun.), vol. 8, no. 6, 2001,
pp. 25-31.

[49] F. Fitzek, A. Kopsel, A. Wolisz, M. Krishnam and M. Reisslein, Providing
application-level QoS in 3G/4G wireless systems: a comprehensive framework based on multirate
CDMA, IEEE Wireless Commun. (see also IEEE Person. Commun.), vol. 9, no. 2, April
2002, pp. 42-47.

[50] B. Classon, K. Blankenship and V. Desai, Channel coding for 4G systems with adaptive
modulation and coding, IEEE Wireless Commun. (see also IEEE Person. Commun.),
vol. 9, no. 2, 2002, pp. 8-13.

[51] Yile Guo and H. Chaskar, Class-based quality of service over air interfaces in 4G
mobile networks, IEEE Commun. Mag., vol. 40, no. 3, 2002, pp. 132-137.

[52] D. Cassioli, M.Z. Win and A.F. Molisch, The ultra-wide bandwidth indoor channel:
from statistical model to simulations, IEEE J. Selected Areas in Commun., vol. 20,
no. 6, 2002, pp. 1247-1257.

[53] M.Z. Win and R.A. Scholtz, Characterization of ultra-wide bandwidth wireless indoor
channels: a communication-theoretic view, IEEE J. Selected Areas in Commun., vol.
20, no. 9, 2002, pp. 1613-1627.

[54] S. Glisic, Advanced Wireless Communications, 4G Technology. John Wiley & Sons
Ltd: Chichester, 2004.

[55] S. Glisic, Adaptive WCDMA, Theory and Practice. John Wiley & Sons Ltd:
Chichester, 2004.




Channel Modeling for 4G 



3.1 MACROCELLULAR ENVIRONMENTS (1.8 GHz)



In this section we briefly present statistical properties of azimuth and delay spread in
macrocellular environments. For more details see Glisic [1]. The analysis is based on data
reported from a measurement campaign in typical urban (TU), bad urban (BU) and suburban
(SU) areas [3]. In the experiment a base station (BS) equipped with an eight-element uniform
linear antenna array and a mobile station (MS) with an omnidirectional dipole antenna
are used. The MS is equipped with a differential global positioning system (GPS) and an
accurate position encoder, so its location is accurately known by combining the information
from these two devices. MS displacements of less than 1 cm can, therefore, be detected. The
system is designed for transmission from the MS to the BS. Simultaneous channel sounding
is performed on all eight branches, which makes it possible to estimate the azimuth of the
impinging waves at the BS. The sounding signal is a maximum length linear shift register
sequence of length 127 chips, clocked at a chip rate of 4.096 Mchip/s. This chip rate was
initially used in WCDMA proposals in Europe. The testbed operates at a carrier frequency
of 1.8 GHz. Additional information regarding the stand-alone testbed can be found in
Pedersen et al. [2], Frederiksen et al. [3] and Algans et al. [4]. A summary of macrocellular
measurement environments is given in Table 3.1.

The channel azimuth-delay spread function at the BS is modeled as

$$h(\phi, \tau) = \sum_{l=1}^{L} \alpha_l \, \delta(\phi - \phi_l, \tau - \tau_l) \tag{3.1}$$

where the parameters $\alpha_l$, $\tau_l$ and $\phi_l$ are the complex amplitude, delay and incidence azimuth
of the $l$th impinging wave at the BS. In general, $h(\phi, \tau)$ is considered to be a time-variant
function, since the constellation of the impinging waves is likely to change as the MS moves
Table 3.1 Summary of macrocellular measurement environments

| Class | BS antenna height | Description of environment |
|---|---|---|
| TU, typical urban | 10 and 32 m | The city of Aarhus, Denmark. Uniform density of buildings ranging from four to six floors. Irregular street layout. Measurements were carried out along six different routes with an average length of 2 km. No line-of-sight between MS and BS. MS-BS distance varies from 0.2 to 1.1 km |
| TU | 21 m | Stockholm city, Sweden (area 1). Heavily built-up area with a uniform density of buildings, ranging from four to six floors. Ground is slightly rolling. No line-of-sight between MS and BS. MS-BS distance varies from 0.2 to 1.1 km |
| BU, bad urban | 21 m | Stockholm city, Sweden (area 2). Mixture of open flat areas (river) and densely built-up zones. Ground is slightly rolling. No line-of-sight between MS and BS. MS-BS distance varies from 0.9 to 1.6 km |
| SU, suburban | 12 m | The city of Gistrup, Denmark. Medium-sized village with family houses of one to two floors and small gardens with trees and bushes. Typical Danish residential area. The terrain around the village is rolling with some minor hills. No line-of-sight between MS and BS. MS-BS distance varies from 0.3 to 2.0 km |




along a certain route. The local average power azimuth-delay spectrum is given as

$$P(\phi, \tau) = E\left\{ \sum_{l=1}^{L} |\alpha_l|^2 \, \delta(\phi - \phi_l, \tau - \tau_l) \right\} \tag{3.2}$$



From Equation (3.2), the local power azimuth spectrum (PAS) and the local power delay
spectrum (PDS) are given as

$$P_A(\phi) = \int P(\phi, \tau)\, d\tau \tag{3.3}$$

$$P_D(\tau) = \int P(\phi, \tau)\, d\phi \tag{3.4}$$



The radio channel's local azimuth spread (AS) $\sigma_A$ and the local delay spread (DS) $\sigma_D$ are
defined as the root second central moments of the corresponding variables. The values of the
local AS and DS are likely to vary as the MS moves within a certain environment. Hence,
we can characterize $\sigma_A$ and $\sigma_D$ as random variables with the joint PDF $f(\sigma_A, \sigma_D)$.
Their individual PDFs are

$$f_A(\sigma_A) = \int f(\sigma_A, \sigma_D)\, d\sigma_D \tag{3.5}$$

$$f_D(\sigma_D) = \int f(\sigma_A, \sigma_D)\, d\sigma_A \tag{3.6}$$


The function $f(\sigma_A, \sigma_D)$ can be interpreted as the global joint PDF of the local AS and
DS. If the expectation in Equation (3.2) is computed over the radio channel's fast fading
component, we can furthermore apply the approximation

$$\iint P(\phi, \tau)\, d\phi\, d\tau = h_{channel} = h_{loss}(d)\, h_s \tag{3.7}$$



where $h_{channel}$ is the radio channel's integral path loss, $h_{loss}(d)$ is the deterministic long-term
distance-dependent path loss, while $h_s$ is the channel's shadow fading component, which
is typically modeled by a log-normal distributed random variable [5, 6]. The global PDF of
$h_s$ is denoted $f_s(h_s)$. The global degree of shadow fading is described by the root second
central moment of the random shadow fading component expressed in decibels, i.e.

$$\sigma_s = \mathrm{Std}\{10 \log_{10}(h_s)\} \tag{3.8}$$

where Std{} denotes the standard deviation. Empirical results for cumulative distribution
functions (CDF) of $\sigma_A$ and $\sigma_D$ are given in Figures 3.1 and 3.2, respectively. The log-normal
fit for the $\sigma_A$ results is given as

$$\sigma_A = 10^{\varepsilon_A X + \mu_A} \tag{3.9}$$

where $X$ is a zero-mean Gaussian distributed random variable with unit variance,
$\mu_A = E\{\log_{10}(\sigma_A)\}$ is the global logarithmic mean of the local AS, and
$\varepsilon_A = \mathrm{Std}\{\log_{10}(\sigma_A)\}$ is the logarithmic standard deviation of the AS.




[Figure 3.1 plot: empirical results and fitted log-normal distributions for TU-32 and BU; horizontal axis: azimuth spread (degrees).]


Figure 3.1 Examples of empirical CDF of AS obtained in different environments. The CDF 
of a log-normal distribution is fitted to the empirical results for comparison. 
(Reproduced by permission of IEEE [4].) 
Figure 3.2 Examples of empirical CDFs of the DS in different environments. The CDF
of a log-normal distribution is fitted to the empirical results for comparison. 
(Reproduced by permission of IEEE [4].) 



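Table 3.2 Summary of the first and second central moments of the AS, DS, and shadow
fading in the different environments (reproduced by permission of IEEE [4])

| Class | $\sigma_s$ | $E\{\sigma_A\}$ | $\mu_A$ | $\varepsilon_A$ | $E\{\sigma_D\}$ | $\mu_D$ | $\varepsilon_D$ |
|---|---|---|---|---|---|---|---|
| TU-32 | 7.3 dB | 8° | 0.74 | 0.47 | 0.8 µs | -6.20 | 0.31 |
| TU-21 | 8.5 dB | 8° | 0.77 | 0.37 | 0.9 µs | -6.13 | 0.28 |
| TU-21 | 7.9 dB | 13° | 0.95 | 0.44 | 1.2 µs | -6.08 | 0.35 |
| BU | 10.0 dB | 7° | 0.54 | 0.60 | 1.7 µs | -5.99 | 0.46 |
| SU | 6.1 dB | 8° | 0.84 | 0.31 | 0.5 µs | -6.40 | 0.22 |
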

Similarly,

$$\sigma_D = 10^{\varepsilon_D Y + \mu_D} \tag{3.10}$$

where $Y$ is a zero-mean Gaussian distributed random variable with unit variance,
$\mu_D = E\{\log_{10}(\sigma_D)\}$ is the global logarithmic mean of the local DS, and
$\varepsilon_D = \mathrm{Std}\{\log_{10}(\sigma_D)\}$ is the logarithmic standard deviation of the DS. A summary of the
results for these parameters is given in Table 3.2. For characterization of shadow fading
see References [1-30].
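The log-normal model of Equation (3.10) is straightforward to simulate. The sketch below draws local delay spreads for the TU-32 class using the $\mu_D$ and $\varepsilon_D$ values listed in Table 3.2 (with the delay spread expressed in seconds) and compares the sample mean with the closed-form mean of a log-normal variable. The sample size and random seed are arbitrary choices made for this illustration.

```python
import math
import random


def sample_spread(mu, eps, rng):
    """Draw one local spread value 10**(eps*X + mu) with X ~ N(0, 1)."""
    return 10.0 ** (eps * rng.gauss(0.0, 1.0) + mu)


rng = random.Random(0)          # fixed seed so the run is repeatable
MU_D, EPS_D = -6.20, 0.31       # TU-32 row of Table 3.2

samples = [sample_spread(MU_D, EPS_D, rng) for _ in range(100000)]
empirical_mean = sum(samples) / len(samples)

# Mean of a log-normal variable: 10**mu * exp((eps * ln 10)**2 / 2)
analytic_mean = 10.0 ** MU_D * math.exp((EPS_D * math.log(10.0)) ** 2 / 2.0)
```

The analytic mean evaluates to roughly 0.81 µs, which is in line with the mean delay spread of 0.8 µs listed for TU-32. The same function can be used for the azimuth spread by passing $\mu_A$ and $\varepsilon_A$.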



3.2 URBAN SPATIAL RADIO CHANNELS IN MACRO/MICROCELL 
ENVIRONMENT (2.154 GHz) 

The discussion in this section is based on the experimental results collected with a wideband
channel sounder using a planar antenna array [31]. The signal center frequency was 2154 MHz
and the measurement bandwidth was 100 MHz. A periodic PN sequence, 255 chips long, was
used. The chip rate was 30 MHz and the sampling rate 120 MHz, giving an oversampling
factor of 4. The correlation technique was used for the determination of the impulse
response. Hence, the delay range was 255/30 MHz = 8.5 µs, with a resolution of 1/30
MHz = 33 ns. The transmit antenna at the MS was a vertically polarized omnidirectional
discone antenna. The vertical 3 dB beamwidth was 87° and the transmit power 40 dBm.
Approximately 80 different transmitter positions were investigated.

Figure 3.3 The measurement area with all three RX sites; TX positions of the sample plots
are marked. (Reproduced by permission of IEEE [31].)
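The delay range and delay resolution quoted for this sounder follow directly from the chip rate and the sequence length. The short calculation below reproduces them. The conversion of the delay resolution into an excess path length is an added illustration that uses the speed of light and is not part of the original text.

```python
CHIP_RATE_HZ = 30e6            # chip rate of the PN sequence
SEQUENCE_LENGTH_CHIPS = 255    # length of the periodic PN sequence
SAMPLING_RATE_HZ = 120e6       # receiver sampling rate
SPEED_OF_LIGHT_M_S = 299792458.0

delay_range_s = SEQUENCE_LENGTH_CHIPS / CHIP_RATE_HZ   # 8.5 us delay window
delay_resolution_s = 1.0 / CHIP_RATE_HZ                # about 33 ns
oversampling = SAMPLING_RATE_HZ / CHIP_RATE_HZ         # 4 samples per chip

# Excess path length that corresponds to one resolvable delay bin (about 10 m).
path_resolution_m = SPEED_OF_LIGHT_M_S * delay_resolution_s
```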

The receiving BS was located at one of three different sites below, at and above the
rooftop level (RX1-RX3, see Figure 3.3). A 16-element physical array with dual-polarized
$\lambda/2$-spaced patch antennas was combined with a synthetic aperture technique to build a
virtual two-dimensional (2-D) antenna structure. The patches were linearly polarized at 0°
(horizontal direction) and 90° (vertical direction). With these 16 x 62 elements the direction
of arrival (DOA) of incoming waves both in azimuth (horizontal angle) and elevation
(vertical angle) could be resolved using the super-resolution Unitary ESPRIT algorithm [32-34].
Note that the number of antenna elements limits the number of identifiable waves, but not
the angular resolution of the method. Together with a delay resolution of 33 ns, the radio
channel can be characterized in all three dimensions separately for the two polarizations.
Array signal processing, including estimation of the DOAs and a comparison of ESPRIT
with other algorithms, can be found in Glisic [1].

One prerequisite for the applicability of the synthetic aperture technique is that the radio 
channel is static during the whole data collection period. To avoid problems, the whole 
procedure was done at night with minimum traffic conditions. 

3.2.1 Description of environment 

A typical urban environment is shown in Figure 3.3 [31] with three receiver locations
(RX1-RX3) marked by triangles pointing in the broadside direction of the array. Figure
3.3 also shows all the corresponding TX positions. The location RX1 (height $h_{RX}$ = 10 m)
was a typical microcell site below the rooftop height of the surrounding buildings, and
measurements were performed with 20 different TX positions. RX2 (height $h_{RX}$ = 27 m)
was at the rooftop level, and 32 TX positions were investigated. RX3 (height $h_{RX}$ = 21 m)
was a typical macrocell BS position above rooftop heights, and 27 TX positions were
measured.

3.2.2 Results 

The measurement results show that it is possible to identify many single (particular, differ- 
ent) multipath components, impinging at the receiver from different directions. However, 
these components are not randomly distributed in the spatial and temporal domain; they nat- 
urally group into clusters. These clusters can be associated with objects in the environment 
due to the high angular and temporal resolution of evaluation. (Sometimes even individual 
waves, within a cluster, can be associated with scattering objects.) The identification of such 
clusters is facilitated by inspection of the maps of the environment. A cluster is defined as 
a group of waves whose delay, azimuth and elevation at the receiver are very similar, while 
being notably different from other waves in at least one dimension. Additionally all waves 
inside a cluster must stem from the same propagation mechanism. The definition of clusters 
always involves a certain amount of arbitrariness. Even for mathematically 'exact' defi- 
nitions, arbitrary parameters (e.g. thresholds or number of components) must be defined. 
Clustering by human inspection, supported by maps of the environment, seems to give the 
best results. The received power is calculated within each cluster (cluster power) by means 
of unitary ESPRIT and a following beam-forming algorithm. The results are plotted in the 
azimuth-elevation, azimuth-delay and elevation-delay planes.

According to the obvious propagation mechanism, each cluster is assigned to one of
three different classes:

• Class 1, street-guided propagation - waves arrive at the receiver from the street level
after traveling through street canyons.

• Class 2, direct propagation over the rooftop - the waves arrive at the BS from the rooftop
level by diffraction at the edges of roofs, either directly or after reflection from buildings
surrounding the MS. The azimuth mostly points to the direction of the transmitter with
some spread in azimuth and delay.

• Class 3, reflection from high-rise objects over the rooftop - the elevation angles are near
the horizon, pointing at or above the rooftop. The waves undergo a reflection at an object
rising above the average building height before reaching the BS. The azimuth shows the
direction of the reflecting building; the delay is typically larger than for class 1 or class 2.

The sum of the powers of all clusters belonging to the same class is called class power. In
some cases the propagation history is a mixture of different classes, e.g. street guidance
followed by diffraction at rooftops. Such clusters are allocated to the class of the final
path to the BS.

For the evaluation of delays we define the vector $\mathbf{P}$ containing the powers of the clusters
and the vector $\boldsymbol{\tau}$ of corresponding mean delays. A particular cluster $i$ has mean delay $\tau_i$ and
power $P_i$. The relation between the delays $\boldsymbol{\tau}$ and the powers $\mathbf{P}$ is modeled as exponential:

$$P_i \propto P(\tau_i) = a e^{-\tau_i / b} \tag{3.11}$$
Table 3.3 The model parameters a and b for both received polarizations (VP and HP)
averaged over all available clusters. The transmitter was VP

| Parameter | VP-VP | VP-HP |
|---|---|---|
| a | -3.9 dB | -3.6 dB |
| b | 8.9 dB/µs | 11.8 dB/µs |

Table 3.4 Average delay of the strongest cluster of RX1, RX2 and RX3

| RX | Average delay VP-VP (µs) | Average delay VP-HP (µs) |
|---|---|---|
| 1 | 0.068 | 0.071 |
| 2 | 0.38 | 0.28 |
| 3 | 0.11 | 0.048 |




Experimental data are fitted to the model (3.11) using least squares (LS) estimation. The
logarithmic estimation error $\nu$ is defined as

$$\nu = 10 \log P - 10 \log s(\theta) \tag{3.12}$$

and its standard deviation $\sigma_\nu$ as

$$\sigma_\nu = \sqrt{\mathrm{var}\{\nu\}} \tag{3.13}$$

The summary of the results for parameters $a$ and $b$ of the model $P_i = a e^{-\tau_i / b}$ is shown in
Tables 3.3 and 3.4.

In Equation (3.13), $\sigma_\nu$ was defined as the standard deviation of the logarithmic estimation
error in dB. This estimation error was found to be log-normally distributed and, up to a
delay of about 1 µs, $\sigma_\nu$ is independent of the delay $\tau$. The value of $\sigma_\nu$ is 9.0 and 10.0 dB
(co- and cross-polarization, respectively), averaged over the first microsecond. The average
powers for different classes of clusters are shown in Table 3.5. Additional data on the topic
can be found in References [31-41].
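A least-squares fit of the kind described above can be sketched as follows. Because Table 3.3 quotes $a$ in dB and $b$ in dB/µs, the sketch assumes that the fit is carried out in the logarithmic domain, i.e. that the cluster power in dB decreases linearly with delay as $a - b\tau$. This interpretation, the function names and the synthetic cluster data are assumptions made for illustration; the original measurement data are not reproduced here.

```python
import math


def fit_power_delay(delays_us, powers_db):
    """Least-squares fit of powers_db ~ a - b * delay (a in dB, b in dB/us)."""
    n = len(delays_us)
    mean_t = sum(delays_us) / n
    mean_p = sum(powers_db) / n
    s_tt = sum((t - mean_t) ** 2 for t in delays_us)
    s_tp = sum((t - mean_t) * (p - mean_p) for t, p in zip(delays_us, powers_db))
    slope = s_tp / s_tt
    return mean_p - slope * mean_t, -slope


def residual_std(delays_us, powers_db, a, b):
    """Standard deviation of the logarithmic estimation error, as in (3.13)."""
    residuals = [p - (a - b * t) for t, p in zip(delays_us, powers_db)]
    mean_r = sum(residuals) / len(residuals)
    return math.sqrt(sum((r - mean_r) ** 2 for r in residuals) / len(residuals))


# Synthetic, noise-free clusters generated from the VP-VP values of Table 3.3.
delays_us = [0.1, 0.3, 0.5, 0.7, 0.9]
powers_db = [-3.9 - 8.9 * t for t in delays_us]

a_hat, b_hat = fit_power_delay(delays_us, powers_db)
sigma_nu = residual_std(delays_us, powers_db, a_hat, b_hat)
```

With measured cluster powers in place of the synthetic values, `sigma_nu` would play the role of the standard deviation reported in the text.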



3.3 MIMO CHANNELS IN MICRO- AND PICOCELL ENVIRONMENT 
(1.71/2.05 GHz) 

The model presented in this section is based upon data collected in both picocell and
microcell environments [43]. The stochastic model has also been used to investigate the
capacity of MIMO radio channels, considering two different power allocation strategies
(water filling and uniform) and two different antenna topologies (4 x 4 and 2 x 4). It will
be demonstrated that the space diversity used at both ends of the MIMO radio link is an
efficient technique in picocell environments, achieving capacities between 14 and 16 b/s/Hz
in 80 % of the cases for a 4 x 4 antenna configuration implementing water filling at a
signal-to-noise ratio (SNR) of 20 dB.
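The difference between the two power allocation strategies mentioned above can be illustrated with a minimal sketch. It assumes that the MIMO channel has already been decomposed into parallel eigenmodes with known gain-to-noise ratios. The gain values below are hypothetical and the resulting capacities are not the measured results reported in this section.

```python
import math


def water_filling(gains, total_power):
    """Per-mode powers p_k = max(0, mu - 1/g_k) with sum(p_k) = total_power."""
    order = sorted(range(len(gains)), key=lambda k: gains[k], reverse=True)
    for active in range(len(gains), 0, -1):
        used = order[:active]
        mu = (total_power + sum(1.0 / gains[k] for k in used)) / active
        if mu - 1.0 / gains[used[-1]] > 0.0:  # weakest active mode still gets power
            break
    powers = [0.0] * len(gains)
    for k in used:
        powers[k] = mu - 1.0 / gains[k]
    return powers


def capacity(gains, powers):
    """Sum capacity in b/s/Hz of parallel modes with the given power split."""
    return sum(math.log2(1.0 + g * p) for g, p in zip(gains, powers))


# Hypothetical eigenmode gain-to-noise ratios of a 4 x 4 channel; the total
# power of 100 corresponds to an SNR of 20 dB for unit noise power.
gains = [4.0, 1.0, 0.1, 0.01]
wf_powers = water_filling(gains, 100.0)
uniform_powers = [25.0] * 4

c_wf = capacity(gains, wf_powers)
c_uniform = capacity(gains, uniform_powers)
```

In this example the weakest mode receives no power under water filling, and the resulting capacity is at least as large as with the uniform split.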
Table 3.5 Averaged class powers of RX1, RX2 and RX3

| RX | Class | Horizontal power, percentage of total power | Vertical power, percentage of total power |
|---|---|---|---|
| 1 | 1 | | |
| 1 | 2 | 2.4% | 3.8% |
| 1 | 3 | 1.1% | 0.4% |
| 2 | 1 | 93.5% | 97.2% |
| 2 | 2 | 4.0% | 2.7% |
| 2 | 3 | 2.5% | 0.1% |
| 3 | 1 | 46.7% | 78.0% |
| 3 | 2 | 37.2% | 12.8% |
| 3 | 3 | 16.0% | 9.2% |




The basic parameters of the measurement set-up are shown in Figure 3.4. The following
notation is used in the figure: $d_{MS-BS}$ stands for the distance between MS and BS, $h_{BS}$ for the
height of the BS above ground floor, and AS for azimuth spread [43].

Figure 3.4 Functional sketch of the MIMO model. (Reproduced by permission of IEEE
[43].)

The vector of received signals at the BS can be represented as $\mathbf{y}(t) = [y_1(t), y_2(t), \ldots, y_M(t)]^T$,
where $y_m(t)$ is the signal at the $m$th antenna port and $[\cdot]^T$ denotes transposition.
Similarly, the signals at the MS are $\mathbf{s}(t) = [s_1(t), s_2(t), \ldots, s_N(t)]^T$. The narrowband (NB)
MIMO radio channel $\mathbf{H} \in \mathbb{C}^{M \times N}$, which describes the connection between the MS and
the BS, can be expressed as [1]:

$$\mathbf{H} = \begin{bmatrix} \alpha_{11} & \alpha_{12} & \cdots & \alpha_{1N} \\ \alpha_{21} & \alpha_{22} & \cdots & \alpha_{2N} \\ \vdots & \vdots & \ddots & \vdots \\ \alpha_{M1} & \alpha_{M2} & \cdots & \alpha_{MN} \end{bmatrix} \tag{3.14}$$



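where $\alpha_{mn}$ is the complex transmission coefficient from antenna $n$ at the MS to antenna $m$ at the
BS. For simplicity, it is assumed that $\alpha_{mn}$ is complex Gaussian distributed with identical
average power. However, this latter assumption can easily be relaxed. Thus, the relation
between the vectors $\mathbf{y}(t)$ and $\mathbf{s}(t)$ can be expressed as

$$\mathbf{y}(t) = \mathbf{H}(t)\, \mathbf{s}(t) \tag{3.15}$$

In the sequel we will use the following correlations:

$$\rho^{BS}_{m_1 m_2} = \left\langle |\alpha_{m_1 n}|^2, |\alpha_{m_2 n}|^2 \right\rangle \tag{3.16}$$

$$\rho^{MS}_{n_1 n_2} = \left\langle |\alpha_{m n_1}|^2, |\alpha_{m n_2}|^2 \right\rangle \tag{3.17}$$

where $\langle a, b \rangle$ denotes the correlation coefficient between $a$ and $b$, and

$$\mathbf{R}_{BS} = \begin{bmatrix} \rho^{BS}_{11} & \rho^{BS}_{12} & \cdots & \rho^{BS}_{1M} \\ \rho^{BS}_{21} & \rho^{BS}_{22} & \cdots & \rho^{BS}_{2M} \\ \vdots & \vdots & \ddots & \vdots \\ \rho^{BS}_{M1} & \rho^{BS}_{M2} & \cdots & \rho^{BS}_{MM} \end{bmatrix}_{M \times M} \tag{3.18}$$

$$\mathbf{R}_{MS} = \begin{bmatrix} \rho^{MS}_{11} & \rho^{MS}_{12} & \cdots & \rho^{MS}_{1N} \\ \rho^{MS}_{21} & \rho^{MS}_{22} & \cdots & \rho^{MS}_{2N} \\ \vdots & \vdots & \ddots & \vdots \\ \rho^{MS}_{N1} & \rho^{MS}_{N2} & \cdots & \rho^{MS}_{NN} \end{bmatrix}_{N \times N} \tag{3.19}$$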



The correlation coefficient between two arbitrary transmission coefficients connecting two 
different sets of antennas is expressed as 



$$\rho^{n_1 m_1}_{n_2 m_2} = \langle \alpha_{m_1 n_1}, \alpha_{m_2 n_2} \rangle \tag{3.20a}$$

which is equivalent to

$$\rho^{n_1 m_1}_{n_2 m_2} = \rho^{MS}_{n_1 n_2}\, \rho^{BS}_{m_1 m_2} \tag{3.20b}$$

provided that Equations (3.16) and (3.17) are independent of n and m, respectively. In
other words, this means that the spatial correlation matrix of the MIMO radio channel is
the Kronecker product of the spatial correlation matrices at the MS and the BS and is given
by

$$R_{MIMO} = R_{MS} \otimes R_{BS} \tag{3.21}$$

where $\otimes$ represents the Kronecker product. This has also been confirmed in Yu et al. [44].
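The Kronecker structure of Equation (3.21) can be illustrated with a few lines of code. The following is only a minimal sketch: the function name `kron` and the two 2 × 2 correlation matrices are hypothetical illustration values, not measured data, and the row/column ordering assumes that the MS index varies slowest.

```python
def kron(a, b):
    """Kronecker product of two matrices given as lists of rows."""
    return [
        [a_ij * b_kl for a_ij in a_row for b_kl in b_row]
        for a_row in a
        for b_row in b
    ]


# Hypothetical correlation matrices (N = 2 MS antennas, M = 2 BS antennas).
r_ms = [[1.0, 0.3], [0.3, 1.0]]
r_bs = [[1.0, 0.8], [0.8, 1.0]]

# Spatial correlation matrix of the MIMO channel, size (M*N) x (M*N).
r_mimo = kron(r_ms, r_bs)
for row in r_mimo:
    print(row)
```

Each entry of `r_mimo` is the product of one MS correlation and one BS correlation, which is exactly the statement of Equation (3.20b).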
3.3.1 Measurement set-ups

The TX is at the MS and the stationary RX is located at the BS. The two set-ups from
Figure 3.4 provide measurement results with different correlation properties of the MIMO
channel for small antenna spacings of the order of 0.5λ or 1.5λ. The BS consists of
four parallel RX channels. The sounding signal is an MSK-modulated linear shift register
sequence of a length of 127 chips, clocked at a chip rate of 4.096 Mcps. At the RX, the
channel sounding is performed within a window of 14.6 μs, with a sampling resolution of
122 ns (half-chip period) to obtain an estimate of the complex IR. The narrowband (NB)
information is subsequently extracted by averaging the complex delayed signal components.
A more thorough description of the stand-alone testbed (i.e. RX and TX) is documented in
References [23, 45]. The description of the measurement environments is summarized in
Table 3.6.

A total of 107 paths were investigated within these seven environments. The first measure- 
ment set-up was used to investigate 15 paths in a microcell environment, i.e. environment 
A in Table 3.6. The MS was positioned in different locations inside a building while the 
BS was mounted on a crane and elevated above roof-top level (i.e. 9 m) to provide direct 
line-of-sight to the building. The antenna was located 300 m away from the building. The 
second set-up was used to investigate 92 paths for both microcell and picocell environments, 



Table 3.6 Summary and description of the different measured environments (reproduced
by permission of IEEE [43])

Cell type   Environment   MS locations   Measurement set-up   Description
Microcell   A             15             1st                  The indoor environment consists of small offices with
                                                              windows metallically shielded; ~300 m between MS and BS
            B             13             2nd                  The indoor environment consists of small offices;
                                                              ~31-36 m between MS and BS
Picocell    C             21             2nd                  The indoor environment is the same as in A
            D             12             2nd                  Reception hall - large open area
            E             18             2nd                  Modern open office with windows metallically shielded
            F             16             2nd                  The indoor environment is the same as in B
            G             12             2nd                  Airport - very large indoor open area
i.e. environments B and C-G, respectively, as shown in Table 3.6. The distance between the
BS and the MS was 31-36 m for microcell B, with the BS located outside.



3.3.2 The eigenanalysis method 

The eigenvalue decomposition (EVD) of the instantaneous correlation matrix $R = HH^H$ (not
to be confused with $R_{MIMO}$), where $[\cdot]^H$ represents Hermitian transposition, can serve as
a benchmark of the validation process. The channel matrix H may offer K parallel subchannels
with different mean gains, with $K = \mathrm{Rank}(R) \le \min(M, N)$, where the functions
Rank(·) and min(·) return the rank of the matrix and the minimum value of the arguments,
respectively [27]. The kth eigenvalue can be interpreted as the power gain of the kth
subchannel [27]. In the following, $\lambda_k$ represents the kth eigenvalue.



3.3.3 Definition of the power allocation schemes 

In the situation where the channel is known at both TX and RX and is used to compute the 
optimum weight, the power gain in the kth subchannel is given by the kth eigenvalue, i.e. 
the SNR for the kth subchannel equals 

$$\gamma_k = \frac{P_k \lambda_k}{\sigma_N^2} \tag{3.22}$$

where $P_k$ is the power assigned to the kth subchannel, $\lambda_k$ is the kth eigenvalue and $\sigma_N^2$
is the noise power. For simplicity, it is assumed that $\sigma_N^2 = 1$. According to Shannon, the
maximum capacity normalized with respect to the bandwidth (i.e. the spectral efficiency,
given in b/s/Hz) of parallel subchannels equals [46]

$$C = \sum_{k=1}^{K} \log_2 (1 + \gamma_k) \tag{3.23}$$

$$\phantom{C} = \sum_{k=1}^{K} \log_2 \left(1 + \frac{\lambda_k P_k}{\sigma_N^2}\right) \tag{3.24}$$

where the mean SNR is defined as

$$\mathrm{SNR} = \frac{E[P_{RX}]}{\sigma_N^2} = \frac{E[P_{TX}]}{\sigma_N^2} \tag{3.25}$$

Given the set of eigenvalues $\{\lambda_k\}$, the power $P_k$ allocated to each subchannel k was
determined to maximize the capacity using Gallager's water filling theorem [27] such that each
subchannel was filled up to a common level D, i.e.

$$\frac{1}{\lambda_1} + P_1 = \cdots = \frac{1}{\lambda_k} + P_k = \cdots = D \tag{3.26}$$

with a constraint on the total TX power such that

$$\sum_{k=1}^{K} P_k = P_{TX} \tag{3.27}$$



Figure 3.5 Capacity (10 % level) vs SNR (dB) for example 1 (picocell, decorrelated) and example
2 (microcell, correlated). The water filling power allocation scheme.

where $P_{TX}$ is the total transmitted power. This means that the subchannel with the highest
gain was allocated the largest amount of power. In the case where $1/\lambda_k > D$, then $P_k = 0$.
When the uniform power allocation scheme is employed, the power $P_k$ is adjusted according
to $P_1 = \cdots = P_K$. Thus, in the situation where the channel is unknown, the uniform
distribution of the power is applicable over the antennas [27] so that the power should be
equally distributed between the N elements of the array at the TX, i.e.

$$P_n = \frac{P_{TX}}{N}, \quad \forall n = 1 \ldots N \tag{3.28}$$

Some results are given in Figure 3.5. 
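To make the two power allocation schemes concrete, the sketch below fills the subchannels up to a common level D, drops subchannels with $1/\lambda_k > D$, and compares the resulting capacity with that of a uniform allocation. It is an illustration under stated assumptions only: the function names and the eigenvalues are hypothetical, $\sigma_N^2 = 1$ is assumed, and the uniform scheme is applied over the K subchannels rather than over the N antennas.

```python
import math


def water_filling(eigenvalues, p_total):
    """Return (sorted eigenvalues, powers) with 1/lambda_k + P_k = D on active subchannels."""
    lam = sorted((l for l in eigenvalues if l > 0.0), reverse=True)
    if not lam:
        raise ValueError("need at least one positive eigenvalue")
    for k in range(len(lam), 0, -1):
        # Water level D if only the k strongest subchannels are active.
        level = (p_total + sum(1.0 / l for l in lam[:k])) / k
        if level > 1.0 / lam[k - 1]:
            break
    # Subchannels with 1/lambda_k >= D receive no power.
    return lam, [max(level - 1.0 / l, 0.0) for l in lam]


def capacity(lam, powers):
    """Sum of log2(1 + lambda_k * P_k), assuming unit noise power."""
    return sum(math.log2(1.0 + l * p) for l, p in zip(lam, powers))


# Hypothetical eigenvalues and total TX power.
lam, p_wf = water_filling([4.0, 1.0, 0.1], p_total=1.0)
p_uniform = [1.0 / len(lam)] * len(lam)

c_wf = capacity(lam, p_wf)
c_uniform = capacity(lam, p_uniform)
print(p_wf, c_wf, c_uniform)
```

In this example the weakest subchannel is switched off, and the strongest one receives most of the power, in line with the remark following Equation (3.27).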



3.4 OUTDOOR MOBILE CHANNEL (5.3 GHz) 

In this section we discuss the mobile channel at 5.3 GHz. The discussion is based on 
measurement results collected at six different sites [47] . Site A is an example of a dense urban 
environment; the transmitting antenna was about 45 m above ground level, representing a 
case with the BS antenna over rooftops. Site B is a dense urban residential environment. 
Here the transmitting antenna was placed at a mast with a height of 4 m, which is a typical 
case with the BS antenna lower than rooftops. The measurement routes for this site are 
shown in Figure 3.6(a). The receiving antenna mobile station was at a height of 2.5 m on 
top of a car for both of the sites mentioned above. Site C is located in a typical city center. 
The goal was to place the transmitter at some elevation relative to ground, but still keep 
it below the rooftops. The transmitting antenna was placed at a height of 12 m and the 
receiving antenna was on top of a trolley with a height of 2 m above ground level. In site
C, the rotation was measured using a directive horn antenna. The 3 dB beamwidth of the 
Figure 3.6 (a) Measurement routes for site B with TX height of 4 m. (b) Rotation measurements
in an urban environment. (Reproduced by permission of IEEE [47].)



horn antenna was 30° in the H-plane and 37° in the E-plane and the peak sidelobe level was
26 dB. The specific environment for rotation measurements is shown in Figure 3.6(b). Site
D represents a semiurban/semirural residential area. The three-story buildings are the tallest
ones around, and the transmitting antenna was placed over rooftops at a height of 12 m
from ground level. Site E was selected to represent the rural case. The transmitting antenna
was placed on top of a 5 m mast at the hilltop so that the antenna was about 55 m above
the surrounding area. Site F represents a typical semiurban/urban case. The transmitter was
placed on top of a 5 m mast. The receiving antenna was always on top of a car at a height of
2.5 m. The routes were measured using the wideband channel sounder. System parameters
are summarized in Table 3.7.



Table 3.7 System configuration for mobile measurements in urban (U),
suburban (S) and rural (R) areas

Receiver                     Direct sampling/5.3 GHz
Transmitter power            30 dBm
Chip frequency               30 MHz
Delay range                  4.233 μs
Doppler range                124 Hz (U), 62 Hz (S, R)
Measurement rate             248 sets/s (U), 124 sets/s (S, R)
Sampling frequency           120 Ms/s
IR/wavelength                5 (U), 4.2 (S, R)
Receiver velocity            2.80 m/s (U), 1.67 m/s (S, R)
Antennas and polarization    Omni-directional antenna with 1 dBi gain; vertical polarization
Table 3.8 Path loss models for urban environments

               TX height: 4 m            TX height: 12 m           TX height: 45 m
Urban models   n     b (dB)   Std (dB)   n     b (dB)   Std (dB)   n     b (dB)   Std (dB)
LOS            1.4   58.6     3.7        2.5   35.8     2.9        3.5   16.7     4.6
NLOS           2.8   50.6     4.4        4.5   20.0     1.7        5.8   -16.9    2.8



Table 3.9 Path loss models for suburban and rural environments

                                       Suburban: line-of-sight (TX height: 5 m);
         Rural (TX height: 55 m)       no line-of-sight (TX height: 12 m)
Models   n     b (dB)   Std (dB)       n     b (dB)   Std (dB)
LOS      3.3   21.8     3.7            2.5   38.0     4.9
NLOS     5.9   -27.8    1.9            3.4   25.6     2.8



3.4.1 Path loss models 

The following model is used for path loss

$$PL\,(\mathrm{dB}) = b + 10\, n \log_{10}(d/d_0) \tag{3.29}$$

where $d_0 = 1$ m, n is the attenuation exponent, b is the intercept point in the semilog coordinate,
and d (in m) is the distance from receiver to transmitter. The measurement distances are about
30-300 m in this case. Some results are given in Tables 3.8 and 3.9. The measurement
results for delays are summarized in Table 3.10.
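As a quick numerical illustration of Equation (3.29), the sketch below evaluates the model at d = 100 m using the urban parameters for a TX height of 4 m from Table 3.8. The function and variable names are our own, and $d_0 = 1$ m is assumed as in the text.

```python
import math

# (n, b in dB) from Table 3.8, urban environment, TX height 4 m.
URBAN_TX4 = {"LOS": (1.4, 58.6), "NLOS": (2.8, 50.6)}


def path_loss_db(d_m, n, b_db, d0_m=1.0):
    """Path loss in dB following PL = b + 10 n log10(d / d0)."""
    return b_db + 10.0 * n * math.log10(d_m / d0_m)


pl_los = path_loss_db(100.0, *URBAN_TX4["LOS"])
pl_nlos = path_loss_db(100.0, *URBAN_TX4["NLOS"])
print(pl_los, pl_nlos)
```

At 100 m the NLOS loss exceeds the LOS loss by 20 dB for this parameter set, which comes from the difference in the attenuation exponent despite the lower NLOS intercept.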



3.4.2 Path number distribution 

The multipath number distribution was regarded as Poisson's and modified Poisson's dis- 
tributions in Suzuki [48] and modified Poisson's distribution has been shown to have good 
agreement with the experimental results in some cases. However, the modified Poisson's 
distribution does not have an explicit expression, but just a process. Therefore, it is not 
convenient for practical use. In Zhao et al. [47], another simple and useful path number 
distribution was suggested by considering that the path number variation of radio waves in 
land mobile communications is a Markov process at finite state space, and it was shown to 
have good agreement with the experimental results. The path number distributions given 
by Poisson and Gao can be expressed as 

$$P(N) = \frac{\eta^{N_T - N}}{(N_T - N)!}\, e^{-\eta} \tag{3.30}$$

$$P(N) = C_{N_T}^{N}\, \frac{\eta^{N_T - N}}{(1 + \eta)^{N_T}} \tag{3.31}$$
Table 3.10 Measured values for mean excess delay and rms delay spread



( ) Tx height in meters Urban Suburban Rural 



Mean excess delay (ns) 




JO 


(4) 










LOS 




42 




36 




29 


(551 






102 


(45) 










NLOS 




70 


(4) 


68 


(12) 






rms delay spread (ns) 




A A 

44 


(4) 










IVlean 


LOS 


43 




25 


wj 


22 








88 


(45) 












NLOS 


44 


(4) 


66 


(12) 










ZD 


(4) 










Median 






(12) 




(5) 


1 c 

1 J 


P5) 






86 


(45) 












NLOS 


37 


(4) 


63 


(12) 






CDF 




93 


(4) 










<90% 


LOS 


64 


(12) 


57 


(5) 


44 


(55) 






120 


(45) 












NLOS 


63 


(4) 


105 


(12) 







where N is the variable (the path number) and C denotes the combination (binomial coefficient).
$N_T$ is the maximum number of paths that the mobile can receive. The parameters $\eta$ and
$N_T$ can be fitted to the experimental data. For Poisson's probability density function (PDF),
the mean path number is $\langle N \rangle = \eta$. For Gao's PDF, the mean value is
$\langle N \rangle = N_T/(1 + \eta)$. The empirical path number distributions for
the outdoor measurements are fitted using Equations (3.30) and (3.31), respectively. The
path numbers are obtained from measured data by counting the peaks of the power delay
profiles. The best fit is obtained by minimizing the following standard deviation:



$$\mathrm{Std} = \sqrt{\frac{1}{N_T} \sum_{i} \left(p_e^i - p^i\right)^2} \tag{3.32}$$

where $p_e^i$ is the experimental probability corresponding to path number i and $p^i$ is the
fitted probability using Equations (3.30) and (3.31). The fitted parameters are available
in Table 3.11. If the dynamic range is cut at different levels, for example, −25, −20 and
−15 dB, the fitting parameters in Equations (3.30) and (3.31) will be changed, but the path
number distributions still follow Poisson's and Gao's distributions.
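The relation between Gao's distribution and its mean can be checked numerically. The sketch below reads Equation (3.31) as a binomial law with success probability $1/(1 + \eta)$, which is an interpretation on our part, and uses the urban LOS parameters for a TX height of 4 m from Table 3.11. The helper `fit_std` is a hypothetical illustration of the fitting criterion; its normalization by the number of points is an assumption.

```python
import math


def gao_pmf(n, n_t, eta):
    """P(N) = C(N_T, N) * eta**(N_T - N) / (1 + eta)**N_T."""
    return math.comb(n_t, n) * eta ** (n_t - n) / (1.0 + eta) ** n_t


def fit_std(p_exp, p_fit):
    """RMS deviation between experimental and fitted probabilities (assumed normalization)."""
    return math.sqrt(sum((pe - pf) ** 2 for pe, pf in zip(p_exp, p_fit)) / len(p_exp))


n_t, eta = 16, 4.7  # urban, TX 4 m, LOS (Table 3.11)
pmf = [gao_pmf(n, n_t, eta) for n in range(n_t + 1)]
mean_n = sum(n * p for n, p in enumerate(pmf))
print(round(mean_n, 2), n_t / (1.0 + eta))
```

The computed mean agrees with $N_T/(1 + \eta) \approx 2.8$, the value listed for this case in Table 3.11.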

3.4.3 Rotation measurements in an urban environment 

The rotation measurements at points P1 and P2 were performed at site C, shown in
Figure 3.6(b). The transmitter height was 12 m and the receiver was on a rotating stand at a
height of 1.6 m, with a large open square close to the receiver. In the experiments, large
Table 3.11 Path number distributions for outdoor environments

                           Urban                              Suburban         Rural
                           TX 4 m         TX 12 m   TX 45 m   TX 12 m          TX 55 m
                           LOS     NLOS   LOS       LOS       LOS     NLOS     LOS
$\eta$      Poisson        2.8     4.2    3.3       6.0       1.2     4.5      1.8
            Gao            4.7     4.5    3.5       2.7       9.0     3.3      4.0
$N_T$                      16      21     14        22        13      20       9
$\langle N \rangle$  Poisson      2.8     4.2    3.3       6.0       1.2     4.5      1.8
            Gao            2.8     3.8    3.2       6.0       1.3     4.7      1.8
            Experiment     3.4     4.2    3.5       6.2       2.4     5.0      1.7



Figure 3.7 RMS delay spread with different rotation angles in the azimuth plane.

excess delays up to 1.2 μs and rms delay spread, shown in Figure 3.7, of about 0.42 μs are
found.

The power angular profiles (PAPs) $P_r(\phi)$ of the measurements were calculated using
the maximal ratio combining algorithm in the delay domain [50]:

$$P_r(\phi) = \alpha_{cal} \int_{\tau_{min}}^{\tau_{max}} |h(\tau, \phi)|^2 \, d\tau \tag{3.33}$$

where $\alpha_{cal}$ is a factor which is obtained from the calibration measurement with a cable and
an attenuator, $\tau_{min}$ and $\tau_{max}$ are the delays of the first and last detectable IR components, and
$\phi$ is the angle of arrival of the waves in the azimuth plane. In the rotation measurements,
the dynamic range is cut at −26 dB relative to the strongest path. Some results for angular
profiles of relative received power are shown in Figure 3.8. The number of paths as a function
of azimuth angle is shown in Figure 3.9.
Figure 3.9 Numbers of paths with different rotation angles in the azimuth plane. (Reproduced
by permission of IEEE [47].)

3.5 MICROCELL CHANNEL (8.45 GHz)

In this section spatio-temporal channel characterization in a suburban non-line-of-sight 
microcellular environment is discussed. Figure 3.10 shows a map of the environment under 
consideration [49]. This is a residential area with predominantly wooden houses of 8 m 
average height, considered to be a typical suburban microcellular environment of size 
600 x 600 m 2 . The traffic was very light and the environment was considered to be static 
throughout the experiments. The non-line-of-sight (NLOS) transmitter and receiver shown 
in Figure 3.10 are considered. The distance between the transmitter and the receiver was 
219 m. As a transmitter antenna, simulating the MS, a vertically polarized omnidirectional 
half-wave sleeve dipole was set at a height of 2.7 m. At the receiving point, corresponding 
to the BS, the azimuth-delay profile was measured. This was done by rotating a vertically 
polarized parabolic antenna which had azimuth and elevation beamwidths of about 4°, at
an azimuth step of 4°, and was set at a height of 4.4 m. At each azimuth step, the delay
profile was measured using a delay profile measurement system described in References
[49, 50]. At a center frequency of 8.45 GHz and with a chip rate of 50 Mcps, a seven-stage
M sequence with a dynamic range of 42 dB was transmitted. A correlation receiver was
used to produce the delay profile. 

The electric parameters used in the ray-tracing simulation are presented in Table 3.12. The 
wooden houses are modeled as concrete, as the surfaces of the houses were covered with
concrete-like paint. To compare with the experimental results, the directivity of the antenna 
and the autocorrelation of the pseudo-random noise (PN) sequence were convolved with 
the result of the ray-tracing simulation. Therefore, the path gain includes the antenna gain. 
Figure 3.10 The microcellular environment. (Reproduced by permission of IEEE [49].)

Table 3.12 Electrical parameters used in the ray-tracing simulation

                 $\varepsilon_r$     $\sigma$ (S/m)
Concrete [16]    5.5     0.023
Foliage [15]     1.2     0.0003
Ground [18]      15.0    1.3
Metal            -       ∞
3.5.1 Azimuth profile

Azimuth profiles are obtained by summing up the power of azimuth-delay profiles with 
respect to the delay time. The forward arrival waves within the range from −40° to 44° were
observed in both experimental and simulation data sets. The experimental profile had a floor 
level of about 30 dB below the peak, but this level was low enough so that its effect on the 
transmission property was negligibly small. 

3.5.2 Delay profile for the forward arrival waves 

For the forward arrival waves (−40° to 44°), the delay profiles are obtained by summing up
the azimuth-delay profiles with respect to the azimuth. The experimental result exhibits an
exponential decay. The result of least squares fitting can be expressed as

$$P(\tau) = -0.038\,\tau - 28.6 \tag{3.34}$$

where P is the path gain in dB, and $\tau$ is the delay time in nanoseconds.
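Since Equation (3.34) is linear in dB, it corresponds to an exponential decay on a linear power scale. The short sketch below converts the fitted slope into the equivalent decay time constant $\tau_0$, defined here (as an assumption of this example) through $P_{lin}(\tau) \propto \exp(-\tau/\tau_0)$.

```python
import math

SLOPE_DB_PER_NS = -0.038  # slope of Equation (3.34)
INTERCEPT_DB = -28.6      # intercept of Equation (3.34)


def path_gain_db(tau_ns):
    """Fitted path gain in dB at delay tau (in ns)."""
    return SLOPE_DB_PER_NS * tau_ns + INTERCEPT_DB


# 10*log10(exp(-tau/tau0)) = slope * tau  =>  tau0 = -10 / (slope * ln 10)
tau0_ns = -10.0 / (SLOPE_DB_PER_NS * math.log(10.0))
print(path_gain_db(100.0), tau0_ns)
```

With the fitted slope, the power falls by a factor of e roughly every 114 ns.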

3.5.3 Short-term azimuth spread for forward arrival waves 

The short-term AS for the forward arrival waves, $\sigma_\phi$, is defined as

$$\sigma_\phi(\tau) = \sqrt{\langle \phi^2(\tau) \rangle - \langle \phi(\tau) \rangle^2} \tag{3.35}$$

where $\langle \cdot \rangle$ is the average of $\phi$ weighted by the power for a fixed delay time, $\tau$. At each
delay time, the threshold level is set at 30 dB below the peak in the profile in order to
calculate the short-term AS in Equation (3.35). The CDF of the short-term AS is shown in
Figure 3.11. The solid line indicates the experimental result and the dotted line indicates
the simulation result. It is noted that the simulation result is obtained within a range of
delay times from 700 to 880 ns. Although the distribution functions of the experiment
and the simulation look slightly different, their averages and their standard deviations are
in agreement. A Gaussian distribution has been used as an approximation in the
figure.

Figure 3.11 The CDF of the short-term AS (in degrees) for forward arrival waves. Solid line:
experiment; dotted line: simulation. (Reproduced by permission of IEEE [49].)
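The definition in Equation (3.35), together with the 30 dB threshold, can be sketched as follows. The function name and the four-component azimuth profile are hypothetical and serve only to show the power weighting and the thresholding; powers are assumed to be given on a linear scale.

```python
import math


def short_term_as(angles_deg, powers, threshold_db=30.0):
    """Power-weighted azimuth spread at one delay, ignoring weak components."""
    floor = max(powers) * 10.0 ** (-threshold_db / 10.0)
    kept = [(a, p) for a, p in zip(angles_deg, powers) if p >= floor]
    total = sum(p for _, p in kept)
    mean = sum(p * a for a, p in kept) / total
    mean_sq = sum(p * a * a for a, p in kept) / total
    # Guard against tiny negative values caused by rounding.
    return math.sqrt(max(mean_sq - mean * mean, 0.0))


# Hypothetical profile: the component at 40 deg lies more than 30 dB below the peak.
as_deg = short_term_as([-10.0, 0.0, 10.0, 40.0], [1.0, 2.0, 1.0, 1e-4])
print(as_deg)
```

Because the weak component at 40° is removed by the threshold, the result is determined by the three strong components only.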



3.6 WIRELESS MIMO LAN ENVIRONMENTS (5.2 GHz) 

The presentation in this section is based on results of a measurement campaign [52]
in two courtyards in the 5.2 GHz band assigned for wireless LANs [e.g. HIPERLAN
(see www.etsi.org) or IEEE 802.11a]. These standards specify wireless communication
between computers, which is a compelling application for MIMO systems. For the measurements,
a channel sounder with a bandwidth of 120 MHz, connected via a fast RF switch to a uniform
linear receiver antenna array, was used. This array consisted of $N_R = 8$ antenna elements
(±60° element beamwidth), plus two dummy elements at each end of the array. All these
components together constituted a single-directional channel sounder. A virtual array at the
transmitter consisted of a monopole antenna mounted on an X-Y-positioning device with
stepping motors.

The experiment started by positioning a transmit antenna at a certain position. At the 
receiver, the RF switch was connected to the first antenna element of the array, so that the 
transfer function (measured at 192 frequency samples) from the first transmit to the first 
receive element of the array was measured. Then, the switch was connected to the next 
receive antenna element, and the next transfer function was measured. The measurement 
of all those transfer functions was repeated 256 times, in order to assess the time variance 
of the channel (see below). Then, the transmit antenna was moved to the next position, 
and the procedure was repeated. $N_T = 16$ transmit antenna positions were used, situated
on a cross (i.e. eight positions on each axis of the cross), and bursts of complex channel
transfer functions were recorded. Any virtual array requires that the channel remains static
during the measurement period. One complete measurement run (2 × 8 antenna positions
at TX × 8 spatial samples at RX × 192 frequency samples × 256 temporal samples gives
16 × 8 × 192 × 256 = 6 291 456 complex samples) took about 5 min.

3.6.1 Data evaluation 

Starting from the four-dimensional transfer function (time, frequency, position of RX antenna,
position of TX antenna), the Doppler-variant transfer function is computed first by Fourier
transforming the 256 temporal samples (with Hanning windowing). Next, all
components that do not exhibit zero Doppler shift are eliminated (Doppler filtering). Those
components correspond, for example, to multipath components (MPCs) scattered by leaves
moving in the wind. The eliminated components carry on the order of 1 % of the total energy.
The three-dimensional (static) transfer function obtained in that way is then evaluated by
Unitary ESPRIT [34] to estimate the delays, $\tau_i$. Unitary ESPRIT is an improved version of the
classical ESPRIT algorithm discussed in Glisic [1]. They both estimate the signal subspace for
extraction of the parameters of (spatial or frequency) harmonics in additive noise. One
important step in ESPRIT is the estimation of the model order. Different methods have been
proposed in the literature for that task. The relative power decrease between neighboring
eigenvalues, with additional correction by visual inspection of the scree graph showing the
eigenvalues, is the option used for generating the results presented in the sequel.

After estimation of the parameters $\tau_i$, we can determine the corresponding 'steering'
matrix, $A_\tau$. Subsequent beamforming with its Moore-Penrose pseudoinverse [34,53-56]
$A_\tau^+$ gives the vector of delay weights for all $x_R$, $x_T$

$$h_\tau(x_T, x_R) = A_\tau^+ \, T_f(x_T, x_R) \tag{3.36}$$

where $T_f$ is the vector of transfer coefficients at the 192 frequency sub-bands sounded.
This gives us the transfer coefficients from all positions $x_T$ to all positions $x_R$ separately
for each delay $\tau_i$. Thus, one dimension, namely the frequency, has been replaced by the
parameterized version of its dual, the delays.

For the estimation of the directions of arrival (DOA) in each of the two-dimensional
transfer functions, ESPRIT estimation and beamforming by the pseudoinverse are used

$$h_{\phi_R}(\tau_i, x_T) = A_{\phi_R}^+ \, h_{x_R}(\tau_i, x_T) \tag{3.37}$$

Finally, for the direction of departure (DOD) we have

$$h_{\phi_T}(\tau_i, \phi_{R,i,j}) = A_{\phi_T}^+ \, h_{x_T}(\tau_i, \phi_{R,i,j}) \tag{3.38}$$

Figure 3.12 illustrates these steps.

The procedure gives us the number and parameters of the MPCs, i.e. the number and
values of delays, which DOAs can be observed at these delays and which DOD corresponds
to each DOA at a specific delay. Furthermore, we also obtain the powers of the multipath
components (MPCs). One important point in the application of the sequential estimation
procedure is the sequence in which the evaluation is performed. Roughly speaking, the
number of MPCs that can be estimated is the number of samples we have at our disposal.



Figure 3.12 Sequential estimation of the parametric channel response in the different domains:
alternating estimation and beamforming. (Reproduced by permission of IEEE [52].)

3.6.2 Capacity computation

In a fading channel, the capacity is a random variable, depending on the local (or instan- 
taneous) channel realization. In order to determine the cdf of the capacity, and thus the 
outage capacity, we would have to perform a large number of measurements either with 
slightly displaced arrays, or with temporally varying scatterer arrangement. Since each 
single measurement requires a huge effort, such a procedure is highly undesirable. 

To improve this situation, an evaluation technique that requires only a single measurement 
of the channel is used. This technique relies on the fact that we can generate different 
realizations of the transfer function by changing the phases of the multipath components. It 
is a well-established fact in mobile radio that the phases are uniformly distributed random 
variables, whose different realizations occur as transmitter, receiver or scatterers move [27]. 
We can thus generate different realizations of the transfer function from the mth transmit 
to the kth receive antenna as 



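$$h_{k,m}(f) = \sum_i a_i \exp\left\{-j\frac{2\pi}{\lambda} d \left[k \sin(\phi_{R,i}) + m \sin(\phi_{T,i})\right]\right\}
\times \exp(-j 2\pi f \tau_i) \exp(j\alpha_i) \tag{3.39}$$

where $a_i$ is the amplitude of the ith MPC, d is the antenna element spacing and $\alpha_i$ is a
uniformly distributed random phase, which can take on different values for the
different MPCs numbered i. Note, however, that $\alpha_i$ stays unchanged as we consider different
antenna elements k and m. To simplify discussion, we for now consider only the flat-fading
case, i.e. $\tau_i = 0$. We can thus generate different realizations of the channel matrix H

$$H = \begin{pmatrix}
h_{11} & h_{12} & \cdots & h_{1 N_T} \\
h_{21} & h_{22} & \cdots & h_{2 N_T} \\
\vdots & \vdots & \ddots & \vdots \\
h_{N_R 1} & h_{N_R 2} & \cdots & h_{N_R N_T}
\end{pmatrix} \tag{3.40}$$

by the following two steps: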

(1) From a single measurement, i.e. a single snapshot of the channel matrix, determine
    the DOAs and DODs of the MPCs as described earlier in the section.

(2) Compute synthetically the impulse responses at the positions of the antenna elements,
    and at different frequencies. Create different realizations of one ensemble
    by adding random phase factors (uniformly distributed between 0 and 2π) to each
    MPC. For each channel realization, we can compute the capacity from [97]

    $$C = \log_2 \det\left(I + \frac{\rho}{N_T} H H^H\right) \tag{3.41}$$

    where $\rho$ denotes the SNR, I is the identity matrix and superscript H means Hermitian
    transposition. For the frequency-selective case, we have to evaluate the capacity by
    integrating over all frequencies

    $$C = \int \log_2 \det\left[I + \frac{\rho}{N_T} H^H(f) H(f)\right] df \tag{3.42}$$

    Here, H(f) is the frequency-dependent transfer matrix. The integration range is
    the bandwidth of interest.
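The two steps above can be sketched for the flat-fading case as follows. This is only an illustrative sketch under several assumptions: the MPC amplitudes and angles are hypothetical (they would come from the ESPRIT-based estimation), the element spacing is taken as half a wavelength, no normalization of H is applied, and the determinant is computed with a small hand-written elimination routine so that only the standard library is needed.

```python
import cmath
import math
import random


def channel_matrix(mpcs, n_r, n_t, d_over_lambda, rng):
    """One flat-fading realization of H per Equation (3.39) with tau_i = 0.

    mpcs: list of (amplitude, DOA in rad, DOD in rad), one tuple per MPC.
    """
    h = [[0j] * n_t for _ in range(n_r)]
    for amp, phi_r, phi_t in mpcs:
        # One random phase alpha_i per MPC, shared by all antenna pairs (k, m).
        alpha = rng.uniform(0.0, 2.0 * math.pi)
        for k in range(n_r):
            for m in range(n_t):
                geom = 2.0 * math.pi * d_over_lambda * (
                    k * math.sin(phi_r) + m * math.sin(phi_t))
                h[k][m] += amp * cmath.exp(1j * (alpha - geom))
    return h


def det(a):
    """Determinant by Gaussian elimination with partial pivoting."""
    a = [row[:] for row in a]
    n, result = len(a), 1 + 0j
    for c in range(n):
        p = max(range(c, n), key=lambda r: abs(a[r][c]))
        if abs(a[p][c]) == 0.0:
            return 0j
        if p != c:
            a[c], a[p] = a[p], a[c]
            result = -result
        result *= a[c][c]
        for r in range(c + 1, n):
            f = a[r][c] / a[c][c]
            for j in range(c, n):
                a[r][j] -= f * a[c][j]
    return result


def capacity(h, snr):
    """C = log2 det(I + snr/N_T * H H^H), Equation (3.41)."""
    n_r, n_t = len(h), len(h[0])
    g = [[(1.0 if i == j else 0.0)
          + snr / n_t * sum(h[i][m] * h[j][m].conjugate() for m in range(n_t))
          for j in range(n_r)] for i in range(n_r)]
    return math.log2(det(g).real)


rng = random.Random(1)
mpcs = [(1.0, 0.0, 0.0), (0.5, math.radians(30.0), math.radians(-20.0))]  # hypothetical
caps = sorted(capacity(channel_matrix(mpcs, 4, 4, 0.5, rng), snr=100.0)
              for _ in range(200))
outage_10 = caps[len(caps) // 10]  # empirical 10 % outage capacity
print(outage_10)
```

Sorting the capacities of the synthetic realizations gives an empirical CDF, from which an outage capacity can be read off without repeating the measurement.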
3.6.3 Measurement environments

As an example the following scenarios are evaluated with the procedure described above 
[52]: 

• Scenario I - a courtyard with dimensions 26 × 27 m, open on one side. The RX-array
broadside points into the center of the yard; the transmitter is located on the positioning
device 8 m away in LOS.

• Scenario II - closed backyard of size 34 × 40 m with inclined rectangular extension. The
RX-array is situated in one rectangular corner with the array broadside of the linear array 
pointing under 45° inclination directly to the middle of the yard. The LOS connection 
between TX and RX measures 28 m. Many metallic objects are distributed irregularly 
along the building walls (power transformers, air-condition fans, etc.). This environment 
looks very much like the backyard of a factory (Figure 3.13). 

• Scenario III - same closed backyard as in Scenario II but with artificially obstructed LOS 
path. It is expected that the metallic objects generate serious multipath and higher-order 
scattering that can only be observed within the dynamic range of the device if the LOS 
path is obstructed. 

• Scenario IV - same as scenario III but with a different TX position and LOS obstructed.
The TX is situated nearer to the walls. More details about the scenarios can be found in
Steinbauer et al. [57].

Some of the measurement results for these scenarios are presented in Figure 3.14.




Figure 3.13 Geometry of the environment of scenarios II-IV (backyard) in top view
(horizontal axis: X-coordinate in m). Superimposed are the extracted DOAs and DODs for
scenario III. (Reproduced by permission of IEEE [52].)

Figure 3.14 The CDFs of the MIMO channel capacity encountered in scenarios I-IV, and
the CDF for an ideal channel. The SNR is 20 dB, and 4 × 4 antenna elements
were used.



3.7 INDOOR WLAN CHANNEL (17 GHz)



In this section we discuss the indoor radio propagation channel at 17 GHz. The presentation 
is based on results reported in Rubio et al. [58]. Wideband parameters, such as coherence 
bandwidth or rms delay spread, and coverage are analyzed for the design of an OFDM-based 
broadband WLAN. The method used to obtain the channel parameters is based on a simulator 
described in Rubio et al. [58]. This simulator is a site-specific propagation model based on 
three-dimensional (3-D) ray-tracing techniques, which has been specifically developed for 
simulating radio coverage and channel performance in enclosed spaces such as buildings, 
and for urban microcell and picocell calculations. The simulator requires the input of the 
geometric structure and the electromagnetic properties of the propagation environment, 
and is based on a full 3-D implementation of geometric optics and the uniform theory of 
diffraction (GO/UTD). Examples of the measurement environments are given in Figure 3.15.
The results for coherence bandwidth $B_c = 1/(\alpha \tau_{rms})$ are given in Table 3.13 and Figure 3.16.

A further requirement related to the correct and efficient channel estimation process by
the receiver is the selection of a number of subcarriers in OFDM satisfying the condition
of being separated by between approximately $B_c/10$ and $B_c/5$. Results for delay spread are
shown in Figure 3.17 and Tables 3.14-3.17.
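The relation between delay spread, coherence bandwidth and subcarrier separation can be sketched numerically. In the example below, the function names are our own; the value 20.72 MHz is the overall mean $B_c$ from Table 3.13, while the pair ($\tau_{rms}$ = 14.3 ns, $\alpha$ = 2.67) simply combines two tabulated CDF points for illustration and is not claimed to be a jointly measured sample.

```python
def coherence_bandwidth_hz(tau_rms_s, alpha):
    """B_c = 1 / (alpha * tau_rms)."""
    return 1.0 / (alpha * tau_rms_s)


def subcarrier_spacing_bounds_hz(b_c_hz):
    """Separation range between B_c/10 and B_c/5."""
    return b_c_hz / 10.0, b_c_hz / 5.0


# Illustrative combination of tabulated values (see the lead-in for the caveat).
b_c_example = coherence_bandwidth_hz(14.3e-9, 2.67)

# Bounds for the overall mean coherence bandwidth of Table 3.13.
spacing_lo, spacing_hi = subcarrier_spacing_bounds_hz(20.72e6)
print(b_c_example / 1e6, spacing_lo / 1e6, spacing_hi / 1e6)
```

For the mean coherence bandwidth, the separation would thus lie between roughly 2.07 and 4.14 MHz.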

The results for the path loss exponent and K factor are given in Figure 3.18 and Tables
3.18 and 3.19. For channel modeling purposes, the mean power of the received signal
will be represented as

$$P_{Rx}|_{dB} = P_{Tx}|_{dB} + G_{Tx}|_{dB} + G_{Rx}|_{dB} - L_{fs}|_{dB}
+ 10 \cdot \log_{10}\left[\int_0^{\tau_{max}} PDP(t)\, dt\right] \tag{3.43}$$

where $P_{Tx}$ is the mean power at the transmitting antenna input, $G_{Tx}$ is the transmitting
antenna gain while $G_{Rx}$ is the receiving antenna gain. $L_{fs}$ is the free space propagation loss,


Figure 3.15 (a) ETSIIT hall (49 × 26 m); (b) DICOM, floors 2 and 3 (34 × 20 m); (c) office
building (72 × 38 m); 3-D representations 63. (Reproduced by permission of
IEEE [58].)

Table 3.13 $B_c$ at 17 GHz

              Coherence bandwidth (MHz)
Place         Mean      Standard deviation
Hall          24.85     12.35
Floors        14.44     9.85
Building      22.86     10.24
Total         20.72     11.56
Figure 3.16 $B_c$ CDF at 17 GHz. (Reproduced by permission of IEEE [58].)

Figure 3.17 (a) The RMS delay spread CDF ($B_c = 1/(\alpha \tau_{rms})$). (b) Maximum delay CDF, 30
dB criterion. (c) Maximum delay CDF, 20 dB criterion. (d) Alpha CDF.

Table 3.14 The RMS delay spread CDF. (Reproduced by permission of IEEE [58])

CDF value   RDS value
0.2         12.1 ns
0.4         14.3 ns
0.6         17.5 ns
0.8         34.3 ns
1           58.3 ns

RDS, rms delay spread.

Table 3.15 Maximum delay CDF, 30 dB criterion. (Reproduced
by permission of IEEE [58])

CDF value   $\tau_{max}$ value
0.2         62 ns
0.4         76 ns
0.6         101 ns
0.8         122 ns
1           197 ns

Table 3.16 Maximum delay CDF, 20 dB criterion. (Reproduced
by permission of IEEE [58])

CDF value   $\tau_{max}$ value
0.2         51 ns
0.4         56 ns
0.6         69 ns
0.8         94 ns
1           156 ns

Table 3.17 Alpha CDF, $B_c = 1/(\alpha \tau_{rms})$

CDF value   Alpha value
0.2         2.17
0.4         2.67
0.6         3.75
0.8         4.44
1           5.78
Figure 3.18 CDF of path loss exponent n.

Table 3.18 Mean values of n

Type of path    LOS     OLOS    NLOS
n mean value    1.68    2.14    2.61

Table 3.19 Fading statistic over distance, LOS case

Radius (m)   K factor
4            17
5            10
6            9
7            8
8            6
9            5
10           1


given by

$$L_{fs}|_{dB} = 32.45\ \mathrm{dB} + 20 \cdot \log_{10}(d_{km} \cdot f_{MHz})$$

and PDP(t) is the modeled power delay profile. Once the PDP is modeled, to obtain the discrete
channel impulse response, $h_i$, we only have to add a random phase to the square root of
each delay bin amplitude, as follows:

$$h_i = \sqrt{p_i}\, e^{j\phi_i}, \quad \phi_i\ \text{r.v. unif}[0, 2\pi] \tag{3.44}$$

where $h_i$ is the ith bin of the modeled channel impulse response and $p_i$ the modulus of the
ith bin of the modeled power delay profile.
It can be assumed that the phases of different components of the same channel impulse
response are uncorrelated at the frequency of interest (17 GHz), because their relative
path-length differences exceed a wavelength, even for high-resolution models [59]. As the
total bandwidth assigned to the communication is 50 MHz, a bin size of 10 ns must be
selected. Using the 99 % of total power criterion for the maximum duration of the PDP, this
bin size leads to a total of nine taps for the LOS case and 17 for the NLOS
case.
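
As a rough illustration of Equation (3.44), the following Python sketch draws one channel realization from a set of PDP bin powers. Only the 10 ns bin width and the nine-tap LOS length come from the text; the exponentially decaying profile in `pdp` and the function name `cir_from_pdp` are made-up placeholders, not the fitted models of [58].

```python
import numpy as np

def cir_from_pdp(pdp, rng):
    """Return h_i = sqrt(p_i) * exp(j*phi_i) with phi_i uniform on [0, 2*pi)."""
    pdp = np.asarray(pdp, dtype=float)
    phases = rng.uniform(0.0, 2.0 * np.pi, size=pdp.shape)
    return np.sqrt(pdp) * np.exp(1j * phases)

rng = np.random.default_rng(1)
# Hypothetical nine-tap LOS profile (linear power per 10 ns bin).
pdp = np.exp(-np.arange(9) / 2.0)
h = cir_from_pdp(pdp, rng)
delays_ns = 10.0 * np.arange(h.size)  # tap delays for a 10 ns bin size
```

Each call gives a different set of phases, while the tap powers |h_i|^2 always reproduce the PDP bins.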

The statistical variability of the bin amplitudes has been modeled following different
probability density functions. Since the service area of future applications (SOHO, small
office/home office) has small ranges, the variability has been analyzed at medium scale,
that is, the environment is divided into a LOS area and a NLOS area. In the LOS case, a
Fréchet PDF [60] is chosen for the first bin and exponential PDFs for the rest. A continuous
random variable X has a Fréchet distribution if its PDF has the form

$$ f(x;\sigma;k) = \frac{k}{\sigma}\left(\frac{\sigma}{x}\right)^{k+1} \exp\left\{-\left(\frac{\sigma}{x}\right)^{k}\right\}, \qquad x > 0;\ \sigma, k > 0 \tag{3.45} $$

A Fréchet variable X has the CDF

$$ F(x;\sigma;k) = \exp\left\{-\left(\frac{\sigma}{x}\right)^{k}\right\} \tag{3.46} $$

This model has a scale structure, with $\sigma$ a scale parameter and $k$ a shape parameter. A
continuous random variable X has an exponential distribution if its PDF has the form

$$ f(x;\mu;\sigma) = \frac{1}{\sigma} \exp\left\{-\frac{x-\mu}{\sigma}\right\}, \qquad x \ge \mu;\ \sigma > 0 \tag{3.47} $$

This PDF has a location-scale structure, with a location parameter, $\mu$, and a scale
parameter, $\sigma$. The CDF of the exponential variable X is

$$ F(x;\mu;\sigma) = 1 - \exp\left\{-\frac{x-\mu}{\sigma}\right\} \tag{3.48} $$



These PDFs were considered the most suitable after a fitting process. The NLOS case
needs a combination of exponential and Weibull PDFs for the first bin and exponential
PDFs for the others. A continuous random variable X has a Weibull distribution if its PDF
has the form

$$ f(x;\sigma;k) = \frac{k}{\sigma}\left(\frac{x}{\sigma}\right)^{k-1} \exp\left\{-\left(\frac{x}{\sigma}\right)^{k}\right\}, \qquad x > 0;\ \sigma, k > 0 \tag{3.49} $$

while the CDF is

$$ F(x;\sigma;k) = 1 - \exp\left\{-\left(\frac{x}{\sigma}\right)^{k}\right\} \tag{3.50} $$

This model has a scale structure, that is, $\sigma$ is a scale parameter, while $k$ is a shape parameter.
Tables 3.20 and 3.21 show the probability density functions employed for LOS and NLOS
channel models [58].
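
To show how bin amplitudes could be drawn from these three families, the sketch below inverts each CDF in closed form (inverse-transform sampling). The argument names `scale`, `shape` and `loc` stand for the scale, shape and location parameters of the distributions above; the numerical values in the usage lines are arbitrary placeholders, not the fitted values of Tables 3.20 and 3.21.

```python
import numpy as np

def frechet_cdf(x, scale, shape):
    return np.exp(-(scale / x) ** shape)

def sample_frechet(u, scale, shape):
    # Solve u = exp(-(scale/x)**shape) for x.
    return scale * (-np.log(u)) ** (-1.0 / shape)

def weibull_cdf(x, scale, shape):
    return 1.0 - np.exp(-(x / scale) ** shape)

def sample_weibull(u, scale, shape):
    # Solve u = 1 - exp(-(x/scale)**shape) for x.
    return scale * (-np.log1p(-u)) ** (1.0 / shape)

def exponential_cdf(x, loc, scale):
    return 1.0 - np.exp(-(x - loc) / scale)

def sample_exponential(u, loc, scale):
    # Solve u = 1 - exp(-(x - loc)/scale) for x.
    return loc - scale * np.log1p(-u)

rng = np.random.default_rng(0)
u = rng.uniform(0.01, 0.99, size=1000)   # uniform draws, kept away from 0 and 1
x_frechet = sample_frechet(u, scale=2.0, shape=3.0)
x_weibull = sample_weibull(u, scale=2.0, shape=3.0)
x_expon = sample_exponential(u, loc=1.0, scale=2.0)
```

Feeding each sample back through its CDF returns the original uniform draws, which is a quick check that the inversions are consistent with the formulas.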

For both tables, the units of the scale parameters ($\sigma$) are Hz (s^-1), while the shape
parameters $k$ have no units. These units have no direct physical meaning; they make the last
term of Equation (3.43) nondimensional, since that term is a scale factor between free-space
behavior and the real behavior. The mean values of the probability density functions are
large because of the subsequent integration over time (in seconds) and the short PDP
duration (tens of nanoseconds). As expected, the mean value of the first bin is the highest,
since it includes the direct ray (LOS case). Additional
details on the topic can be found in References [59-71].



3.8 INDOOR WLAN CHANNEL (60 GHz) 

Based on the results reported in Hao et al. [72], in this section we present spatial and 
temporal characteristics of 60 GHz indoor channels. In the experiment, a mechanically 
steered directional antenna is used to resolve multipath components. An automated system 
is used to precisely position the receiver antenna along a linear track and then rotate the 
antenna in the azimuthal direction, as illustrated in Figure 3.19. The positioning precision
of the track and spin is better than 1 mm and 1°, respectively. When a highly directional
antenna is used, the system provides high spatial resolution to resolve multipath components
with different angles of arrival (AOAs). The sliding correlator technique was used to further
resolve multipath components with the same AOA by their times of arrival (TOAs). The
spread spectrum signal has an RF bandwidth of 200 MHz, which provides a time resolution
of approximately 10 ns.

For this measurement campaign, an open-ended waveguide with 6.7 dB gain is used as 
the transmitter antenna and a horn antenna with 29 dB gain is used as the receiver antenna. 
These antennas are chosen to emulate typical antenna systems that have been proposed 
for millimeter-wave indoor applications. In these applications a sector antenna is used at 
the transmitter and a highly directional antenna is used at the receiver. Both antennas are 
vertically polarized and mounted on adjustable tripods about 1.6 m above the ground. The 
theoretical half-power beamwidths (HPBW) are 90° in azimuth and 125° in elevation for
the open-ended waveguide and 7° in azimuth and 5.6° in elevation for the horn antenna. 



Figure 3.19 Track and spin measurement procedure. Track measurements: track step λ/4,
80 steps. Spin measurements: track step 5λ, 4 steps; spin step 5°, 72 steps. (Reproduced
by permission of IEEE [72].)
3.8.1 Definition of the statistical parameters
3.8.1.1 Path loss and received signal power

The free-space path loss at a reference distance of $d_0$ is given by

$$ PL_{fs}(d_0)[\mathrm{dB}] = 20 \log_{10}\left(\frac{4\pi d_0}{\lambda}\right) \tag{3.51} $$

where $\lambda$ is the wavelength. Path loss over distance $d$ can be described by the path loss
exponent model as follows:

$$ \overline{PL}(d)[\mathrm{dB}] = PL_{fs}(d_0)[\mathrm{dB}] + 10 n \log_{10}(d/d_0) \tag{3.52} $$

where $\overline{PL}(d)$ is the average path loss value at a transmitter-receiver (TR) separation of
$d$ and $n$ is the path loss exponent that characterizes how fast the path loss increases with
the increase in TR separation. The path loss values represent the signal power loss from the
transmitter antenna to the receiver antenna. These path loss values do not depend on the
antenna gains or the transmitted power levels. For any given transmitted power, the received
signal power can be calculated as

$$ P_r[\mathrm{dBm}] = P_t[\mathrm{dBm}] + G_t[\mathrm{dB}] + G_r[\mathrm{dB}] - \overline{PL}(d)[\mathrm{dB}] \tag{3.53} $$

where G t and G r are transmitter and receiver gains, respectively. In this measurement 
campaign, the transmitted power level was 25 dBm, the transmitter antenna gain was 6.7 dB, 
and the receiver antenna gain was 29 dB. 
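
The link budget of Equations (3.51)-(3.53) can be sketched numerically as follows. The transmitted power and antenna gains are the campaign values quoted above; the reference distance of 1 m, the 10 m separation and the exponent n = 2 are illustrative assumptions and are not measured results from [72].

```python
import math

C_M_PER_S = 299_792_458.0  # speed of light

def free_space_pl_db(d0_m, freq_hz):
    """Equation (3.51): free-space path loss at the reference distance d0."""
    wavelength_m = C_M_PER_S / freq_hz
    return 20.0 * math.log10(4.0 * math.pi * d0_m / wavelength_m)

def mean_pl_db(d_m, n, d0_m, freq_hz):
    """Equation (3.52): path loss exponent model."""
    return free_space_pl_db(d0_m, freq_hz) + 10.0 * n * math.log10(d_m / d0_m)

def received_power_dbm(pt_dbm, gt_db, gr_db, pl_db):
    """Equation (3.53): received power from the link budget."""
    return pt_dbm + gt_db + gr_db - pl_db

pl0 = free_space_pl_db(1.0, 60e9)                         # about 68 dB at 1 m
pl = mean_pl_db(d_m=10.0, n=2.0, d0_m=1.0, freq_hz=60e9)  # assumed n and d
pr = received_power_dbm(25.0, 6.7, 29.0, pl)
```

With these assumed values the received power comes out near -27 dBm; a different exponent or separation shifts it by 10 n log10(d/d0).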



3.8.1.2 TOA parameters

TOA parameters characterize the time dispersion of a multipath channel. The calculated
TOA parameters include mean excess delay ($\bar{\tau}$), rms delay spread ($\sigma_\tau$), and also timing
jitter [$\delta(x)$] and standard deviation [$\Delta(x)$], in a small local area. The parameters
$\bar{\tau}$ and $\sigma_\tau$ are given as [72]:

$$ \bar{\tau} = \frac{\sum_{i=1}^{N} P_i \tau_i}{\sum_{i=1}^{N} P_i}, \qquad \sigma_\tau = \sqrt{\overline{\tau^2} - (\bar{\tau})^2}, \qquad \overline{\tau^2} = \frac{\sum_{i=1}^{N} P_i \tau_i^2}{\sum_{i=1}^{N} P_i} \tag{3.54} $$

where $P_i$ and $\tau_i$ are the power and delay of the ith multipath component of a PDP,
respectively, and $N$ is the total number of multipath components. Timing jitter is calculated
as the difference between the maximum and minimum measured values in a local area. Timing
jitter $\delta(x)$ and standard deviation $\Delta(x)$ are defined as

$$ \delta(x) = \max_{i=1}^{M}\{x_i\} - \min_{i=1}^{M}\{x_i\}, \qquad \Delta(x) = \sqrt{\overline{x^2} - (\bar{x})^2}, \qquad \bar{x} = \frac{1}{M}\sum_{i=1}^{M} x_i, \qquad \overline{x^2} = \frac{1}{M}\sum_{i=1}^{M} x_i^2 \tag{3.55} $$

where $x_i$ is the measured value for parameter $x$ ($\bar{\tau}$ or $\sigma_\tau$) in the ith measurement position
of the spatial sampling and $M$ is the total number of spatial samples in the local area. For
example, for the track measurements, $M$ was chosen to be 80.
Mean excess delay and rms delay spread are statistical measures of the time dispersion
of the channel. Timing jitter and standard deviation of $\bar{\tau}$ and $\sigma_\tau$ show the variation
of these parameters over the small local area. These TOA parameters directly affect the
performance of high-speed wireless systems. For instance, the mean excess delay can be
used to estimate the search range of rake receivers and the rms delay spread can be used
to determine the maximum transmission data rate in the channel without equalization. The
timing jitter and standard deviation parameters can be used to determine the update rate for
a rake receiver or an equalizer.
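
A minimal sketch of how the TOA parameters defined above could be computed from a sampled PDP is given below. The two-path profile and the three local values are toy inputs chosen so the results can be checked by hand; the function names are illustrative only.

```python
import numpy as np

def mean_excess_delay(power, delay):
    """Power-weighted mean of the path delays."""
    power, delay = np.asarray(power, float), np.asarray(delay, float)
    return np.sum(power * delay) / np.sum(power)

def rms_delay_spread(power, delay):
    """Square root of the power-weighted second central moment of the delays."""
    power, delay = np.asarray(power, float), np.asarray(delay, float)
    second_moment = np.sum(power * delay ** 2) / np.sum(power)
    return np.sqrt(max(second_moment - mean_excess_delay(power, delay) ** 2, 0.0))

def jitter_and_std(values):
    """Max-min spread and standard deviation of a parameter over a local area."""
    x = np.asarray(values, float)
    jitter = x.max() - x.min()
    std = np.sqrt(max(np.mean(x ** 2) - np.mean(x) ** 2, 0.0))
    return jitter, std

# Toy PDP: two equal-power paths at 0 ns and 20 ns.
tau_bar = mean_excess_delay([1.0, 1.0], [0.0, 20.0])
sigma_tau = rms_delay_spread([1.0, 1.0], [0.0, 20.0])
# Toy local-area values of a parameter (e.g. rms delay spread in ns).
jitter, std = jitter_and_std([10.0, 12.0, 14.0])
```

For the toy profile both the mean excess delay and the rms delay spread equal 10 ns, and the jitter of the three local values is 4 ns.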

3.8.1.3 AOA parameters

AOA parameters characterize the directional distribution of multipath power. The recorded
AOA parameters include angular spread $\Lambda$, angular constriction $\gamma$, maximum fading angle
$\theta_{max}$ and maximum AOA direction. The angular parameters $\Lambda$, $\gamma$ and $\theta_{max}$ are defined based
on the Fourier transform of the angular distribution of multipath power, $p(\theta)$ [74]:

$$ \Lambda = \sqrt{1 - \frac{|F_1|^2}{F_0^2}}, \qquad \gamma = \frac{|F_0 F_2 - F_1^2|}{F_0^2 - |F_1|^2}, \qquad \theta_{max} = \frac{1}{2} \arg\{F_0 F_2 - F_1^2\} \tag{3.56} $$

where

$$ F_n = \int_0^{2\pi} p(\theta) \exp(jn\theta)\, d\theta \tag{3.57} $$

$F_n$ is the nth complex Fourier coefficient of $p(\theta)$. As shown in Durgin and Rappaport [74], angular
spread, angular constriction and maximum fading angle are three key parameters to characterize
the small-scale fading behavior of the channel. These parameters can be used for
diversity techniques, fading rate estimation, and other space-time techniques. Maximum
AOA provides the direction of the multipath component with the maximum power. It can be
used in system installation to minimize the path loss. The results of measurements for the
parameters defined by Equations (3.51)-(3.57) are given in Tables 3.22-3.24 and Figure 3.20.
More details on the topic can be found in References [74-85].
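
The sketch below estimates the three angular parameters from a sampled angular power distribution. It assumes the usual shape-factor definitions of Durgin and Rappaport [74], namely Λ = sqrt(1 - |F_1|²/F_0²), γ = |F_0 F_2 - F_1²| / (F_0² - |F_1|²) and θ_max = ½ arg{F_0 F_2 - F_1²}, and it approximates the integral F_n by a sum over uniformly spaced angles. The 5° grid mirrors the spin step of the measurement; the test distributions are synthetic.

```python
import numpy as np

def shape_factors(theta_rad, power):
    """Angular spread, angular constriction and maximum fading angle.

    theta_rad must be uniformly spaced over [0, 2*pi); the common step
    d_theta cancels in all three quantities, so it is omitted from the sums.
    """
    theta = np.asarray(theta_rad, float)
    p = np.asarray(power, float)
    f0 = np.sum(p)
    f1 = np.sum(p * np.exp(1j * theta))
    f2 = np.sum(p * np.exp(2j * theta))
    spread = np.sqrt(max(0.0, 1.0 - abs(f1) ** 2 / f0 ** 2))
    num = f0 * f2 - f1 ** 2
    den = f0 ** 2 - abs(f1) ** 2
    # A single plane wave makes the denominator vanish: constriction undefined.
    constriction = abs(num) / den if den > 1e-12 * f0 ** 2 else float("nan")
    theta_max = 0.5 * np.angle(num)
    return spread, constriction, theta_max

theta = np.deg2rad(np.arange(0, 360, 5))        # 72 directions, 5 degree step
omni = shape_factors(theta, np.ones(theta.size))
two_path = np.zeros(theta.size)
two_path[[0, 36]] = 1.0                         # equal power from 0 and 180 degrees
two = shape_factors(theta, two_path)
single = np.zeros(theta.size)
single[10] = 1.0                                # all power from one direction
one = shape_factors(theta, single)
```

Omnidirectional power gives an angular spread of 1 with no constriction, two opposite equal-power paths give spread 1 and constriction 1, and a single path gives zero spread.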



3.9 UWB CHANNEL MODEL 

UWB channel parameters are discussed initially based on the measurement results in
Cassioli et al. [86]. The measurement environment is presented in Figure 3.21 and the
signal format used in these experiments in Figure 3.22. The repetition rate of the pulses is
2 × 10^6 pulses/s, implying that multipath spreads up to 500 ns could have been observed
unambiguously. Multipath profiles with a duration of 300 ns were measured. Multipath
profiles were measured at various locations in 14 rooms and hallways on one floor of the 
building presented in Figure 3.21. Each of the rooms is labeled alpha-numerically. Walls 
around offices are framed with metal studs and covered with plaster board. The wall around 
the laboratory is made from acoustically silenced heavy cement block. There are steel core 
support pillars throughout the building, notably along the outside wall and two within the 
laboratory itself. The shield room's walls and door are metallic. The transmitter is kept 
stationary in the central location of the building near a computer server in a laboratory
denoted by F. The transmit antenna is located 165 cm from the floor and 105 cm from the
ceiling.

Figure 3.20 Scatter plot of the measured path loss values.

Figure 3.21 The floor plan of a typical modern office building where the propagation
measurement experiment was performed. The concentric circles are centered on the
transmit antenna and are spaced at 1 m intervals. Legend: approximate location of
measurements. (Reproduced by permission of IEEE [87].)

Figure 3.22 The transmitted pulse measured by the receiving antenna located 1 m away
from the transmitting antenna with the same height.

At each receiver location, impulse response measurements were made at 49 measurement
points, arranged in a fixed-height, 7 × 7 square grid with 15 cm spacing, covering 90 ×
90 cm. A total of 741 different impulse responses were recorded. One side of the grid is
always parallel to the north wall of the room. The receiving antenna is located 120 cm from the
floor and 150 cm from the ceiling.

3.9.1 The large-scale statistics 

Experimental results show that all small-scale averaged PDPs (SSA-PDPs) exhibit an exponential
decay as a function of the excess delay. Since we perform a delay axis translation, the
direct path always falls in the first bin in all the PDPs. It also turns out that the direct
path is always the strongest path in the 14 SSA-PDPs, even if the LOS is obstructed. The
energy of the subsequent MPCs decays exponentially with delay, starting from the second
bin. Let $\bar{G}_k = A_{Spa}\{G_k\}$ be the locally averaged energy gain, where $A_{Spa}\{\cdot\}$ denotes the
spatial average over the 49 locations of the measurement grid. The average energy of
the second MPC may be expressed as a fraction $r$ of the average energy of the direct path,
i.e. $r = \bar{G}_2/\bar{G}_1$. We refer to $r$ as the power ratio. As we will show later, the SSA-PDP is
completely characterized by $\bar{G}_1$, the power ratio $r$ and the decay constant $\varepsilon$ (or equivalently,
by the total average received energy $\bar{G}_{tot}$, $r$ and $\varepsilon$).

The power ratio $r$ and the decay constant $\varepsilon$ vary from location to location, and should
be treated as stochastic variables. As only 14 values for $\varepsilon$ and $r$ were available, it was
not possible to extract the shape of their distribution from the measurement data. Instead,
a model was assumed a priori and the parameters of this distribution were fitted. It was
found that the log-normal distribution, denoted by $\varepsilon \sim \mathcal{L}_N(\mu_{\varepsilon dB}; \sigma_{\varepsilon dB})$, with $\mu_{\varepsilon dB} = 16.1$ and
$\sigma_{\varepsilon dB} = 1.27$, gives the best agreement with the empirical distribution. Applying the same
procedure to characterize the power ratios $r$, it was found that they are also log-normally
distributed, i.e. $r \sim \mathcal{L}_N(\mu_{r dB}; \sigma_{r dB})$, with $\mu_{r dB} = -4$ and $\sigma_{r dB} = 3$.
Table 3.24 Measured penetration losses and results from literature

Material                                         Penetration loss (dB)   Reference
Composite wall with studs not in the path        8.8                     [72]
Composite wall with studs in the path            35.5                    [72]
Glass door                                       2.5                     [72]
Concrete wall 1 week after concreting            73.6                    [75]
Concrete wall 2 weeks after concreting           68.4                    [75]
Concrete wall 5 weeks after concreting           46.5                    [75]
Concrete wall 14 months after concreting         28.1                    [75]
Plasterboard wall                                5.4-8.1                 [76]
Partition of glass wool with plywood surfaces    9.2-10.1                [76]
Partition of cloth-covered plywood               3.9-8.7                 [76]
Granite with width of 3 cm                       >30                     [77]
Glass                                            1.7-4.5                 [77]
Metalized glass                                  >30                     [77]
Wooden panels                                    6.2-8.6                 [77]
Brick with width of 11 cm                        17                      [77]
Limestone with width of 3 cm                     >30                     [77]
Concrete                                         >30                     [77]



By integrating the SSA-PDP of each room over all delay bins, the total average energy
$\bar{G}_{tot}$ within each room is obtained. Then its dependence on the TR separation is analyzed.
It was found that a breakpoint model, commonly referred to as a dual-slope model, can be
adopted for path loss PL as a function of the distance, as

$$ PL = \begin{cases} 20.4 \log_{10}(d/d_0), & d \le 11\ \mathrm{m} \\ -56 + 74 \log_{10}(d/d_0), & d > 11\ \mathrm{m} \end{cases} \tag{3.58} $$

where PL is expressed in decibels, $d_0 = 1$ m is the reference distance, and $d$ is the TR
separation distance in meters. Because of the shadowing phenomenon, $\bar{G}_{tot}$ varies statistically
around the value given by Equation (3.58). A common model for shadowing is the log-normal
distribution [87, 88]. By assuming such a model, it was found that $\bar{G}_{tot}$ is log-normally
distributed about Equation (3.58), with a standard deviation of the associated normal random
variable equal to 4.3 dB.



3.9.2 The small-scale statistics 

The differences between the PDPs at the different points of the measurement grid are caused
by small-scale fading. In 'narrowband' models, it is usually assumed that the magnitude
of the first (quasi-LOS) multipath component follows Rician or Nakagami statistics and
the later components are assumed to have Rayleigh statistics [89]. However, in UWB
propagation, each resolved MPC is due to a small number of scatterers, and the amplitude
distribution in each delay bin differs markedly from the Rayleigh distribution. In fact,
the presented analysis showed that the best-fit distribution of the small-scale magnitude
statistics is the Nakagami distribution [90], corresponding to a gamma distribution of the
energy gains. This distribution has been used to model the magnitude statistics in mobile
radio when the conditions of the central limit theorem are not fulfilled [91]. The parameters
of the gamma distribution vary from bin to bin: $\Gamma(\Omega_k; m_k)$ denotes the gamma distribution
that fits the energy gains of the local PDPs in the kth bin within each room. The $\Omega_k$ are
given as $\Omega_k = \bar{G}_k$, i.e. the magnitude of the SSA-PDP in the kth bin. The $m_k$ are related to
the variance of the energy gain of the kth bin. Figure 3.23 shows the scatter plot of the $m_k$
as a function of excess delay for all the bins (except the LOS components). It can be seen
from Figure 3.23 that the $m_k$ values range between 1 and 6 (rarely 0.5), decreasing with
increasing excess delay. This implies that MPCs arriving with large excess delays are
more diffused than the first arriving components, which agrees with intuition.

Figure 3.23 Scatter plot of the m-Nakagami of the best fit distribution vs excess delay
for all the bins except the LOS components. Different markers correspond to
measurements in different rooms. (Reproduced by permission of IEEE [86].)

The $m_k$ parameters of the gamma distributions are themselves random variables, distributed
according to a truncated Gaussian distribution, denoted by $m \sim T_N(\mu_m; \sigma_m^2)$, i.e.
their distribution looks like a Gaussian for $m \ge 0.5$ and is zero elsewhere:

$$ f_m(x) = \begin{cases} K_m e^{-(x-\mu_m)^2/(2\sigma_m^2)}, & \text{if } x \ge 0.5 \\ 0, & \text{otherwise} \end{cases} \tag{3.59} $$

where the normalization constant $K_m$ is chosen so that the integral of $f_m(x)$ is unity.
The mean and variance of the Gaussian distributions that fit the $m_k$, as a function of the
excess delay, are given by [17]

$$ \mu_m(\tau_k) = 3.5 - \frac{\tau_k}{73} \tag{3.59a} $$

$$ \sigma_m^2(\tau_k) = 1.84 - \frac{\tau_k}{160} \tag{3.59b} $$

where the unit of $\tau_k$ is nanoseconds.
3.9.3 The statistical model

The received signal is a sum of the replicas (echoes) of the transmitted signal, being related
to the reflecting, scattering and/or deflecting objects via which the signal propagates. Each
of the echoes is related to a single such object. In a narrowband system, the echoes at the
receiver are only attenuated, phase-shifted and delayed, but undistorted, so that the received
signal may be modeled as a linear combination of $N_{path}$ delayed basic waveforms $w(t)$

$$ r(t) = \sum_{i=1}^{N_{path}} c_i\, w(t - \tau_i) + n(t) \tag{3.60} $$

where $n(t)$ is the observation noise. In UWB systems, the frequency selectivity of the
reflection, scattering and/or diffraction coefficients of the objects via which the signal
propagates can lead to a distortion of the transmitted pulses. Furthermore, the distortion
and, thus, the shape of the arriving echoes, varies from echo to echo. The received signal is
thus given as

$$ r(t) = \sum_{i=1}^{N_{path}} c_i\, w_i(t - \tau_i) + n(t) \tag{3.61} $$

If the pulse distortion was greater than the width of the delay bins (2 ns), one would observe
a significant correlation between adjacent bins. The fact that the correlation coefficient
remains very low for all analyzed sets of the data implies that the distortion of a pulse due
to a single echo is not significant, so that in the following, Equation (3.60) can be used. The
SSA-PDP of the channel may be expressed as

$$ \bar{g}(\tau) = \sum_{k=1}^{N_{bins}} \bar{G}_k\, \delta(\tau - \tau_k) \tag{3.62} $$

where the function $\bar{g}(\tau)$ can be interpreted as the average energy received at a certain receiver
position and a delay $\tau$, normalized to the total energy received at one meter distance, and
$N_{bins}$ is the total number of bins in the observation window. Assuming an exponential decay
starting from the second bin, we have

$$ \bar{g}(\tau) = \bar{G}_1\, \delta(\tau - \tau_1) + \sum_{k=2}^{N_{bins}} \bar{G}_2 \exp[-(\tau_k - \tau_2)/\varepsilon]\, \delta(\tau - \tau_k) \tag{3.63} $$

where $\varepsilon$ is the decay constant of the SSA-PDP. The total average energy received over the
observation interval $T$ is

$$ \bar{G}_{tot} = \int_0^T \bar{g}(\tau)\, d\tau = \bar{G}_1 + \sum_{k=2}^{N_{bins}} \bar{G}_2 \exp[-(\tau_k - \tau_2)/\varepsilon] \tag{3.64} $$

Summing the geometric series gives

$$ \bar{G}_{tot} = \bar{G}_1 [1 + rF(\varepsilon)] \tag{3.65} $$

where $r = \bar{G}_2/\bar{G}_1$ is the power ratio, and

$$ F(\varepsilon) = \frac{1 - \exp[-(N_{bins} - 1)\Delta\tau/\varepsilon]}{1 - \exp(-\Delta\tau/\varepsilon)} \approx \frac{1}{1 - \exp(-\Delta\tau/\varepsilon)} \tag{3.66} $$
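The total normalized average energy is log-normally distributed, due to the shadowing,
around the mean value given by the path loss model (3.58):

$$ \bar{G}_{tot} \sim \mathcal{L}_N(-PL;\ 4.3) \tag{3.67} $$

From Equation (3.65), we have for the average energy gains

$$ \bar{G}_k = \begin{cases} \dfrac{\bar{G}_{tot}}{1 + rF(\varepsilon)}, & \text{for } k = 1 \\[2ex] \dfrac{\bar{G}_{tot}}{1 + rF(\varepsilon)}\, r e^{-(\tau_k - \tau_2)/\varepsilon}, & \text{for } k = 2, \ldots, N_{bins} \end{cases} \tag{3.68} $$

and Equation (3.62) becomes

$$ \bar{g}(\tau) = \frac{\bar{G}_{tot}}{1 + rF(\varepsilon)} \left\{ \delta(\tau - \tau_1) + \sum_{k=2}^{N_{bins}} \left[ r e^{-(\tau_k - \tau_2)/\varepsilon} \right] \delta(\tau - \tau_k) \right\} \tag{3.69} $$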



3.9.4 Simulation steps 

In the model, the local PDP is fully characterized by the pairs $\{G_k, \tau_k\}$, where $\tau_k =
(k-1)\Delta\tau$ with $\Delta\tau = 2$ ns. The $G_k$ are generated by a superposition of large-scale and
small-scale statistics. The process starts by generating the total mean energy $\bar{G}_{tot}$ at a certain
distance according to Equation (3.67). Next, the decay constant $\varepsilon$ and the power ratio $r$ are
generated as log-normally distributed random numbers

$$ \varepsilon \sim \mathcal{L}_N(16.1;\ 1.27) \tag{3.70} $$

$$ r \sim \mathcal{L}_N(-4;\ 3) \tag{3.71} $$

The width of the observation window is set at $T = 5\varepsilon$. Thus, the SSA-PDP is completely
specified according to Equation (3.69). Finally, the local PDPs are generated by computing
the normalized energy gains $G_k$ of every bin $k$ and every location $i$ as gamma-distributed
independent variables. The gamma distributions have the average given by Equation (3.68),
and the $m_k$ are generated as independent truncated Gaussian random variables

$$ m_k \sim T_N\left(\mu_m(\tau_k);\ \sigma_m^2(\tau_k)\right) \tag{3.72} $$

with $\mu_m(\tau_k)$ and $\sigma_m^2(\tau_k)$ given by Equations (3.59a) and (3.59b). These steps are summarized
in Table 3.25. Some results are shown in Figure 3.24.
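
The simulation steps can be sketched in Python roughly as follows. This is an illustration under stated assumptions rather than the procedure of [86]: the log-normal parameters are read as mean and standard deviation in dB, the decay constant is taken in nanoseconds, F(ε) is evaluated as the exact finite sum instead of its approximation, and the truncated Gaussian is sampled by simple rejection with a variance floor and a fallback value of 0.5 for bins where the fitted variance becomes very small or negative. All function names are hypothetical.

```python
import numpy as np

DT_NS = 2.0  # delay-bin width from the text

def path_loss_db(d_m, d0_m=1.0):
    """Dual-slope path loss model with the breakpoint at 11 m."""
    x = np.log10(d_m / d0_m)
    return 20.4 * x if d_m <= 11.0 else -56.0 + 74.0 * x

def lognormal_db(rng, mean_db, std_db):
    """Draw X such that 10*log10(X) is normal with the given mean and std."""
    return 10.0 ** (rng.normal(mean_db, std_db) / 10.0)

def truncated_gaussian(rng, mean, var, low=0.5, tries=1000):
    """Rejection-sample a Gaussian restricted to x >= low."""
    std = np.sqrt(max(var, 1e-3))            # assumption: floor on the variance
    x = rng.normal(mean, std, size=tries)
    ok = x[x >= low]
    return float(ok[0]) if ok.size else low  # assumption: fallback value

def simulate_room(d_m, n_locations=49, seed=0):
    rng = np.random.default_rng(seed)
    g_tot = lognormal_db(rng, -path_loss_db(d_m), 4.3)   # shadowing
    eps_ns = lognormal_db(rng, 16.1, 1.27)               # decay constant
    r = lognormal_db(rng, -4.0, 3.0)                     # power ratio
    n_bins = max(2, int(np.ceil(5.0 * eps_ns / DT_NS)))  # window T = 5*eps
    tau = DT_NS * np.arange(n_bins)                      # tau_k = (k-1)*dt
    decay = np.exp(-(tau[1:] - tau[1]) / eps_ns)
    f_eps = decay.sum()                                  # finite-sum F(eps)
    g_bar = np.empty(n_bins)
    g_bar[0] = g_tot / (1.0 + r * f_eps)                 # first bin
    g_bar[1:] = g_bar[0] * r * decay                     # bins k >= 2
    pdps = np.empty((n_locations, n_bins))
    for k in range(n_bins):
        m_k = truncated_gaussian(rng, 3.5 - tau[k] / 73.0, 1.84 - tau[k] / 160.0)
        # Gamma-distributed energy gain with mean g_bar[k] and shape m_k.
        pdps[:, k] = rng.gamma(shape=m_k, scale=g_bar[k] / m_k, size=n_locations)
    return tau, g_tot, g_bar, pdps

tau, g_tot, g_bar, pdps = simulate_room(d_m=6.0)
```

Each row of `pdps` is one local PDP on the 49-point grid, and the averaged profile `g_bar` sums to the shadowed total energy `g_tot` by construction.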



3.9.5 Clustering models for the indoor multipath propagation channel 

A number of models for the indoor multipath propagation channel [92-96] have reported
a clustering of multipath components, in both time and angle. In the model presented in
Spencer et al. [95], the received signal amplitude $\beta_{kl}$ is a Rayleigh-distributed random
variable with a mean-square value that obeys a double exponential decay law, according to

$$ \overline{\beta_{kl}^2} = \overline{\beta^2(0,0)}\, e^{-T_l/\Gamma}\, e^{-\tau_{kl}/\gamma} \tag{3.73} $$

where $\overline{\beta^2(0,0)}$ describes the average power of the first arrival of the first cluster, $T_l$ represents
the arrival time of the lth cluster, and $\tau_{kl}$ is the arrival time of the kth arrival within the
lth cluster, relative to $T_l$. The parameters $\Gamma$ and $\gamma$ determine the intercluster signal level
rate of decay and the intracluster rate of decay, respectively. The parameter $\Gamma$ is generally
determined by the architecture of the building, while $\gamma$ is determined by objects close to
the receiving antenna, such as furniture. The results presented in Spencer et al. [95] make
the assumption that the channel impulse response as a function of time and azimuth angle
is a separable function, or

$$ h(t, \theta) = h(t)\, h(\theta) \tag{3.74} $$

from which independent descriptions of the multipath time-of-arrival and angle-of-arrival
are developed. This is justified by observing that the angular deviation of the signal arrivals
within a cluster from the cluster mean does not increase as a function of time.

Table 3.25 Statistical models and parameters

Global parameters ⇒ $\bar{G}_{tot}$ and $\bar{G}_k$
  Path loss        PL = 20.4 log10(d/d_0),        d ≤ 11 m
                   PL = -56 + 74 log10(d/d_0),    d > 11 m
  Shadowing        $\bar{G}_{tot} \sim \mathcal{L}_N(-PL;\ 4.3)$
  Decay constant   $\varepsilon \sim \mathcal{L}_N(16.1;\ 1.27)$
  Power ratio      $r \sim \mathcal{L}_N(-4;\ 3)$

Local parameters ⇒ $G_k$
  Energy gains     $G_k \sim \Gamma(\bar{G}_k;\ m_k)$
  m values         $\mu_m(\tau_k) = 3.5 - \tau_k/73$
                   $\sigma_m^2(\tau_k) = 1.84 - \tau_k/160$

Figure 3.24 (a) The measured 49 local PDPs for an example room. (b) Simulated 49 local
PDPs for an example room. Horizontal axes: excess delay (ns). (Reproduced by permission
of IEEE [86].)

The cluster decay rate $\Gamma$ and the ray decay rate $\gamma$ can be interpreted for the environment
in which the measurements were made. For the results presented later in this section, at
least one wall separates the transmitter and the receiver. Each cluster can be viewed as a
path that exists between the transmitter and the receiver along which signals propagate. This
cluster path is generally a function of the architecture of the building itself. The component
arrivals within a cluster vary because of secondary effects, e.g. reflections from the furniture
or other objects. The primary source of degradation in the propagation through the features
of the building is captured in the decay exponent $\Gamma$. Relative effects between paths in the
same cluster do not always involve the penetration of additional obstructions or additional
reflections, and therefore tend to contribute less to the decay of the component signals.
Results for $p(\theta)$ generated from the data in Cramer et al. [97] are shown in Figure 3.25.
Interarrival times are hypothesized [95] to follow exponential rate laws, given by

$$ p(T_l \mid T_{l-1}) = \Lambda e^{-\Lambda (T_l - T_{l-1})} $$
$$ p(\tau_{kl} \mid \tau_{(k-1)l}) = \lambda e^{-\lambda (\tau_{kl} - \tau_{(k-1)l})} \tag{3.75} $$

where $\Lambda$ is the cluster arrival rate and $\lambda$ is the ray arrival rate. Channel parameters are
summarized in Table 3.26.
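
To make the clustering model concrete, the sketch below draws cluster and ray arrival times with exponential interarrival times and assigns each ray a Rayleigh amplitude whose mean-square value follows the double exponential decay law. The default parameters are the UWB column of Table 3.26, with the mean ray interarrival time read as 2.3 ns; the 300 ns window, the unit first-arrival power and the convention that the first cluster starts at time zero are assumptions made only for this example.

```python
import numpy as np

def sv_arrivals(rng, cluster_decay_ns=27.9, ray_decay_ns=84.1,
                mean_cluster_gap_ns=45.5, mean_ray_gap_ns=2.3,
                window_ns=300.0, beta00_sq=1.0):
    """Return rows of (cluster time, ray delay, mean-square value, amplitude)."""
    rows = []
    t_cluster = 0.0                          # assumption: first cluster at time 0
    while t_cluster < window_ns:
        t_ray = 0.0                          # first ray coincides with cluster start
        while t_cluster + t_ray < window_ns:
            # Double exponential decay of the mean-square amplitude.
            mean_sq = (beta00_sq * np.exp(-t_cluster / cluster_decay_ns)
                       * np.exp(-t_ray / ray_decay_ns))
            # Rayleigh scale chosen so that E[amp**2] equals mean_sq.
            amp = rng.rayleigh(np.sqrt(mean_sq / 2.0))
            rows.append((t_cluster, t_ray, mean_sq, amp))
            t_ray += rng.exponential(mean_ray_gap_ns)        # ray interarrival
        t_cluster += rng.exponential(mean_cluster_gap_ns)    # cluster interarrival
    return np.array(rows)

paths = sv_arrivals(np.random.default_rng(0))
```

Clusters are allowed to overlap in time here, since rays of one cluster keep arriving until the end of the observation window.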



Figure 3.25 (a) Ray arrival angles at 1° of resolution and a best fit Laplacian density with
σ = 38°; horizontal axis: θ (degrees). (b) Distribution of the cluster azimuth
angle-of-arrival, relative to the reference cluster. (Reproduced by permission of IEEE [97].)
Table 3.26 Channel parameters

Parameter   UWB [97]   Spencer et al. [95]   Spencer et al. [95]   Saleh and Valenzuela [94]
Γ           27.9 ns    33.6 ns               78.0 ns               60 ns
γ           84.1 ns    28.6 ns               82.2 ns               20 ns
1/Λ         45.5 ns    16.8 ns               17.3 ns               300 ns
1/λ         2.3 ns     5.1 ns                6.6 ns                5 ns
σ           37°        25.5°                 21.5°                 -


3.9.6 Path loss modeling 

In this section we are interested in a transceiver operating at approximately 2 GHz center
frequency with a bandwidth in excess of 1.5 GHz, which translates to sub-nanosecond time
resolution in the CIRs.

3.9.6.1 Measurement procedure

The measurement campaign is described in Yano [98] and was conducted in a single-floor, 
hard-partition office building (fully furnished). The walls were constructed of drywall with 
vertical metal studs; there was a suspended ceiling 10 feet in height with carpeted concrete 
floor. Measurements were conducted with a stationary receiver and mobile transmitter; both 
transmit and receive antennas were 5 feet above the floor. For each measurement, a 300 ns 
time-domain scan was recorded and the LOS distance from transmitter to receiver was 
recorded. A total of 906 profiles were included in the dataset with seven different receiver 
locations recorded over the course of several days. Except for a reference measurement 
made for each receiver location, all successive measurements were NLOS links chosen 
randomly throughout the office layout that penetrated anywhere from one to five walls. The 
remainder of datapoints were taken in similar fashion. 

3.9.6.2 Path loss modeling 

The average path loss for an arbitrary TR separation is expressed using the power law as
a function of distance. The indoor environment measurements show that, at any given d,
shadowing leads to signals with a path loss that is log-normally distributed about the mean
[99]. That is:

$$ PL(d) = PL_0(d_0) + 10 N \log_{10}\left(\frac{d}{d_0}\right) + X_\sigma \tag{3.76} $$

where $N$ is the path loss exponent, $X_\sigma$ is a zero-mean log-normally distributed random
variable (i.e. Gaussian in decibels) with standard deviation $\sigma$ (dB), and $PL_0$ is the free
space path loss at the reference distance $d_0$. Some results are shown in Figure 3.26.

Figure 3.26 (a) Peak PL vs distance; (b) total PL vs distance; (c) peak PL + RAKE gain vs
distance. (Reproduced by permission of IEEE [98].)

Figure 3.27 (a) The RMS delay spread vs distance; (b) RMS delay spread vs path loss.
(Reproduced by permission of IEEE [98].)

Assuming a simple RAKE with four correlators where each component is weighted
equally, we can calculate the path loss vs distance using the peak channel impulse response
(CIR) power plus RAKE gain, $PL_{PEAK+RAKE}$, for each CIR, as shown in Figure 3.26(c). The
exponent $N$ obtained from performing a least-squares fit is 2.5, with a standard deviation of
4.04 dB. The results for delays are shown in Figure 3.27.
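
A least-squares fit of the path loss exponent, of the kind referred to above, might look like the following sketch. The data are synthetic: 906 links are generated from the log-distance model with N = 2.5 and a 4.04 dB shadowing term, while the 40 dB intercept and the 1 to 30 m distance range are placeholders. The sketch also fits the intercept, which is an assumption; fixing it to the free-space value at the reference distance is equally possible.

```python
import numpy as np

def fit_path_loss(d_m, pl_db, d0_m=1.0):
    """Least-squares fit of PL = PL0 + 10*N*log10(d/d0); returns (PL0, N, sigma)."""
    pl_db = np.asarray(pl_db, float)
    x = 10.0 * np.log10(np.asarray(d_m, float) / d0_m)
    design = np.column_stack([np.ones_like(x), x])
    (pl0, n), *_ = np.linalg.lstsq(design, pl_db, rcond=None)
    residuals = pl_db - (pl0 + n * x)
    return pl0, n, residuals.std()

rng = np.random.default_rng(0)
d = rng.uniform(1.0, 30.0, size=906)                     # hypothetical TR separations
clean = 40.0 + 10.0 * 2.5 * np.log10(d)                  # noise-free model values
noisy = clean + rng.normal(0.0, 4.04, size=d.size)       # add shadowing in dB

pl0_clean, n_clean, sigma_clean = fit_path_loss(d, clean)
pl0_fit, n_fit, sigma_fit = fit_path_loss(d, noisy)
```

On the noise-free data the fit recovers the exponent exactly; with shadowing added, the estimate scatters around 2.5 and the residual standard deviation approximates the shadowing σ.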



3.9.6.3 In-home channel 

For the in-home channel, Equation (3.76) can also be used to model path loss. Some results
are shown in Table 3.27 [100-103]. Table 3.28 presents the results for delay spread in the
in-home channel [100].
Table 3.27 Statistical values of the path loss parameters

              LOS                 NLOS
              Mean     SD         Mean     SD
PL_0 (dB)     47       -          51       -
N             1.7      0.3        3.5      0.97
σ (dB)        1.6      0.5        2.7      0.98

Table 3.28 Percentage of power contained in profile, number of paths, mean excess delay
and RMS delay spread for 5, 10, 15, 20 and 30 dB threshold levels

             50 % NLOS                                 90 % NLOS
Threshold    Power %   L    τ_m (ns)   τ_RMS (ns)      Power %   L    τ_m (ns)   τ_RMS (ns)
5 dB         46.8      7    1.95       1.52            46.9      8    2.2        1.65
10 dB        89.2      27   7.1        5.77            86.5      31   8.1        6.7
15 dB        97.3      39   8.6        7.48            96        48   10.3       9.3
20 dB        99.4      48   9.87       8.14            99.5      69   12.2       11
30 dB        99.97     60   10.83      8.43            99.96     82   12.4       11.5


REFERENCES 

[1] S. Glisic, Advanced Wireless Communications, 4G Technology. Wiley: Chichester 
2004. 

[2] K.I. Pedersen, PE. Mogensen and B .H. Fleury, A stochastic model of the temporal and 
azimuthal dispersion seen at the base station in outdoor propagation environments, 
IEEE Trans. Vehicle TechnoL, vol. 49, 2000, pp. 437-447. 

[3] F. Frederiksen, P. Mogensen, K.I. Pedersen and P. Leth-Espensen, A software testbed 
for performance evaluation of adaptive antennas in FH GSM and wideband-CDMA, 
in Proc. 3rd ACTS Mobile Communication Summit, vol. 2, Rhodes, June 1998, 
pp. 430-435. 

[4] A. Algans, K.I. Pedersen and PE. Mogensen, Experimental analysis of the joint 

statistical properties of azimuth spread, delay spread, and shadow fading, IEEE J. 

Select. Areas Commun., vol. 20, 2002, pp. 523-531. 
[5] D.C Cox, R. Murray and A. Norris, 800 MHz attenuation measured in and around 

sub-urban houses, AT&T Bell Lab. Tech. J., vol. 673, 1994. 
[6] R.C. Bernhardt, Macroscopic diversity in frequency reuse systems, IEEE J. Select. 

Areas Commun., vol. 5, 1987, pp. 862-878. 
[7] H. L. Bertoni, Radio Propagation for Modern Wireless System. Prentice-Hall: 

Englewood Cliffs, NJ, 2000. 
[8] L. Correia, Wireless Flexible Personalised Communications-Cost 259 Final Report. 

Wiley: New York, 2001. 
[9] J. Winters, Optimum combining in digital mobile radio with cochannel interference 

IEEE Trans. Vehicle TechnoL, vol. VT-33, 1984. 



94 



CHANNEL MODELING FOR 4G 



[10] K.I. Pedersen, RE. Mogensen and B.H. Fleury, Spatial channel characteristics in 
outdoor environments and their impact on BS antenna system performance, in Proc. 
IEEE Vehicular Technology Conf. (VTC'98), Ottawa, May 1998, pp. 719-724. 

[11] K.I. Pedersen and RE. Mogensen, Evaluation of vector- RAKE receivers using dif- 
ferent antenna array configurations and combining schemes, Int. J. Wireless Inform. 
Networks, vol. 6. 1999, pp. 181-194. 

[12] V. Veen and K.M. Buckley, Bcamforming: a versatile approach to spatial filtering, 
IEEE Acoust., Speech Signal Processing (ASSP) Mag., vol, 5, 1988, pp. 4-24. 

[13] J. Liberti and T. Rappaport, A geometrically based model for line-of-sight multipath 
radio channels, in Proc. IEEE Vehicular Technology Conf. (VTC 96), May 1996, 
pp. 844-848. 

[14] M. Lu, T. Lo and J. Litva, A physical spatio-temporal model of multipath propagation 
channels, in Proc. IEEE Vehicular Technology Conf. (VTC 97), May 1997, pp. 810- 
814. 

[15] O. Nørklit and J.B. Andersen, Diffuse channel model and experimental results for
antenna arrays in mobile environments, IEEE Trans. Antennas Propagat., vol. 96,
1998, pp. 834-840.

[16] C. Cheon, H.L. Bertoni and G. Liang, Monte Carlo simulation of delay and angle
spread in different building environments, in Proc. IEEE Vehicular Technology Conf.
(VTC'00), Boston, MA, vol. 1, 20-28 September 2000, pp. 49-56.

[17] U. Martin, A directional radio channel model for densely built-up urban areas, in
Proc. European Personal Mobile Communications Conf. (EPMCC), Bonn, October
1997, pp. 237-244.

[18] M. Gudmundson, Correlation model for shadow fading in mobile radio systems,
IEEE Electron. Lett., vol. 27, 1992, pp. 2126-2145.

[19] A. Mawira, Models for the spatial correlation functions of the log-normal component
of the variability of VHF/UHF field strength in urban environment, in Proc. Personal,
Indoor and Mobile Radio Communications (PIMRC'92), London, 1992, pp. 436-440.

[20] A. Gehring, M. Steinbauer, I. Gaspard and M. Grigat, Empirical channel stationarity
in urban environments, in Proc. European Personal Mobile Communications Conf.
(EPMCC), Vienna, February, 2001.

[21] T.B. Sørensen, Correlation model for shadow fading in a small urban macro cell,
in Proc. Personal Indoor and Mobile Radio Communications (PIMRC'98), Boston,
MA, 1998.

[22] Algorithms and antenna array recommendations, Public deliverable from European
ACTS, TSUNAMI II Project, Deliverable code: AC020/AUC/A1.2/DR/P/005/M,
May 1997.

[23] Commission of the European Communities, Information technologies and sciences -
digital land mobile radio communications, COST 207, 1998.

[24] P.A. Bello, Characterization of randomly time-variant linear channels, IEEE Trans.
Commun. Syst., vol. CS-11, 1963, pp. 360-393.

[25] J.A. Fessler and A. Hero, Space-alternating generalised expectation-maximization
algorithm, IEEE Trans. Signal Processing, vol. 42, 1994, pp. 2664-2677.

[26] B.H. Fleury, M. Tschudin, R. Heddergott, D. Dahlhaus and K.I. Pedersen, Channel
parameter estimation in mobile radio environments using the SAGE algorithm, IEEE
J. Select. Areas Commun., vol. 17, 1999, pp. 434-450.






[27] K.I. Pedersen, B.H. Fleury and P.E. Mogensen, High resolution of electromagnetic
waves in time-varying radio channels, in Proc. Int. Symp. Personal, Indoor and
Mobile Radio Communications (PIMRC'97), Helsinki, September 1997, pp. 650-654.

[28] K.I. Pedersen, P.E. Mogensen, B.H. Fleury, F. Frederiksen and K. Olesen, Analysis
of time, azimuth and doppler dispersion in outdoor radio channels, in Proc. ACTS 
Mobile Communication Summit '97, Aalborg, October 1997, pp. 308-313. 

[29] M. Hata, Empirical formula for propagation loss in land mobile radio service, IEEE 
Trans. Vehicle Technol., vol. VT-29, 1980, pp. 317-325. 

[30] L.J. Greenstein, V. Erceg, Y.S. Yeh and M.V. Clark, A new path-gain/delay-spread
propagation model for digital cellular channels, IEEE Trans. Vehicle Technol., vol. 46,
1997, pp. 477-485.

[31] M. Toeltsch, J. Laurila, K. Kalliola, A.F. Molisch, P. Vainikainen and E. Bonek, 
Statistical characterization of urban spatial radio channels, IEEE J. Select. Areas 
Commun., vol. 20, no. 3, 2002, pp. 539-549. 

[32] R. Roy, A. Paulraj and T. Kailath, ESPRIT - a subspace rotation approach to estimation
of parameters of cisoids in noise, IEEE Trans. Acoust., Speech, Signal Process.,
vol. 32, 1986, pp. 1340-1342.

[33] M. Zoltowski, M. Haardt and C. Mathews, Closed-form 2-D angle estimation with 
rectangular arrays in element space or beamspace via unitary ESPRIT, IEEE Trans. 
Signal Process., vol. 44, 1994, pp. 316-328. 

[34] M. Haardt and J. Nossek, Unitary ESPRIT: how to obtain an increased estimation 
accuracy with a reduced computational burden, IEEE Trans. Signal Process., vol. 43,
1995, pp. 1232-1242. 

[35] S.C. Swales, M. Beach, D. Edwards and J.P. McGeehan, The performance enhancement
of multibeam adaptive base-station antennas for cellular land mobile radio systems,
IEEE Trans. Vehicle Technol., vol. 39, 1990, pp. 56-67.

[36] R.B. Ertel, P. Cardieri, K.W. Sowerby, T.S. Rappaport and J.H. Reed, Overview of 
spatial channel models for antenna array communications systems, IEEE Personal 
Commun., vol. 5, no. 1, 1998, pp. 10-22. 

[37] U. Martin, J. Fuhl, I. Gaspard, M. Haardt, A. Kuchar, C. Math, A.F. Molisch and 
R. Thoma, Model scenarios for direction-selective adaptive antennas in cellular mo- 
bile communication systems - scanning the literature, Wireless Personal Commun. 
Mag. (Special Issue on Space Division Multiple Access), vol. 11, no. 1, 1999, pp. 109- 
129. 

[38] K. Pedersen, P. Mogensen and B. Fleury, A stochastic model of the temporal and 
azimuthal dispersion seen at the base station in outdoor propagation environments, 
IEEE Trans. Vehicle Technol., vol. 49, no. 2, 2000, pp. 437-447. 

[39] J. Fuhl, J. -P. Rossi and E. Bonek, High resolution 3-D direction-of-arrival deter- 
mination for urban mobile radio, IEEE Trans. Antennas Propagat., vol. 4, 1997, 
pp. 672-682. 

[40] A. Kuchar, J.-P. Rossi and E. Bonek, Directional macro-cell channel characterization
from urban measurements, IEEE Trans. Antennas Propagat., vol. 48, 2000,
pp. 137-146.

[41] K. Kalliola, H. Laitinen, L. Vaskelainen and P. Vainikainen, Real-time 3D spatial- 
temporal dual-polarized measurement of wideband radio channel at mobile station, 
IEEE Trans. Instrum. Meas., vol. 49, 2000, pp. 439-448. 






[42] J. Kivinen, T. Korhonen, P. Aikio, R. Gruber, P. Vainikainen and S.-G. Haggman, 
Wideband radio channel measurement system at 2 GHz, IEEE Trans. Instrum. Meas.,
vol. 48, 1999, pp. 39-44. 

[43] J.P. Kermoal, L. Schumacher, K.I. Pedersen, P.E. Mogensen and F. Frederiksen, A
stochastic MIMO radio channel model with experimental validation, IEEE J. Selected 
Areas Commun., vol. 20, 2002, pp. 1211-1226. 

[44] K. Yu, M. Bengtsson, B. Ottersten, D. McNamara, P. Karlsson and M. Beach, Second 
order statistics of NLOS indoor MIMO channels based on 5.2 GHz measurements, 
in Proc. GLOBECOM'01, San Antonio, TX, 2001, pp. 156-160. 

[45] F. Frederiksen, P. Mogensen, K.I. Pedersen and P. Leth-Espensen, A 'Software' 
testbed for performance evaluation of adaptive antennas in FH GSM and wideband- 
CDMA, in Conf. Proc. 3rd ACTS Mobile Communication Summit, vol. 2, Rhodes, 
June 1998, pp. 430-435. 

[46] J.B. Andersen, Array gain and capacity for known random channels with multiple 
element arrays at both ends, IEEE J. Select. Areas Commun., vol. 18, 2000, pp. 2172- 
2178. 

[47] X. Zhao, J. Kivinen, P. Vainikainen and K. Skog, Propagation characteristics for wide- 
band outdoor mobile communications at 5.3 GHz, IEEE J. Selected Areas Commun., 
vol. 20, 2002, pp. 507-514. 

[48] H. Suzuki, A statistical model for urban radio propagation, IEEE Trans. Commun.,
vol. 25, 1977, pp. 673-680. 

[49] J. Takada, Fu Jiye, Zhu Houtao and T. Kobayashi, Spatio-temporal channel characterization
in a suburban non line-of-sight microcellular environment, IEEE J. Selected
Areas Commun., vol. 20, no. 3, 2002, pp. 532-538.

[50] J. Kivinen, X. Zhao and P. Vainikainen, Empirical characterization of wideband 
indoor radio channel at 5.3 GHz, IEEE Trans. Antennas Propagat., vol. 49, 2001, 
pp. 1192-1203. 

[51] H. Masui, K. Takahashi, S. Takahashi, K. Kage and T. Kobayashi, Delay profile 
measurement system for microwave broadband transmission and analysis of delay 
characteristics in an urban environment, IEICE Trans. Electron., vol. E82-C, 1999, 
pp. 1287-1292. 

[52] A.F. Molisch, M. Steinbauer, M. Toeltsch, E. Bonek and R.S. Thoma, Capacity of
MIMO systems based on measured wireless channels, IEEE J. Select. Areas Com- 
mun., vol. 20, 2002, pp. 561-569. 

[53] J. Parks-Gornet and I.N. Imam, Using rank factorization in calculating the Moore- 
Penrose generalized inverse, in IEEE Southeastcon '89. 'Energy and Information 
Technologies in the Southeast', vol. 2, 9-12 April 1989, pp. 427-431.

[54] J. Tokarzewski, System zeros analysis via the Moore-Penrose pseudoinverse and 
SVD of the first nonzero Markov parameter, IEEE Trans. Autom. Control, vol. 43, 
no. 9, 1998, pp. 1285-1291. 

[55] L.P. Withers, Jr, A parallel algorithm for generalized inverses of matrices, with applications
to optimum beamforming, in IEEE Int. Conf. Acoustics, Speech, and Signal
Processing, ICASSP-93, vol. 1, 27-30 April 1993, pp. 369-372.

[56] Shu Wang and Xilang Zhou, Extending ESPRIT algorithm by using virtual array and 
Moore-Penrose general inverse techniques, in IEEE Southeastcon '99, 25-28 March 
1999, pp. 315-318. 

[57] M. Steinbauer, A.F. Molisch and E. Bonek, The double-directional radio channel, 
IEEE Antennas Propagat. Mag., 2001, pp. 51-63. 






[58] M.L. Rubio, A. Garcia-Armada, R.P. Torres and J.L. Garcia, Channel modeling and 
characterization at 17 GHz for indoor broadband WLAN, IEEE J. Select. Areas 
Commun., vol. 20, 2002, pp. 593-601. 

[59] H. Hashemi, The indoor radio propagation channel, Proc. IEEE, vol. 81, no. 7, 1993, 
pp. 943-968. 

[60] K. Bury, Statistical Distributions in Engineering. Cambridge University Press: 
Cambridge, 1999. 

[61] R.P. Torres, L. Valle, M. Domingo, S. Loredo and M.C. Diez, CIN-DOOR: an engineering
tool for planning and design of wireless systems in enclosed spaces, IEEE
Antennas Propagat. Mag., vol. 41, 1999, pp. 11-22.

[62] S. Loredo, R.P. Torres, M. Domingo, L. Valle and J.R. Perez, Measurements and
predictions of the local mean power and small-scale fading statistics in indoor wireless
environments, Microwave Opt. Technol. Lett., vol. 24, 2000, pp. 329-331.

[63] R.P. Torres, S. Loredo, L. Valle and M. Domingo, An accurate and efficient method
based on ray-tracing for the prediction of local flat-fading statistic in picocell radio 
channels, IEEE J. Select. Areas Commun., vol. 18, 2001, pp. 170-178. 

[64] S. Loredo, L. Valle and R.P. Torres, Accuracy analysis of GO/UTD radio channel 
modeling in indoor scenarios at 1.8 and 2.5 GHz, IEEE Antennas Propagat. Mag.,
vol. 43, 2001, pp. 37-51. 

[65] A. Bohdanowicz, Wide band indoor and outdoor radio channel measurements at 
17 GHz, Ubicom. Technical Report/2000/2, February 2000.

[66] A. Bohdanowicz, G. J. M. Janssen and S. Pietrzyk, Wide band indoor and outdoor 
multipath channel measurements at 17 GHz, in Proc. IEEE Vehicular Technology 
Conf., Amsterdam, vol. 4, 1999, pp. 1998-2003. 

[67] L. Talbi and G. Y. Delisle, Experimental characterization of EHF multipath indoor 
radio channels, IEEE J. Select. Areas Commun., vol. 14, 1996, pp. 431-439. 

[68] T.S. Rappaport and C.D. McGillem, UHF fading in factories, IEEE J. Select. Areas
Commun., vol. 7, 1989, pp. 40-48. 

[69] T.S. Rappaport, Indoor radio communications for factories of the future, IEEE Com- 
mun. Mag., 1989, pp. 15-24. 

[70] A.F. Abou-Raddy and S. M. Elnoubi, Propagation measurements and channel mod- 
eling for indoor mobile radio at 10 GHz, in Proc. IEEE Vehicular Technology Conf., 
vol. 3, 1997, pp. 1395-1399. 

[71] A. A.M. Saleh and R.A. Valenzuela, A statistical model for indoor multipath propa- 
gation, IEEE J. Select. Areas Commun., vol. SAC-5, 1987, pp. 128-137. 

[72] Hao Xu, V. Kukshya and T.S. Rappaport, Spatial and temporal characteristics of 
60-GHz indoor channels, IEEE J. Selected Areas Commun., vol. 20, no. 3, 2002, 
pp. 620-630. 

[73] T.S. Rappaport, Wireless Communications: Principles and Practice. Prentice-Hall:
Englewood Cliffs, NJ, 1996.

[74] G. Durgin and T.S. Rappaport, Theory of multipath shape factors for small-scale
fading wireless channels, IEEE Trans. Antennas Propagat., vol. 48, 2000, pp. 682-693.

[75] T. Manabe, Y. Miura and T. Ihara, Effects of antenna directivity and polarization 
on indoor multipath propagation characteristics at 60 GHz, IEEE J. Select. Areas 
Commun., vol. 14, 1996, pp. 441-448.

[76] K. Sato, T. Manabe, T. Ihara, H. Saito, S. Ito, T. Tanaka, K. Sugai, N. Ohmi,
Y. Murakami, M. Shibayama, Y. Konishi and T. Kimura, Measurements of reflection
and transmission characteristics of interior structures of office building in the 60-GHz
band, IEEE Trans. Antennas Propagat., vol. 45, 1997, pp. 1783-1792.

[77] B. Langen, G. Lober and W. Herzig, Reflection and transmission behavior of building
materials at 60 GHz, in Proc. IEEE PIMRC'94, The Hague, September 1994,
pp. 505-509.

[78] J.P. Rossi, J.P. Barbot and A.J. Levy, Theory and measurement of the angle of arrival
and time delay of UHF radiowave using a ring array, IEEE Trans. Antennas Propagat.,
vol. 45, 1997, pp. 876-884.

[79] Y.L.C. de Jong and M.H.A.J. Herben, High-resolution angle-of-arrival measurement
of the mobile radio channel, IEEE Trans. Antennas Propagat., vol. 47, 1999,
pp. 1677-1687.

[80] H. Droste and G. Kadel, Measurement and analysis of wideband indoor propagation
characteristics at 17 GHz and 60 GHz, in Proc. IEE Antennas Propagation Conf.
(publication no. 407), April 1995, pp. 288-291.

[81] C.L. Holloway, P.L. Perini, R.R. Delyzer and K.C. Allen, Analysis of composite walls
and their effects on short-path propagation modeling, IEEE Trans. Vehicle Technol.,
vol. 46, 1997, pp. 730-738.

[82] W. Honcharenko and H. Bertoni, Transmission and reflection characteristics at
concrete block walls in the UHF bands proposed for future PCS, IEEE Trans. Antennas
Propagat., vol. 42, 1994, pp. 232-239.

[83] P. Smulders and L. Correia, Characterization of propagation in 60 GHz radio channels,
Electron. Commun. Eng. J., 1997, pp. 73-80.

[84] H. Xu, T.S. Rappaport, R.J. Boyle and J. Schaffner, Measurements and modeling for
38-GHz point-to-multipoint radiowave propagation, IEEE J. Select. Areas Commun.,
vol. 18, 2000, pp. 310-321.

[85] H. Xu, V. Kukshya and T. Rappaport, Spatial and temporal characterization of 60 GHz
channels, in Proc. IEEE VTC'2000, Boston, MA, vol. 1, 24-28 September, 2000,
pp. 6-13.

[86] D. Cassioli, M.Z. Win and A.F. Molisch, The ultra-wide bandwidth indoor channel: 
from statistical model to simulations, IEEE J. Selected Areas Commun., vol. 20, no. 6, 
2002, pp. 1247-1257. 

[87] L.J. Greenstein, V. Erceg, Y. S. Yeh and M. V. Clark, A new path-gain/delay-spread 
propagation model for digital cellular channels, IEEE Trans. Vehicle Technol., vol. 46,
1997, pp. 477-485. 

[88] V. Erceg, L.J. Greenstein, S.Y. Tjandra, S.R. Parkoff, A. Gupta, B. Kulic, A.A.
Julius and R. Bianchi, An empirically based path loss model for wireless channels 
in suburban environments, IEEE J. Select. Areas Commun., vol. 17, 1999, pp. 1205- 
1211. 

[89] E. Failli, ed., Digital Land Mobile Radio. Final Report of COST 207. Commission
of the European Union: Luxembourg, 1989.

[90] M. Nakagami, The m-distribution - a general formula of intensity distribution of
rapid fading, in Statistical Methods in Radio Wave Propagation, W.C. Hoffman, ed.
Pergamon: Oxford, 1960, pp. 3-36.

[91] W.R. Braun and U. Dersch, A physical mobile radio channel model, IEEE Trans.
Vehicle Technol., vol. 40, 1991, pp. 472-482.

[92] H. Hashemi, The indoor radio propagation channel, Proc. IEEE, vol. 81, 1993,
pp. 943-968.






[93] J.C. Liberti and T.S. Rappaport, Smart Antennas for Wireless Communications: IS- 
95 and Third Generation CDMA Applications. Prentice-Hall: Englewood Cliffs, NJ, 
1999. 

[94] A.A.M. Saleh and R.A. Valenzuela, A statistical model for indoor multipath propa- 
gation, IEEE J. Select. Areas Commun., vol. 5, 1987, pp. 128-137. 

[95] Q. Spencer, M. Rice, B. Jeffs and M. Jensen, A statistical model for the angle-of- 
arrival in indoor multipath propagation, in Proc. IEEE Vehicle Technol. Conf., 1997, 
pp. 1415-1419. 

[96] Q. Spencer, B. Jeffs, M. Jensen and A. Swindlehurst, Modeling the statistical time 
and angle of arrival characteristics of an indoor multipath channel, IEEE J. Select. 
Areas Commun., vol. 18, 2000, pp. 347-360. 

[97] R.J.-M. Cramer, R.A. Scholtz and M.Z. Win, Evaluation of an ultra-wide-band prop- 
agation channel, IEEE Trans. Antennas Propagat., vol. 50, no. 5, 2002, pp. 561-570. 

[98] S.M. Yano, Investigating the ultra-wideband indoor wireless channel, IEEE 55th 
Vehicular Technology Conf., VTC, vol. 3, 6-9 May 2002, pp. 1200-1204. 

[99] D. Cassioli, M.Z. Win and M.F. Molisch, A statistical model for the UWB indoor
channel, IEEE VTC, vol. 2, 2001, pp. 1159-1163.

[100] S.S. Ghassemzadeh, R. Jana, C.W. Rice, W. Turin and V. Tarokh, A statistical path
loss model for in-home UWB channels, 2002 IEEE Conf. Ultra Wideband Systems
and Technologies, Digest of Papers, 21-23 May 2002, pp. 59-64.

[101] W. Turin, R. Jana, S.S. Ghassemzadeh, C.W. Rice and T. Tarokh, Autoregressive
modelling of an indoor UWB channel, IEEE Conference on Ultra Wideband Systems
and Technologies, Digest of Papers, 21-23 May 2002, pp. 71-74.

[102] S.S. Ghassemzadeh and V. Tarokh, UWB path loss characterization in residential
environments, IEEE Radio Frequency Integrated Circuits (RFIC) Symp., 8-10 June
2003, pp. 501-504.

[103] S.S. Ghassemzadeh and V. Tarokh, UWB path loss characterization in residential
environments, IEEE MTT-S International Microwave Symposium Digest, vol. 1, 
8-13 June 2003, pp. 365-368. 

[104] G.H. Golub and C.F. Van Loan, Matrix Computations, 3rd edn. The Johns Hopkins 
University Press: Baltimore, MD, 1996. 

[105] P. Petrus, J.H. Reed and T.S. Rappaport, Effects of directional antennas at the base 
station on the Doppler spectrum, IEEE Commun. Lett., vol. 1, 1997, pp. 40-42. 

[106] W.C.Y. Lee, Estimate of local average power of a mobile radio signal, IEEE Trans. 
Vehicle Technol., vol. 34, 1985, pp. 22-27.

[107] E. Green, Radio link design for micro-cellular systems, Br. Telecom. Technol. J., 
vol. 8, 1990, pp. 85-96. 

[108] U. Dersch and E. Zollinger, Physical characteristics of urban micro-cellular propa- 
gation, IEEE Trans. Antennas Propagat., vol. 42, 1994, pp. 1528-1539. 

[109] Y. Karasawa and H. Iwai, Formulation of spatial correlation statistic in Nakagami-Rice
fading environments, IEEE Trans. Antennas Propagat., vol. 48, 2000, pp. 12-18. 



4 

Adaptive and Reconfigurable 

Link Layer 

4.1 LINK LAYER CAPACITY OF ADAPTIVE AIR INTERFACES 

In wireless systems, the channel reliability is affected by several phenomena, such as the 
propagation properties of the environment and mobility of terminals. To compensate for 
these impairments, various techniques are used at the physical layer, including adaptive 
schemes, which dynamically modify the transceiver structure. As a consequence of varying
channel conditions and dynamically changing transceiver structures, the available
information data rate at the link layer is in general time-varying. The channel seen from above the
physical and link control layer, which will be referred to as a medium access control (MAC)
channel, must be characterized with sufficient accuracy but still by a simple model that can
be used in the analysis of the higher network layers. The MAC channel model
used in this chapter takes into account physical layer characteristics, as well as the
physical channel and some implementation losses. The model is efficient because it
avoids bit-level or signal-level calculations and detailed channel statistics. The result is
a model that can be easily used for analytical purposes, as well as for efficient modeling
of the MAC channel behavior in network simulators. Owing to its modular structure, the
analysis can be extended to more general and possibly more complicated systems. In general,
the physical layer, or layer 1 (L1), provides a virtual link of unreliable bits. For the sake of
simplicity, in this chapter we will use the term physical layer (PHY) when referring to the
protocol stack portions in which no distinction is made regarding the information carried
by the bits. The portions in which such a distinction is made will be referred to as upper
layers. These correspond to L2-L7 of the International Organization for Standardization
Open Systems Interconnection (ISO-OSI) model.

The model discussed in this chapter includes a physical channel and the physical layer
(Figure 4.1) [1]. The same portion of the protocol stack is covered in a link layer model
called the effective capacity link model [2], which directly models some link layer
parameters used in queuing analysis without including imperfections of the physical layer
or adaptation in the link layer. A similar definition of the MAC channel is given in Liebl et
al. [3], where a model for packet losses is included, taking into account the physical channel,
modulation and channel coding, and some other functions of the data link layer, but only
for transceivers with a fixed structure.

Figure 4.1 The MAC channel model that includes the overall behavior of the physical
channel and the PHY. According to the channel state, the PHY mode is chosen.
As a consequence, in each state $i$, a data rate $r_i$ and an error rate $e_i$ are defined. The
model includes imperfections in the implementation of the adaptation method.

In order to extend the Gilbert [4], Gilbert-Elliott [5], Fritchman [6] or bipartite models [7],
regarded as hidden Markovian models [8], to a finer granularity of error rates or a larger dynamic
range with adaptation techniques at the physical layer, finite-state Markov chains with
larger numbers of states have been introduced, e.g. in Wang and Moayeri [9]. The basic
approach is to quantize the signal envelope or the range of the signal-to-interference-plus-noise
ratio (SINR), and to associate each state of the Markov chain with a given range, in order to
model the packet error process in the presence of a fading channel. In Steffan [10], the
error process is modeled as the 'arrival' of errors, with arrival rate varying according to a 
Markov-modulated process (MMP). In the existing literature, the Markov model is often 
considered for Ricean or Rayleigh fading channels. It has also been applied to other statistical 
models, such as Nakagami fading channels [11], or built from error traces obtained from 
measurements [12]. The validity of the first-order Markov model is addressed in References 
[13-18]. The accuracy of the model can be assessed by using information theoretic [15] or 
probabilistic approaches [18]. In Babich and Lombardi [18], it is reported that the first-order 
Markovian model, in which the states represent discrete, non-overlapping intervals of the 
signal envelope's amplitude, is not suitable to model very slow fading channels unless the 
analysis is carried out over a short time duration. However, it is also pointed out that this 
applies to bit-level systems, whereas the model can be valid for block-level systems. The 
performance of adaptive radio links has been studied in the presence of an additive white Gaussian noise (AWGN) channel
[19], or fading channels but without channel coding [20], or for coded systems analyzed 
under specific channel coding schemes and decoding methods [21, 22]. 

The goal of this chapter is not to capture the effects of higher frequency fading, but rather
to represent the behavior of the signal quality level in terms of presence in a region, and to
link it to the behavior of the service offered by the PHY in terms of data rate and error rate,
with a granularity given by the number of PHY modes of the system. When the number
of regions is small, e.g. fewer than eight, the quality level exhibits lower frequencies owing
to practical constraints discussed in the following sections. In this chapter, a continuous-time,
rather than discrete-time, channel model is used. The benefit of this approach is threefold. First,
the model may be integrated more easily into broader analytical models, and the simulation
model that can be derived from it can be efficiently integrated into event-driven simulators,
which are usually more efficient. Second, the model presented in this chapter integrates both
the channel model and link adaptation in a flexible way that is open to generalizations. Third, some
imperfections such as channel estimation error, estimation delay and feedback error, as well
as implementation implications such as switching hysteresis, are included in the model.

4.1.1 The MAC channel model 

We are interested in modeling the properties of the service capacity offered by the PHY to 
the upper layers, rather than in the rigorous characterization of the physical channel. The 
service capacity is basically given by the gross transmission bit rate and the errors at the 
receiver. In the absence of link adaptation, i.e. with one fixed PHY mode, the gross bit rate
is constant across channel states and the error rate absorbs the variability of the channel. 
Conversely, with link adaptation, the error rate is kept bounded whereas the bit rate changes 
with the state of the channel. Our model captures the dynamics of those metrics by modeling 
the channel as seen from above the PHY (Figure 4.1). The model therefore includes the 
physical channel and the PHY characteristics. In this chapter this model is referred to as 
the MAC channel. The MAC channel is modeled as a finite Markov chain (Figure 4.1), 
in which each state corresponds to a PHY mode and is hence associated with a specific 
transceiver configuration. In an adaptive modulation and coding (AMC) system, a mode
corresponds to a pair of modulation and channel coding schemes, and hence to the transmit
bit rate and the error rate. The link service capacity characterized in this chapter, denoted
$R_c(t)$ and defined later, is actually a stochastic process $R_c(t; \xi)$, which depends on the PHY
mode in use, which is selected depending on the signal quality level estimate $\gamma(t; \xi)$. In the
notation to follow, unless otherwise specified, the dependence on time $t$ and realization $\xi$
will be omitted.

4.1.2 The Markovian model

Packet or symbol channel errors may be observed at sampling instants using a model time
unit equal to the packet or symbol interval. This approach is often used for error models in
both symbol and packet level error analysis [9, 23]. In this case, if a Markovian model is
adopted, the resulting model is discrete time.

Channel conditions may be represented by the value of some metric of the link quality
$\gamma$, whose domain is divided into non-overlapping regions. In the model considered in this
chapter, changes in the state of the system coincide with boundary crossings of this metric.
We model the true quality level as a continuous-time Markov chain with the state of the
model representing the signal quality region. Typical fading channels exhibit correlation
between successive values and therefore the system cannot be considered memoryless if
samples of those values are analyzed. However, in our system the process state is represented
by the $\gamma$-region, which corresponds to a PHY mode. This correspondence has been outlined
above and will be described in detail in the following section. The number of PHY modes
is typically small. For example, the high-speed downlink packet access (HSDPA) of the
UMTS terrestrial radio access (UTRA) envisages seven modes, whereas IEEE 802.11a and
the ETSI HIPERLAN/2 have eight and seven modes, respectively.

In a continuous-time model, the system does not sample the specific value of the quality
metric, but rather the region in which the quality metric falls. It is clear that for a small 
number of states the higher frequency correlation in the physical channel is smoothed out 
when considering the jumps from one state to another. The assessment of the validity of the 
Markov property assumption may be feasible for simple statistical physical channel models, 
but may be impossible with more complicated channel models or for models derived from 
measurements. The framework presented in this chapter is intended to be applicable to 
generic channel processes that are continuous, based on known statistics or on heuristics, 
and that can be represented by a small number of states. The true channel quality level is 
modeled as a continuous time Markov chain (CTMC). 

Let us consider a continuous-time finite Markov chain obtained from a stochastic process
$\gamma(t; \xi)$ defined over a time interval $T \subseteq (-\infty, +\infty)$, which assumes values in a continuous
set $\Gamma \subset \mathbb{R}$ split into a finite number of contiguous, non-overlapping intervals. If the process
is continuous, then the CTMC is a birth-death process, since the new value assumed by
a continuous function after an infinitesimal time interval $\Delta t$ can only belong to the same
or a neighboring interval. For our purpose, it is sufficient to assume that the process is
continuous. So, looking at the time instants at which $\gamma$ crosses the target levels, the process
can jump only to neighboring states.
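
The birth-death behavior described above can be illustrated with a minimal event-driven simulation sketch. The function name `simulate_birth_death` and the rate values are hypothetical and chosen only for illustration. The sojourn time in each state is drawn from an exponential distribution, as in any CTMC, and every jump goes to a neighboring state.

```python
import random


def simulate_birth_death(up_rates, down_rates, start_state, horizon, rng):
    """Simulate a birth-death CTMC; return a list of (entry_time, state).

    up_rates[k] is the rate of the jump k -> k+1 and down_rates[k] the rate
    of the jump k -> k-1 (hypothetical inputs, in events per second).
    """
    n_states = len(up_rates)
    t, k = 0.0, start_state
    path = [(t, k)]
    while True:
        up = up_rates[k] if k < n_states - 1 else 0.0
        down = down_rates[k] if k > 0 else 0.0
        total = up + down
        if total == 0.0:
            break  # absorbing state: no further events
        t += rng.expovariate(total)  # exponential sojourn time in state k
        if t >= horizon:
            break
        # Choose the neighbor in proportion to the two outgoing rates.
        k = k + 1 if rng.random() < up / total else k - 1
        path.append((t, k))
    return path


# Illustrative (assumed) rates for a four-state chain.
path = simulate_birth_death(
    up_rates=[2.0, 1.5, 1.0, 0.0],
    down_rates=[0.0, 1.0, 1.5, 2.0],
    start_state=0,
    horizon=50.0,
    rng=random.Random(1),
)
```

Because only state changes generate events, such a loop fits naturally into an event-driven simulator: no work is done between level crossings.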

We now define the elements of the state transition rate matrix, $Q$, also known as the
infinitesimal generator or intensity matrix. With state $S_k$ associated with $\gamma \in [\gamma_k, \gamma_{k+1})$,
see Equation (4.3), the generic elements of $Q$ can be written as

$$
q_{k,k+1} = \frac{p(k, k+1; t, t+\Delta t)}{\Delta t}
= \frac{\Pr\{\text{crossing upwards in } \Delta t \mid S_k\}}{\Delta t}
= \frac{N_{k+1}^{+}\,\Delta t}{\pi_k}\,\frac{1}{\Delta t}
= \frac{N_{k+1}^{+}}{\pi_k} \tag{4.1}
$$

where $N_k^{+}$ is the expected number of times level $k$ is crossed upwards in a second (the level
crossing rate, LCR), and $\pi_k$ is the probability of state $S_k$. Going from $S_k$ to $S_{k+1}$ is defined
by crossing the threshold $\gamma_{k+1}$. Similarly,

$$
q_{k,k-1} = \frac{p(k, k-1; t, t+\Delta t)}{\Delta t} = \frac{N_k^{-}}{\pi_k} \tag{4.2}
$$

For practical reasons described later, a certain hysteresis at switching points is introduced.
In this case, the definition of the LCRs $N_{k+1}^{+}$ and $N_k^{-}$ is modified by considering the levels
$\gamma_{k+1} + \varphi_{k+1}$ and $\gamma_k - \varphi_k$, respectively. The interpretation of the term $\pi_k$ with hysteresis is discussed
later. Note that, by rewriting Equation (4.1) as $p(k, k+1; t, t+\Delta t) = q_{k,k+1}\Delta t + o(\Delta t)$ and
replacing $\Delta t = T_u$, where $T_u$ is the time unit of a discrete time model, we obtain the
expression of the transition probability of the discrete time Markov chain representing
symbol or packet errors, used (without the concept of hysteresis) in Brummer [8] and
related papers [12].
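
As a hedged illustration of Equations (4.1) and (4.2), the sketch below assembles a birth-death generator matrix from level crossing rates and state probabilities, and then forms the first-order discrete-time transition matrix $I + Q T_u$. The function name `generator_from_lcr` and all numerical values are assumptions made for the example. The upward and downward crossing rates of each threshold are set equal, as would be expected for a stationary process.

```python
import numpy as np


def generator_from_lcr(lcr_up, lcr_down, state_prob):
    """Build a birth-death generator following Equations (4.1) and (4.2).

    lcr_up[k]    : upward crossing rate of threshold k (index 0 unused)
    lcr_down[k]  : downward crossing rate of threshold k (index 0 unused)
    state_prob[k]: probability of state S_k
    """
    n = len(state_prob)
    q = np.zeros((n, n))
    for k in range(n):
        if k + 1 < n:
            q[k, k + 1] = lcr_up[k + 1] / state_prob[k]  # Eq. (4.1)
        if k > 0:
            q[k, k - 1] = lcr_down[k] / state_prob[k]    # Eq. (4.2)
        q[k, k] = -q[k].sum()  # each row of a generator sums to zero
    return q


# Hypothetical values for three states separated by thresholds 1 and 2.
lcr_up = [0.0, 4.0, 2.0]
lcr_down = [0.0, 4.0, 2.0]
state_prob = np.array([0.2, 0.5, 0.3])

Q = generator_from_lcr(lcr_up, lcr_down, state_prob)

# Discrete-time approximation with time unit T_u (assumed value, in seconds).
T_u = 1e-3
P = np.eye(len(state_prob)) + Q * T_u
```

With these inputs the state probabilities satisfy $\pi Q = 0$, so the chain reproduces the assumed stationary distribution. The approximation $P \approx I + Q T_u$ is only meaningful when $T_u$ is small compared with the mean sojourn times.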

The quantities in Equations (4.1) and (4.2), namely the level crossing rates and state
probabilities, are found in the literature for some common processes such as Rice and Nakagami.
For more complicated channel models, or for measurement traces for which the terms in the
right-hand sides of Equations (4.1) and (4.2) are not known, those quantities can be easily
evaluated from the observation of the measurement traces or from link-level simulations
of the possibly complicated models, by observing that the LCRs and state probabilities are
$N_k^{\pm} \approx n_k^{\pm}/t$ and $\pi_k \approx t_k/t$. In these expressions, $n_k^{+}$ and $n_k^{-}$ are the numbers of crossings of
level $\gamma_k$ upwards and downwards, respectively, $t_k$ is the time spent in region $S_k$, and $t$ is
the total observation time. By replacing these expressions in Equations (4.1) and (4.2), we
obtain $q_{k,k+1} = n_{k+1}^{+}/t_k$ and $q_{k,k-1} = n_k^{-}/t_k$.
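
The trace-based estimate can be sketched as follows, assuming a uniformly sampled quality trace. The function name `estimate_rates`, the sampling interval `dt` and the toy trace are hypothetical. The sketch counts departures from each region upwards and downwards and divides by the time spent in that region. It also assumes the sampling is fast enough that successive samples never skip a region.

```python
import numpy as np


def estimate_rates(samples, thresholds, dt):
    """Estimate upward and downward transition rates from a sampled trace.

    Returns (q_up, q_down), where q_up[k] approximates q_{k,k+1} and
    q_down[k] approximates q_{k,k-1}.
    """
    regions = np.digitize(samples, thresholds)  # region index of each sample
    n_states = len(thresholds) + 1
    time_in = np.bincount(regions, minlength=n_states) * dt  # t_k
    up = np.zeros(n_states)
    down = np.zeros(n_states)
    for a, b in zip(regions[:-1], regions[1:]):
        if b > a:
            up[a] += 1    # upper boundary of region a crossed upwards
        elif b < a:
            down[a] += 1  # lower boundary of region a crossed downwards
    # Leave the rate at zero for regions that were never visited.
    q_up = np.divide(up, time_in, out=np.zeros(n_states), where=time_in > 0)
    q_down = np.divide(down, time_in, out=np.zeros(n_states), where=time_in > 0)
    return q_up, q_down


# Toy trace (assumed values) with two thresholds, hence three regions.
samples = [0.5, 0.5, 1.5, 1.5, 2.5, 1.5, 0.5, 0.5]
q_up, q_down = estimate_rates(samples, thresholds=[1.0, 2.0], dt=1.0)
```

The estimates from such a short trace are of course very coarse; in practice the observation time should cover many crossings of every threshold.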



4.1.3 Goodput and link adaptation

The received signal quality can be measured, for example, by the SINR, but in the following
we will denote by $\gamma$ a generic metric for the quality level. The signal quality is a function
of a number of contributions, related to both the transceiver structure and settings and the
communication environment. Changes in the time-varying signal quality are due to various
factors which change at different speeds. Changes occurring over periods comparable with the bit
duration, $O(T_{\mathrm{bit}})$, or shorter, are typically dealt with by diversity gain. Slower changes,
occurring over periods comparable with the packet interval, $O(T_{\mathrm{pkt}})$, are handled by changing
PHY schemes and possibly by proper scheduling. Changes of $O(T_{\mathrm{frame}})$ and greater are
typically handled by scheduling or by renegotiations and reconfigurations at higher layers.
In addition to diversity gain and multiplexing gain [24], link adaptation techniques are used
to enhance performance. As a response to the received signal quality $\gamma$, link adaptation
algorithms initiate changes in the transceiver structure $K_r$, keeping a performance metric
$e$ within certain bounds: $e(K_r, \gamma) \in \{\text{acceptable values}\}$. Link adaptation strategies
include changing both modulation and channel coding schemes (referred to in the literature
as adaptive modulation and coding), changing the transmission power, or a combination of the two
[25]. A larger number of transceiver configuration parameters can be adaptively adjusted
according to some strategy. This leads to the concept of PHY mode, an extension of AMC
and in line with the idea of software-defined radio (SDR).

The definition of the reference and control signals that are fed into the adaptation
algorithm is the basis of adaptive systems. These are the target quality and the actual
signal quality, respectively. Basically, there are two possible methods for the definition of
the control signal. The first is to let the transmitter know what the measured quality at
the receiver is (Figure 4.2). Being a closed loop solution, it has the drawbacks that the
signal used by the algorithm does not reflect the channel conditions at the transmission
time, and that the control message itself is subject to errors in the feedback channel.
The other possibility is that the transmitter autonomously estimates the channel, based on
the quality measured at its own side (Figure 4.3). This open loop scheme assumes reciprocity
of the direct and feedback channels. UMTS TDD is an example of a system where
such an approach is possible. In both cases, a related problem is how transmitter and receiver
agree on their configuration. With closed loop, the channel status information is sent from the
receiver to the transmitter, e.g. by piggybacking the information on outgoing packets. In this
receiver-controlled scheme the PHY mode is obviously known at the receiver side.
Conversely, in the open loop case, the mode can be either sent by the transmitter, causing
overhead, or obtained blindly by the receiver, possibly leading to an increase in complexity
and to PHY mode acquisition errors. It must be emphasized that, with both open
and closed loop schemes, it is practically impossible for the transmitter to know what the
actual channel state will be at the receiver at the transmission instant. Imperfections in the
adaptation chain have an impact on the effectiveness of the algorithm. Indeed, the closed
loop scheme is affected by delay, which is not less than the round trip propagation delay,
and possibly by channel state estimation error. The open loop is affected by a much smaller
estimation delay, but the estimate may not match the state at the reception site and time. In
addition to that, the information signaled from the receiver to the transmitter is prone to
errors in the communication channel. All these aspects are taken into account in the model
for imperfections described in the next section.

Figure 4.2 Adaptive radio with receiver-controlled link adaptation, or closed loop control
of mode switching. The receiver does the channel estimation and communicates
the channel quality, or directly commands the transmitter which mode to use.
Numbers indicate the sequence order of operations. Solid lines are data
transmission, whereas dashed lines are control signaling. The metric used as a control
signal, $c(t) \approx \gamma(t - \tau_e)$, is a delayed estimate of the true channel quality value,
possibly affected by errors in the feedback channel.

Figure 4.3 Adaptive radio with transmitter-controlled link adaptation, or open loop control
of mode switching. The transmitter first estimates the state of the channel based
on the received signal, and then transmits data. Some coded information about
the chosen mode must be sent, or complexity at the receiver must be added to
estimate the mode. Numbers indicate the sequence order of operations. Solid
lines are data transmission, whereas dashed lines are control signaling. The
metric used as control signal, $c(t) \approx \gamma(t)$, is an approximated estimate of the
true channel quality value. The value is also affected by the lack of channel
reciprocity in the two directions.

The error rate is a fundamental performance measure for QoS for both delay-insensitive
and delay-sensitive services. According to QoS requirements, certain target error rates
can be identified, and from them, the domain of the signal quality metric can be divided
into $M$ nonoverlapping intervals. Each region is associated with a state of the CTMC
$S = \{S_0, S_1, \ldots, S_{M-1}\}$. According to the adaptation algorithm, one of the $M$ PHY modes
(a modulation and channel coding scheme pair) available in the system is associated with
each region of the signal quality metric: $\mathcal{M} = \{M_0, M_1, \ldots, M_{M-1}\}$. Each state is therefore
characterized by the bit rate and the error rate: $S_i \Rightarrow M_i \Rightarrow r_i, e_i$. The set of the threshold
levels, the mode switching points, $\{\Gamma_i\}$, $i = \{0, \ldots, M\}$, is defined so that:

$$\begin{aligned}
\gamma \in [\Gamma_i, \Gamma_{i+1}) &\Rightarrow S_i \Rightarrow M_i, \quad 0 < i \le M-1 \\
\gamma \in (\Gamma_0, \Gamma_1) &\Rightarrow S_0 \Rightarrow M_0
\end{aligned} \tag{4.3}$$

where $\Gamma_0 = -\infty$ and $\Gamma_M = +\infty$. For the PHY mode $M_i$ the selection must comply with
the requirement $e_i \le e_{\max}$, $0 < i < M$, where $e_{\max}$ is the maximum tolerable error rate, i.e.
the target error rate. By noting that $e(\gamma)$ is a monotonic decreasing function of $\gamma$, we have
$\Gamma_i = e^{-1}(e_{\max})$, evaluated on the error rate curve of the $i$th mode. If the statistics of the
channel process are known, the nominal value of the signal quality metric in each region can
be defined as follows:

$$\bar{\gamma}_i = \int_{\Gamma_i}^{\Gamma_{i+1}} \gamma\, p_\gamma(\gamma)\, d\gamma \Bigg/ \int_{\Gamma_i}^{\Gamma_{i+1}} p_\gamma(\gamma)\, d\gamma \tag{4.4}$$

If not, it can be simply represented by the center value in the range.

Switching points can be determined also using methods other than the one described 
above, including optimization [26]. The model described in this chapter is independent of 
the method used. 
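As an illustration of the threshold definition based on inverting the error rate curve, the following sketch finds a switching point by bisection. It rests on assumptions that are not part of the text: an uncoded BPSK bit error rate on an AWGN channel is used as the error curve, a packet is taken as valid only if all its bits are correct, and the helper names (`ber_bpsk`, `per`, `threshold`) are hypothetical.

```python
import math

def ber_bpsk(snr):
    """Uncoded BPSK bit error rate on an AWGN channel; snr is linear Eb/N0."""
    return 0.5 * math.erfc(math.sqrt(snr))

def per(snr, packet_bits, ber=ber_bpsk):
    """Packet error rate when every bit of the packet must be correct."""
    return 1.0 - (1.0 - ber(snr)) ** packet_bits

def threshold(error_rate, e_max, lo=1e-3, hi=1e3, iters=200):
    """Smallest metric value with error_rate(value) <= e_max, by bisection.

    Assumes error_rate is monotonically decreasing and that
    error_rate(lo) > e_max >= error_rate(hi).
    """
    for _ in range(iters):
        mid = math.sqrt(lo * hi)      # bisect on a logarithmic scale
        if error_rate(mid) > e_max:
            lo = mid
        else:
            hi = mid
    return hi

# Switching point for a 2048-bit packet and a 1% target packet error rate.
gamma_1 = threshold(lambda snr: per(snr, 2048), 1e-2)
```

Any other monotonically decreasing error rate curve, for instance one that includes a coding gain, can be passed to `threshold` in the same way.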



4.1.4 Switching hysteresis

As with all real control systems, if the same threshold value were chosen for both the
rising and the falling process, the system could exhibit too frequent mode switching. A
solution to this known problem is to introduce a hysteresis, fixing two distinct values
for falling and rising thresholds. If $\Gamma_i^-$ and $\Gamma_i^+$ are the falling and rising threshold values,
respectively, the actual threshold values will be defined as:

$$\Gamma_i^{\pm} = \Gamma_i \pm \varphi_i^{\pm} \tag{4.5}$$

With $\varphi_i^+ = \varphi_i^- = \varphi_i = \varphi$, $2\varphi$ is the width of the hysteresis region. Another choice is to
select the switching thresholds $\Gamma_k^{\pm}$ according to the following equation:



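$$\int_{\Gamma_k - \varphi_k^-}^{\Gamma_k} p_\gamma(\gamma)\, d\gamma = \int_{\Gamma_k}^{\Gamma_k + \varphi_k^+} p_\gamma(\gamma)\, d\gamma = \Delta_{CDF} \tag{4.6}$$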



This definition can be applied independently of the method used to define the ideal thresh- 
old levels. For example, an effective definition of switching points could be obtained by 
imposing an upper limit on the probability of spurious switching in a given time. 
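A minimal sketch of mode switching with hysteresis follows, assuming the symmetric case of Equation (4.5) with the same half-width for every threshold. The function name `select_modes`, the starting mode and the numerical values in the usage note are hypothetical.

```python
def select_modes(samples, thresholds, phi):
    """Track the PHY mode index over a sequence of quality samples.

    thresholds -- sorted interior switching points
    phi        -- hysteresis half-width, the same for every threshold
    The mode rises only above threshold + phi and falls only below
    threshold - phi. The tracker starts in mode 0.
    """
    mode = 0
    history = []
    for x in samples:
        # Move up while the rising threshold of the next mode is reached.
        while mode < len(thresholds) and x >= thresholds[mode] + phi:
            mode += 1
        # Move down while the falling threshold of the current mode is undercut.
        while mode > 0 and x < thresholds[mode - 1] - phi:
            mode -= 1
        history.append(mode)
    return history
```

For a metric that wanders around a single threshold at 1.0, for example the samples 0.9, 1.05, 0.95, 1.05, 1.2, 1.05, 0.95, 0.8, setting `phi=0.0` yields four mode switches, whereas `phi=0.1` yields only two.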
4.1.4.1 Net bit rate

The error rate used for expressing QoS requirements is usually the packet error rate (PER)
since it is closer to the user's perception of quality. At the link layer, a packet is considered
valid if no error is present or all errors have been corrected. In our model, the correction
capability of the code is included in its coding gain; therefore, for the packet to be valid, no
residual error is allowed. The packet error rate $p_E$ is therefore given by:

$$p_E = 1 - \{1 - p_e[\gamma G_c(\gamma)]\}^{L_p} \tag{4.7}$$

where $p_e(\gamma)$ is the bit error rate of the uncoded system, $G_c(\gamma)$ is the coding gain, $\gamma$ is the
SINR per bit, and $L_p$ is the packet length in bits. Any channel coding scheme can be included
in this model provided that its performance is summarized in the coding gain curve, $G_c(\gamma)$.
In Equation (4.7), errors in the header and in the payload are assumed equiprobable. In
reality, errors in the header are more critical because they may cause erroneous addressing
and insertion errors. If distinct channel coding schemes are adopted for the two parts, with
a stronger code for the header, the probability of a nonvalid packet is
$p'_E = 1 - (1 - p_{E,h})(1 - p_{E,i})$, where the probabilities of erroneous header and payload are
computed separately as $p_{E,h} = 1 - \{1 - p_e[\gamma G_c(\gamma)]\}^{L_h}$ and $p_{E,i} = 1 - \{1 - p_e[\gamma G_c(\gamma)]\}^{L_i}$,
respectively.

Besides its advantages in achieving reliable transfer of information, channel coding
protection implies a loss in capacity due to the introduced redundancy. Moreover, each
protocol layer adds its own header to the packet as it proceeds downwards through
the protocol stack. We assume in the following that the same channel coding scheme is
applied to both header and payload. The generalization when distinct schemes are adopted
is straightforward. The header efficiency is defined as the ratio between the payload size and
the total packet size without redundancy (i.e. header and payload together, $L_d = L_i + L_h$):
$\eta_h = L_i/L_d$. The channel coding efficiency depends on the added redundancy and is
represented by its code rate $\eta_c = L_d/(L_d + L_o)$. Since the packet size after channel coding and
packetization can be written as $L_p = L_i + L_h + L_o = L_i/(\eta_h \eta_c)$, the bit rate for a given
PHY mode (still including residual errors) can be written as

$$r_i = \frac{k_i}{T_s}\, \eta_h \eta_c \tag{4.8}$$

where $k_i$ is the number of bits per symbol of the $i$th mode modulation scheme, and $T_s$ is the
symbol duration.

4.1.5 Link service rate with exact mode selection

The link service capacity is the rate at which error-free information units are transmitted
through the channel and is referred to as 'goodput'. It is a function of the gross bit rate
or throughput, which includes the corrupted information units, $r$, and the residual error
rate after correction, $e$. By using a similar expression for the link service rate as given in
Kim and Li [27] for ideal systems with fixed PHY, and extending it to time-varying systems,
the goodput at a given instant $t$ can be expressed as

$$R_c(t) = [1 - e(t)]\, r(t) \tag{4.9}$$

In Equation (4.9), $r(t)$ is the gross channel transmission rate, or throughput, which depends
on modulation constellation size, code rate, overheads, etc., and $e(t)$ is the residual
error rate, which depends essentially on modulation scheme, channel coding gain, diversity
combining methods, and physical channel conditions. The residual error rate after
reassembling, $e$, has the same value as the packet error rate defined in Equation (4.7), i.e. $e = p_E$.
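The quantities of this section can be put together in a small numerical sketch. The net bit rate below is computed as bits per symbol times header efficiency times code rate, divided by the symbol duration, which is what the relation $L_p = L_i/(\eta_h \eta_c)$ implies. The function names and the example figures in the usage note are hypothetical.

```python
def packet_error_rate(bit_error_rate, packet_bits):
    """Equation (4.7): a packet is valid only if all its bits are correct."""
    return 1.0 - (1.0 - bit_error_rate) ** packet_bits

def net_bit_rate(bits_per_symbol, symbol_time,
                 payload_bits, header_bits, redundancy_bits):
    """Bit rate of a PHY mode net of header and coding overhead."""
    data_bits = payload_bits + header_bits
    eta_h = payload_bits / data_bits                    # header efficiency
    eta_c = data_bits / (data_bits + redundancy_bits)   # code rate
    return bits_per_symbol * eta_h * eta_c / symbol_time

def goodput(rate, error_rate):
    """Equation (4.9): rate at which error-free information is delivered."""
    return (1.0 - error_rate) * rate
```

For example, a QPSK mode (2 bits per symbol) with a 1 microsecond symbol, 1800 payload bits, 248 header bits and 2048 redundancy bits has a net rate of about 0.88 Mb/s; multiplying by one minus the packet error rate gives the goodput of that mode.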

We further extend Equation (4.9) to adaptive systems. In adaptive systems the transceiver
configuration settings change in time depending on the channel state at time $t$, denoted by
$S(t) = i$ or $S_i$. The PHY mode ideally associated with each region and used at time $t$
is denoted with $M(t) = i$ or $M_i$. To simplify notation, whenever possible we drop the
dependence on time $t$. In the presence of ideal link adaptation, the service capacity is obtained
by averaging across all the modes, resulting in

$$R_c = \sum_{i=0}^{M-1} [1 - e_i]\, r_i\, \pi_i \tag{4.10}$$

where the subscript $i$ denotes the mode, $M$ is the number of implemented modes, and $\pi_i$ is
the probability of the channel being in state $i$ and using the $i$th mode.



4.1.5.1 Model of impairments

In Equation (4.10), the service capacities for all modes can be represented by a vector
because the effective and chosen states are assumed to coincide. In case of erroneous mode
selection, the gross bit rate depends on the used mode whereas the error rate depends on
both the used mode and the effective state. Hence, in the presence of imperfections the two
contributions must be separated. In the sequel, we will use the following notation.

The throughput vector $\mathbf{r} = \{r_i\} \in \mathbb{R}^{M}$ includes the transmit rates for each PHY mode
and is defined as $\mathbf{r}^T = [r_0 \cdots r_{M-1}]$ with $r_i$ given by Equation (4.8). The normalized
goodput matrix $\mathbf{Y} = \{y_{ij}\} \in \mathbb{R}^{M \times M}$ includes the effects of errors due to corruption in the
channel for each PHY mode and each state, and expresses the useful bit rate normalized to
the transmit rate in each mode. The generic element of matrix $\mathbf{Y}$ is defined as $y_{ij} = 1 - e_{ij}$,
where $e_{ij}$ denotes the error rate when the mode $M_i$ is used and the channel is in effective
state $S_j$. The elements of $\boldsymbol{\pi}$, $\boldsymbol{\pi}^T = [\pi_0 \cdots \pi_{M-1}]$, are the state probabilities of the
CTMC. The mode selection is made by the adaptation algorithm based on estimated
channel conditions. If the adaptive system follows the channel state rapidly and without errors,
the average goodput, Equation (4.10), is expressed with the new notation by:

$$R_c\big|_{ideal} = \sum_{i=0}^{M-1} r_i (1 - e_{ii})\, \pi_i \tag{4.11}$$

or, in compact form:

$$R_c\big|_{ideal} = \mathbf{r}^T \mathrm{diag}(\mathbf{Y}\boldsymbol{\Pi}) \tag{4.12}$$

where $\boldsymbol{\Pi} = \mathrm{diag}(\boldsymbol{\pi})$ is the diagonal matrix having as diagonal elements the elements of
vector $\boldsymbol{\pi}$, and $\mathrm{diag}(\mathbf{A}) = [a_{ii}]$ is the operator that extracts the vector of diagonal elements of
matrix $\mathbf{A}$: $\mathrm{diag}(\mathbf{A}) = (\mathbf{A} \odot \mathbf{I})\mathbf{u}$, where $\mathbf{I}$ is the identity matrix, $\mathbf{u}$ is the vector having
all elements equal to 1, and $\odot$ is the Hadamard-Schur product.
4.1.6 Imperfections in the adaptation chain

A real adaptive communication system is prone to erroneous setup of the transceiver
configuration. Estimate errors, estimation delay and feedback errors have so far been neglected.
The actual properties of the related errors depend on the particular system configuration.
In this chapter, we are not interested in studying a specific system, but rather we want to
model common sources of errors in a flexible way open to generalizations.

In reality, the true channel condition is hidden. A delayed or erroneous version of it is
visible instead and, based on that, the PHY mode is chosen. The parameter used for the
mode switching can be expressed as $\hat{\gamma}(t) = \gamma(t - \tau_e) + \varepsilon_e + \varepsilon_f$, where $\gamma$ and $\hat{\gamma}$ are the true
and the estimated value, respectively, $\varepsilon_e$ is the estimate error, $\varepsilon_f$ is the feedback error, and
$\tau_e$ is the estimation delay or estimation lag. Among these three terms, the first one has the
more important effect in the open loop case, whereas, in the closed loop case, the other two
terms are dominant.

The model of imperfections is depicted in Figure 4.4. At estimation time $t$ the true
channel quality metric falls in the $k$th region, say $S(t) = k$, and the metric is estimated. Based
on this estimate, $\hat{S}(t) = h$, the mode is selected, where indices $k$ and $h$ may differ because
of errors in the estimation process. Since the PHY modes set $\mathcal{M}$ and the signal quality
metric regions set $S$ have the same cardinality, the code can unambiguously
represent either the quality metric region or the mode itself. After the estimation delay,
the mode, subject to errors in the feedback channel, is transmitted and $M(t + \tau_e) = i$ is
acquired at the transmitter (or at the receiver, in the open loop case). This mode is used for
transmission (Figure 4.2; or reception, Figure 4.3). In the meantime, the effective channel
quality metric falls in general in a new region, $S(t + \tau_e) = j$, due to possible channel state
transitions during the estimation delay $\tau_e$. The model described here is unaffected by whether
estimator and mode selector are implemented at the transmitter or the receiver side, provided
that proper values are given to the related quantities. For example, in the open loop case,
errors on the control message sent to the receiver through a separate channel are modeled
with the feedback error. The final effect of the entire process is that in general mode $M_i$ is
used at transmission time, when the channel is in effective state $S_j$. According to the notation
above, the probability of this event can be written as $P(M_i, M_h, S_k, S_j)$. Because the estimation
process and the channel process are clearly independent, and assuming that the feedback
channel is independent of the direct channel, we can write

$$\begin{aligned}
P(M_i, M_h, S_k, S_j) &= P(M_h, S_k)\, P(M_i, S_j \mid M_h, S_k) \\
&= P(M_h, S_k)\, P(M_i \mid M_h, S_k)\, P(S_j \mid M_i, M_h, S_k) \\
&= P(M_h, S_k)\, P(M_i \mid M_h)\, P(S_j \mid S_k) \\
&= P(S_k)\, P(M_h \mid S_k)\, P(M_i \mid M_h)\, P(S_j \mid S_k)
\end{aligned} \tag{4.13}$$

The three last terms in Equation (4.13) are treated separately in the sequel and their effect
on the goodput is then combined by averaging independently the appropriate expressions.

[Figure 4.4 block labels: true channel state (at estimation time), estimated channel state,
acquired channel state, effective channel state (at transmission time); estimation process,
feedback process, channel process.]

Figure 4.4 The model of imperfections of the channel estimation process. Based on the
true channel state at estimation time, an estimate subject to errors is obtained.
Its identification code is sent as a message through the error-prone feedback
channel and is acquired as the selected state. During the estimation delay, the
channel state may change due to its dynamics.



4.1.7 Estimation process and estimate error

In the absence of estimation delay and feedback errors, a possibly erroneous estimate
of the metric at time $t$ is used to choose the mode at time $t$, i.e. $\hat{\gamma}(t) \to M(t)$. The mode
estimation probability matrix $\mathbf{H}^{(e)} = \{h^{(e)}_{hk}\} \in \mathbb{R}^{M \times M}$ models the effects of the imperfections
in the estimation process due to noise. Its generic element $h^{(e)}_{hk}$ represents the probability of
selecting mode $M_h$ when the true channel is in state $S_k$ at estimation time:

$$\mathbf{H}^{(e)} = \{h^{(e)}_{hk} = \Pr\{M_h \mid S_k\}\} \tag{4.14}$$

Equation (4.14) represents the second term in Equation (4.13).



4.1.8 Channel process and estimation delay

Now we study the probability that the true state has changed during the delay $\tau_e$. Consider
the presence of estimation delay and the absence of estimate error and feedback errors. In other
words, the mode at time $t + \tau_e$ is chosen depending on the exact estimate of the metric at
time $t$, i.e. $\gamma(t) \to M(t + \tau_e)$. The delayed channel transition matrix $\mathbf{H}^{(d)} = \{h^{(d)}_{kj}\} \in \mathbb{R}^{M \times M}$
models the effects of estimation delay. Its generic element $h^{(d)}_{kj}$ represents the probability
that, given that the true channel was in state $k$ at estimation time, at transmission time the
channel is in state $j$:

$$\mathbf{H}^{(d)} = \{h^{(d)}_{kj} = \Pr\{S(t + \tau_e) = j \mid S(t) = k\}\} \tag{4.15}$$

Equation (4.15) represents the last term in Equation (4.13).
4.1.9 Feedback process and mode command reception

The code representing the PHY mode or, equivalently, the related signal quality region, may
be corrupted during its transmission through the feedback channel. The mode acquisition
probability matrix $\mathbf{H}^{(f)} = \{h^{(f)}_{ih}\} \in \mathbb{R}^{M \times M}$ models the probability of mode error due to errors
in the feedback channel and has a generic element given by:

$$\mathbf{H}^{(f)} = \{h^{(f)}_{ih} = \Pr\{M_i \mid M_h\}\} \tag{4.16}$$

Equation (4.16) represents the third term in Equation (4.13).

4.1.10 Link service rate with imperfections

Combining Equations (4.11), (4.12) and (4.13), and remembering the model in Figure 4.4,
we get the expression of the average goodput, i.e. the link service rate in the presence of
imperfections:

$$R_c = \sum_{i=0}^{M-1} r_i \sum_{j=0}^{M-1} (1 - e_{ij}) \sum_{h=0}^{M-1} h^{(f)}_{ih} \sum_{k=0}^{M-1} h^{(e)}_{hk}\, \pi_k\, h^{(d)}_{kj} \tag{4.17}$$

The average goodput in Equation (4.17) can be written in compact form as:

$$R_c = \mathbf{r}^T \mathrm{diag}(\mathbf{Y}\boldsymbol{\Theta}^T) \tag{4.18}$$

where $\boldsymbol{\Theta} = \{\theta_{ij}\} \in \mathbb{R}^{M \times M}$ is the mode-channel probability matrix:

$$\boldsymbol{\Theta} = \mathbf{H}^{(f)} \mathbf{H}^{(e)} \boldsymbol{\Pi} \mathbf{H}^{(d)} = \boldsymbol{\Omega} \boldsymbol{\Pi} \mathbf{H}^{(d)} \tag{4.19}$$

having generic element

$$\theta_{ij} = \sum_{h=0}^{M-1} h^{(f)}_{ih} \sum_{k=0}^{M-1} h^{(e)}_{hk} \sum_{m=0}^{M-1} \pi_{km}\, h^{(d)}_{mj} = \sum_{h=0}^{M-1} h^{(f)}_{ih} \sum_{k=0}^{M-1} h^{(e)}_{hk}\, \pi_k\, h^{(d)}_{kj} \tag{4.20}$$

Equation (4.20) models the probability of using mode $M_i$ with the channel in the effective
state $S_j$. $\boldsymbol{\Omega}$ is the equivocation matrix, which includes the effects of the link layer adaptation
process. The innermost summation in Equation (4.17),
$\mathbf{H}^{(e)} \boldsymbol{\Pi} \mathbf{H}^{(d)} = \{\sum_{k=0}^{M-1} h^{(e)}_{hk} \pi_k h^{(d)}_{kj}\}$, is
the probability of estimating the mode $M_h$ when the channel is in effective state $S_j$ (at
transmission time), averaged over all possible true channel states $S_k$ (at estimation time).
The term $\boldsymbol{\Theta} = \mathbf{H}^{(f)} \mathbf{H}^{(e)} \boldsymbol{\Pi} \mathbf{H}^{(d)} = \{\sum_{h=0}^{M-1} h^{(f)}_{ih} \sum_{k=0}^{M-1} h^{(e)}_{hk} \pi_k h^{(d)}_{kj}\}$ is the probability of using
mode $M_i$ when the channel is in state $S_j$ at transmission time, averaged over all possible
estimated modes $M_h$. The term
$\mathrm{diag}(\mathbf{Y}\boldsymbol{\Theta}^T) = \{\sum_{j=0}^{M-1} (1 - e_{ij}) \sum_{h=0}^{M-1} h^{(f)}_{ih} \sum_{k=0}^{M-1} h^{(e)}_{hk} \pi_k h^{(d)}_{kj}\}$
is the normalized average goodput vector representing the goodput when using mode $M_i$,
averaged over all possible channel states at transmission time $S_j$. Finally, the average
goodput is given by averaging the quantity above over all possible used modes, weighted by
the transmit rate, thus removing the previous normalization by the product $\mathbf{r}^T \mathrm{diag}(\mathbf{Y}\boldsymbol{\Theta}^T)$,
leading to Equation (4.17). The scalar normalized average goodput is obtained by summing
the elements of the normalized average goodput vector:

$$R_r = |\mathrm{diag}(\mathbf{Y}\boldsymbol{\Theta}^T)| \tag{4.21}$$

$R_r$ corresponds to the average goodput of a system adopting a hypothetical set of PHY
modes all having unitary transmission rate. The use of this quantity will be elaborated later.
In the ideal case of perfect link adaptation, the channel quality metric is estimated without
errors, tracked without delay, and communicated to the transmitter without errors, i.e.
$\mathbf{H}^{(e)} = \mathbf{H}^{(d)} = \mathbf{H}^{(f)} = \mathbf{I}$, $\boldsymbol{\Omega}$ reduces to the identity matrix, and we have $\boldsymbol{\Theta} = \boldsymbol{\Pi}$. As a result,
under the assumptions defined above, the definition of average goodput given in Equations
(4.17) and (4.18) reduces to the one of the ideal case, Equations (4.11) and (4.12). Single
PHY mode systems are considered in this model as a special case with errorless feedback
channel, $\mathbf{H}^{(f)} = \mathbf{I}$, instantaneous estimation, $\mathbf{H}^{(d)} = \mathbf{I}$, and static mode selection leading to
a matrix $\mathbf{H}^{(e)}$ having as nonzero elements all 1 in the $i$th row, if mode $M_i$ is implemented
in the system: $h^{(e)}_{hk} = \delta_{hi}\delta_{kk}$, where $\delta_{ij}$ is the Kronecker delta.
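The compact form of Equations (4.18) and (4.19) can be checked numerically with a short sketch. Plain Python lists are used as matrices; the helper names and the numerical values in the usage note are hypothetical, and the matrices follow the index conventions of Equations (4.14) to (4.16).

```python
def matmul(a, b):
    """Product of two matrices stored as lists of rows."""
    return [[sum(x * y for x, y in zip(row, col)) for col in zip(*b)]
            for row in a]

def identity(n):
    """n-by-n identity matrix."""
    return [[1.0 if i == j else 0.0 for j in range(n)] for i in range(n)]

def average_goodput(r, y, pi, h_f, h_e, h_d):
    """Return (R_c, R_r) with Theta = H_f H_e Pi H_d.

    r   -- transmit rate of each mode
    y   -- normalized goodput matrix, y[i][j] = 1 - e[i][j]
    pi  -- state probabilities of the channel
    """
    n = len(pi)
    big_pi = [[pi[i] if i == j else 0.0 for j in range(n)] for i in range(n)]
    theta = matmul(matmul(matmul(h_f, h_e), big_pi), h_d)
    # diag(Y Theta^T)_i = sum_j y[i][j] * theta[i][j]
    normalized = [sum(y[i][j] * theta[i][j] for j in range(n))
                  for i in range(n)]
    return sum(r[i] * normalized[i] for i in range(n)), sum(normalized)
```

With all three imperfection matrices equal to the identity, the first returned value coincides with the ideal goodput of Equation (4.11). Replacing `h_e` with a matrix whose columns spread some probability onto neighbouring modes lowers the result whenever the mismatched entries of `y` are smaller than the matched ones.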

4.1.10.1 System performance

For illustration purposes, we apply the above analysis tool to the study of a sample system
and investigate its sensitivity to some imperfections or implementation constraints.

4.1.10.2 System parameters

The PHY modes include a no-transmission mode for values of the signal quality that do not
comply with minimum service requirements. The modulation schemes used in the numerical
analysis are BPSK, QPSK, 8QAM and 16QAM. The channel coding scheme used is the
half rate convolutional code having generator polynomials $g_0 = 133_8$ and $g_1 = 171_8$, and
constraint length $K = 7$. Coding gain curves for the half rate and punctured versions of that
mother code, having $\eta_c = 2/3$ and $\eta_c = 3/4$, are obtained from standard sources [28, 29].
The minimum signal to interference-plus-noise ratio for the coded system, $\gamma_{\min}$, is obtained
from $\mathrm{BER}_{\max} = \mathrm{BER}(\gamma'_{\min})$, knowing the coding gain for the given channel coding scheme:
$\gamma_{\min} = \gamma'_{\min}/G_c$. Inverses of the BER of all schemes can be efficiently obtained from the
approximated parametric function $\mathrm{BER}(\gamma)$, valid for all schemes [25], or by numerical
inversion of exact expressions [30]. In the system used for numerical analysis, switching
thresholds are obtained from the required PER for a packet length of 2048 bits.

We assume a Ricean fading process [31]. The expression for the LCR for the case of a
line-of-sight component with zero Doppler frequency, uncorrelated Gaussian noise components,
and Jakes-shaped Doppler power spectral density or isotropic scattering can be found in
[32, 33]. Under the assumptions above, and using the corresponding LCR, Equations (4.1)
and (4.2) can be written as

$$q_{k,k+1} = f_{\max}\, \sigma_0 \sqrt{\pi}\; p_\xi(\xi_{k+1}; K_f, \sigma_0) \big/ \Delta P_\xi(\xi_k, \xi_{k+1}; K_f, \sigma_0)$$

and

$$q_{k,k-1} = f_{\max}\, \sigma_0 \sqrt{\pi}\; p_\xi(\xi_k; K_f, \sigma_0) \big/ \Delta P_\xi(\xi_k, \xi_{k+1}; K_f, \sigma_0)$$

where $f_{\max}$ is the maximum Doppler frequency, $\sigma_0$ is the standard deviation of the quadrature
Gaussian components, $p_\xi(\xi_k; K_f, \sigma_0)$ is the value of the PDF of the Ricean distribution
having Rice factor $K_f$ at signal level $\xi_k$, and $\Delta P_\xi(\xi_k, \xi_{k+1}; K_f, \sigma_0)$ is the difference of the
Rice cumulative distribution function (CDF) at points $\xi_{k+1}$ and $\xi_k$.
Table 4.1 Channel model parameters. State probabilities, level crossing rates downwards and
upwards, and birth and death rates for the CTMC with five states used in the examples.



4.1.10.3 Channel model statistics

Parameters of the channel model for the example system are given in Table 4.1. The match of
the statistics of the channel model is verified by comparing [34] the histogram obtained by
simulation of the CTMC with the theoretical solution of the CTMC and with the theoretical
PDF of the underlying fading process.

Figure 4.5 shows a snapshot of the time series obtained by simulations with the above CTMC
model with $f_{\max} = 300$ Hz. It can be seen that very large changes in the metric (about one
order of magnitude) occur in less than 1 ms around 3.5 ms, very rapid changes occur around
7 ms, and finally relatively long periods with steady values are seen on the right-hand side
of the diagram. The system with $f_{\max} = 1$ Hz simulated over 50 000 values, corresponding
to a simulated time of $T_{sim} \approx 4.6$ h, exhibits $T_{it,\min} \approx 6.2$ μs and $T_{it,\max} \approx 5.5$ s as minimum
and maximum of the inter-transition time, which depends on both the Doppler frequency
of the fading process and the size of the signal quality regions.

4.1.11 Sensitivity of state probabilities to hysteresis region width

Consider the probability of staying in region $S_k$, $\pi_k$. The $k$th state is entered by crossing
levels $\Gamma_k + \varphi_k$ or $\Gamma_{k+1} - \varphi_{k+1}$ when entering from below and above, respectively. This
implies that in this case the actual width of the region is narrowed. Similarly, the state is left
by crossing levels $\Gamma_k - \varphi_k$ or $\Gamma_{k+1} + \varphi_{k+1}$ when going below and above, respectively, and
the actual width is now broadened. Although the ingress rate roughly equals the egress rate,
as can be seen in Table 4.1, the impact of hysteresis also depends on the state probabilities.
In fact, the state probabilities after introducing the hysteresis can be written as:

$$\pi_i^{(h)} = \pi_i + \Delta_{CDF}^{(i+1,+)} \pi_i + \Delta_{CDF}^{(i,-)} \pi_i - \Delta_{CDF}^{(i+1,-)} \pi_{i+1} - \Delta_{CDF}^{(i,+)} \pi_{i-1} + o(\Delta_{CDF}) \tag{4.22}$$

where $\pi_i^{(h)}$ and $\pi_i$ are the state probabilities with and without hysteresis, respectively, and
$\Delta_{CDF}^{(i,\pm)}$ is the half-width of the hysteresis region $i$ at the upper (+) and lower (−) side, respectively.
In case of equal and symmetric hysteresis regions and neglecting higher order infinitesimals,
we have

$$\pi_i^{(h)} \approx \pi_i + \Delta_{CDF}\,(2\pi_i - \pi_{i+1} - \pi_{i-1}) \tag{4.23}$$

Figure 4.6 shows the sensitivity of state probabilities to $\Delta_{CDF}$ in this latter case, with regions
defined as in Equation (4.6). For larger values of $\Delta_{CDF}$, states with larger probability, the
central ones in our case, have their probability further increased by the introduction of
hysteresis. It can be observed that up to about $\Delta_{CDF} = 10^{-3}$, state probabilities are almost
unaffected.

Figure 4.5 Converted snapshot of a signal level, as a function of time, obtained from the
time series generated by simulations with the CTMC model for $f_{\max} = 300$ Hz. It
can be seen that very large changes in the metric (about one order of magnitude)
occur in less than 1 ms around 3.5 ms, very rapid changes occur around 7 ms,
and finally relatively long periods with steady values are seen on the right-hand
side of the diagram.
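The first-order correction of Equation (4.23) is easy to evaluate numerically. The sketch below makes one assumption that is not spelled out in the text: the lowest and highest states have a single finite threshold, so only the terms of that threshold are kept for them, which keeps the probabilities summing to one. The function name and the values in the usage note are hypothetical.

```python
def with_hysteresis(pi, delta_cdf):
    """First-order state probabilities after introducing hysteresis.

    pi        -- state probabilities without hysteresis
    delta_cdf -- half-width of each hysteresis region in CDF units
    """
    m = len(pi)
    out = []
    for i in range(m):
        gain = 0.0
        if i + 1 < m:            # upper threshold exists
            gain += pi[i] - pi[i + 1]
        if i > 0:                # lower threshold exists
            gain += pi[i] - pi[i - 1]
        out.append(pi[i] + delta_cdf * gain)
    return out
```

For five states with probabilities 0.1, 0.2, 0.4, 0.2, 0.1 and a half-width of 0.01, the central state grows from 0.4 to 0.404 while the outer states shrink, in line with the trend described for Figure 4.6.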



4.1.12 Estimation process and estimate error

Under the assumption of constant noise power, the problem of studying the SINR is reduced
to the study of the signal envelope. The envelope is estimated with an estimation error. The
distribution of the estimation error depends on the specific adopted estimation technique
and analytical and/or empirical distributions for estimation error are generally unknown
[20]. For these reasons, and because we want to illustrate the model rather than to provide
performance analysis for a specific scenario, no assumption is made here on the structure of
the estimator. For the example below, we assume the error to be Gaussian distributed. Under
these assumptions, the generic element of matrix $\mathbf{H}^{(e)}$, the probability
of selecting mode $M_h$ when the true state is $S_k$, can be written as:

$$h^{(e)}_{hk} = 0.5 \left\{ \mathrm{erfc}\left[ \frac{\inf(\xi_h) - \bar{\xi}_k}{\sqrt{2}\,\sigma_e} \right] - \mathrm{erfc}\left[ \frac{\sup(\xi_h) - \bar{\xi}_k}{\sqrt{2}\,\sigma_e} \right] \right\} \tag{4.24}$$

In Equation (4.24) $\mathrm{erfc}(\cdot)$ is the complementary error function, $\sigma_e$ is the standard deviation of
the estimation error, $\bar{\xi}_k$ is the nominal value of the metric in the $k$th interval $S_k$, and $\inf(\xi_h)$
and $\sup(\xi_h)$ are the lower and upper boundaries, respectively, of interval $S_h$. Figure 4.7
shows the comparison of simulation and theoretical results for the sensitivity of the average
goodput $R_c$ to estimate errors, under the assumptions given above in the non-coherent case,
assuming absence of estimation delay and error-free feedback channel.

Figure 4.6 Sensitivity of state probabilities to $\Delta_{CDF}$. Up to about $\Delta_{CDF} = 10^{-3}$, state
probabilities are unaffected. For larger values of $\Delta_{CDF}$, states with large probability,
the central ones in our case, have their probability further increased by the
introduction of hysteresis. Bars are simulated values, whereas points are the
theoretical values given by Equation (4.23).
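For a Gaussian estimation error centred on the nominal value of the true region, the mode estimation matrix can be sketched as below. The outermost boundaries are taken as minus and plus infinity, which is an assumption of this example that makes every column sum to one; the function name and the values in the usage note are hypothetical.

```python
import math

def estimation_matrix(bounds, nominal, sigma_e):
    """Mode estimation probabilities h[h][k] = Pr{M_h | S_k}.

    bounds  -- increasing region boundaries, len(nominal) + 1 values
    nominal -- nominal metric value of each region
    sigma_e -- standard deviation of the Gaussian estimation error
    """
    scale = math.sqrt(2.0) * sigma_e
    m = len(nominal)
    return [[0.5 * (math.erfc((bounds[h] - nominal[k]) / scale)
                    - math.erfc((bounds[h + 1] - nominal[k]) / scale))
             for k in range(m)]
            for h in range(m)]

# Three regions split at 1 and 2, nominal values at the region centres.
inf = float("inf")
h_e = estimation_matrix([-inf, 1.0, 2.0, inf], [0.5, 1.5, 2.5], 0.3)
```

As the standard deviation `sigma_e` shrinks, the matrix approaches the identity, which is the case of perfect estimation.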



4.1.12.1 Channel process and estimation delay

The variability in time of the quality metric depends on both the fading characteristics
(coherence time) and the width of the state regions and is expressed by the inter-transition
time, i.e. the time between state changes. In order to simplify the analysis, we assume
that the estimation delay is negligible compared with the minimum inter-transition time.
For our analysis it is sufficient to assume that, $\forall i \in [0, M-1]$, $\lambda_i \tau_e = q_{i,i+1}\tau_e \ll 1$ and
$\mu_i \tau_e = q_{i,i-1}\tau_e \ll 1$, where $M$ is the cardinality of the sets of PHY modes and quality regions.
For the Poisson process, the reciprocal of the transition rate is the mean inter-transition time.
The previous constraint states basically that the estimation delay should be much smaller
than the mean inter-transition time.

Figure 4.7 Sensitivity of average goodput to variance of channel estimation error. Average
goodput $R_c$ (solid line), scalar normalized average goodput $R_r$ (dashed line),
and simulations (crosses).



Under the previous assumptions, at most one state change can occur during $\tau_e$, and the
probability of channel state transition can be written as:

$$h^{(d)}_{ij} = \begin{cases}
\lambda_i \tau_e & j = i + 1 \\
\mu_i \tau_e & j = i - 1 \\
1 - (\lambda_i + \mu_i)\tau_e & j = i \\
0 & \text{otherwise}
\end{cases} \tag{4.25}$$

It is straightforward to see that

$$\mathbf{H}^{(d)} = \mathbf{I} + \mathbf{Q}\tau_e \tag{4.26}$$

where $\mathbf{Q}$ is the rate matrix of the CTMC and $\mathbf{I}$ is the identity matrix. In particular, in case of
instantaneous channel estimation, matrix $\mathbf{H}^{(d)}$ reduces to the identity matrix. The probability
of no transition from the $i$th state is $1 - (\lambda_i + \mu_i)\tau_e$. This term is null for $\tau_e = 1/(\lambda_i + \mu_i)$.
Therefore, the quantity

$$T_c = \min_{i = \{0, \ldots, M-1\}} \{1/(\lambda_i + \mu_i)\}$$

has the meaning of coherence time in our model. Considering the case of perfect estimation
and errorless feedback channel, and under the previous assumptions for the estimation delay
model, the average goodput is expressed by:

$$\begin{aligned}
R_c(\tau_e) &= \mathbf{r}^T \mathrm{diag}(\mathbf{Y}\boldsymbol{\Theta}^T) = \mathbf{r}^T \mathrm{diag}[\mathbf{Y}(\mathbf{I} + \mathbf{Q}^T \tau_e)\boldsymbol{\Pi}] \\
&= \mathbf{r}^T \mathrm{diag}(\mathbf{Y}\boldsymbol{\Pi}) + \tau_e\, \mathbf{r}^T \mathrm{diag}(\mathbf{Y}\mathbf{Q}^T \boldsymbol{\Pi}) = R_c(0) + \tau_e \Psi_d
\end{aligned} \tag{4.27}$$

where

$$\Psi_d = \frac{dR_c}{d\tau_e} = \mathbf{r}^T \mathrm{diag}(\mathbf{Y}\mathbf{Q}^T \boldsymbol{\Pi}) \tag{4.28}$$

is the drift of the average goodput from the ideal conditions due to the estimation delay.
Replacing $\mathbf{Q}$ in Equation (4.28), the explicit expression of $\Psi_d$ is:

$$\Psi_d = \frac{dR_c}{d\tau_e} = \sum_{i=0}^{M-1} r_i \pi_i \left[ \mu_i (e_{ii} - e_{i,i-1}) + \lambda_i (e_{ii} - e_{i,i+1}) \right] \tag{4.29}$$
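The linear dependence of the goodput on the estimation delay can be verified with a small sketch of the birth-death model. It assumes perfect estimation and errorless feedback, as in the derivation above; the function names and the rates used in the usage note are hypothetical.

```python
def delay_matrix(lam, mu, tau_e):
    """First-order transition probabilities, H_d = I + Q * tau_e.

    lam[i], mu[i] -- birth and death rates of state i; the birth rate of
                     the last state and the death rate of the first are unused
    """
    m = len(lam)
    h = [[0.0] * m for _ in range(m)]
    for i in range(m):
        up = lam[i] * tau_e if i + 1 < m else 0.0
        down = mu[i] * tau_e if i > 0 else 0.0
        if i + 1 < m:
            h[i][i + 1] = up
        if i > 0:
            h[i][i - 1] = down
        h[i][i] = 1.0 - up - down
    return h

def goodput_with_delay(r, y, pi, lam, mu, tau_e):
    """Average goodput when mode i is chosen in state i and the channel moves."""
    h = delay_matrix(lam, mu, tau_e)
    m = len(pi)
    return sum(r[i] * pi[i] * sum(y[i][j] * h[i][j] for j in range(m))
               for i in range(m))

def drift(r, y, pi, lam, mu):
    """Slope of the goodput with respect to the delay, with y = 1 - e."""
    m = len(pi)
    total = 0.0
    for i in range(m):
        up = lam[i] * (y[i][i + 1] - y[i][i]) if i + 1 < m else 0.0
        down = mu[i] * (y[i][i - 1] - y[i][i]) if i > 0 else 0.0
        total += r[i] * pi[i] * (up + down)
    return total
```

Evaluating `goodput_with_delay` at a small delay and comparing it with the zero-delay value plus the delay times `drift` reproduces the decomposition of Equation (4.27). The approximation is meaningful only while the delay stays well below the coherence time $T_c$.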



4.1 .1 3 Feedback process and acquisition errors 

To identify a PHY mode, a control message with $\lceil \log_2 M \rceil$ bits is used. The distance in bits among all pairs of codewords is given by the elements of a symmetric matrix having null elements on the diagonal, and defined by

$$\mathbf{D} = \{d_{ij}\}, \qquad d_{ij} = \sum_{n=1}^{\lceil \log_2 M \rceil} w_i^n \otimes w_j^n \qquad (4.30)$$

where $w_i^n$ is the $n$th bit of the $i$th codeword and $\otimes$ denotes the modulo 2 bit-wise product of the codewords. The element of matrix $\mathbf{H}^{(f)}$ is then given by:

$$h^{(f)}_{i,h} = p_{e,f}^{d_{ih}} (1 - p_{e,f})^{\lceil \log_2 M \rceil - d_{ih}} \qquad (4.31)$$

where $p_{e,f}$ is the residual bit error probability in the feedback channel. Parameter $p_{e,f}$ is a function of channel coding gain and SINR through the bit error curve of the uncoded system for the given modulation, and is given by $p_{e,f} = P_b[G_c(\gamma)\gamma]$.

One way of coding mode identifiers is to adopt the Gray code. Assume that the message is always transmitted using the strongest mode, BPSK-1/2 in our case, but that the error probability depends on the actual channel state, i.e. the error rate depends on the state of the feedback channel. Under the assumption of a symmetric channel, the channel state at control message transmission time is assumed to be the state of the direct channel $\tau_e$ before transmission time, i.e. the state at estimation time. Under this assumption, the delay in the adaptation chain is concentrated in the estimation delay model. With these assumptions, it is seen that the effect of feedback errors is negligible. Therefore, less relaxed assumptions on this feedback error model might not lead to very different results. In fact, as shown in Figure 4.8, in which the BER in the feedback channel is assumed fixed and independent of the state of the direct channel, the impact is no longer negligible only for values of $p_{e,f}$ so large as to be out of range in adaptive systems and often in communications systems in general.
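
The feedback error matrix for Gray-coded mode identifiers can be sketched as follows. This is an illustration under stated assumptions: the distance between codewords is taken to be the Hamming distance (computed with a bit-wise exclusive OR), the bit error probability is the same for every state, and the function names and the choice of eight modes are hypothetical.

```python
from math import ceil, log2


def gray(i):
    """Binary-reflected Gray code of the integer i."""
    return i ^ (i >> 1)


def feedback_matrix(m, p):
    """Probability that mode identifier i is received as identifier h.

    m: number of PHY modes; p: residual bit error probability (assumed constant).
    Each entry is p**d * (1 - p)**(n_bits - d), with d the Hamming distance.
    """
    n_bits = ceil(log2(m))
    codes = [gray(i) for i in range(m)]
    matrix = []
    for ci in codes:
        row = []
        for ch in codes:
            d = bin(ci ^ ch).count("1")  # number of differing bits
            row.append(p ** d * (1 - p) ** (n_bits - d))
        matrix.append(row)
    return matrix


H_f = feedback_matrix(8, 1e-3)
```

With Gray coding, adjacent modes differ in a single bit, so the most likely feedback error moves the transmitter to a neighbouring mode rather than to a distant one.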



4.2 ADAPTIVE TRANSMISSION IN AD HOC NETWORKS 

The adaptive-transmission protocol described in this chapter is intended for mobile packet radio networks (PRNs), in which half-duplex radios employ direct-sequence spread spectrum (DS-SS). PRNs are examples of the ad hoc networks described in detail in Chapter 13. One of the key features of such networks is the lack of a fixed infrastructure or central controller to determine the power levels, code rates and other transmission parameters that should be used by the terminals in the network. The adaptive-transmission protocol is a fully distributed protocol that determines the transmission parameters for a communication link from statistics obtained in the receiver on the link.

Figure 4.8 Sensitivity of average goodput to errors in the feedback channel. In this figure, the residual BER in the feedback channel is assumed fixed and independent of the state of the direct channel. The impact is no longer negligible only for values of $p_{e,f}$ so large as to be out of range in adaptive systems and often in communications systems in general.

The receiver includes an automatic gain control (AGC) subsystem, one or more programmable matched filters (with different processing gains), a soft-decision decoder, and a subsystem that generates a postdetection signal quality (PDSQ) statistic. Three statistics, which can be derived in each receiver, are used to decide on the receiver configuration.

The PDSQ statistic is based on the output of the matched filter. It represents the desired signal level. In general this statistic can have a high, s(h), or low, s(l), value. The symbol-error rate (SER) statistic is determined from an error count that is derived in the decoding subsystem. The error count can be obtained by re-encoding the information bits at the output of the decoder and comparing the encoded symbols with hard decisions that are based on the corresponding outputs of the matched filter. The SER can have a high, e(h), or low, e(l), value. The AGC statistic is an estimate of the total received power that indicates the level of interference and can have a high, i(h), or low, i(l), value. Using a combination of the PDSQ,
SER, and AGC statistics, the receiver selects the transmission parameters for the next 
packet, and it sends this information to the transmitter in an acknowledgment packet or as 
part of the exchange that occurs if a reservation protocol is used for channel access (e.g. 
in a clear-to-send message). The PDSQ, SER, and AGC statistics derived in the receiver
provide the only CSI that is available to the adaptive-transmission system. The transmission
parameters that are adapted are the power level, spreading factor and code rate. As part of 
the design process for the adaptive transmission protocol, a set of thresholds, referred to 
as the adaptation thresholds, must be chosen. More details on statistics measurements and 
the threshold selection can be found in Block and Pursley [35]. The adaptive-transmission 
protocol uses the thresholds in conjunction with the AGC, PDSQ and SER statistics from 
past receptions to determine the transmission parameters for the next packet transmission. 

A combination of the measured values s(·), e(·) and i(·) is used in the adaptation algorithm to increase, I(·), decrease, D(·), or not change, N(·), the code rate R, power level S or processing gain G. NI(·), for example, represents the action 'not change' or 'increase' the parameter, where 'not change' is the first choice and is used for the next packet. If the performance with the first choice is not satisfactory, the second choice is used in the following transmission. Only one parameter is changed in the next transmission interval.

In general, the goal is to keep S low (to save energy and minimize multiple access interference) and R high (to minimize redundancy and maximize the throughput). A high G would improve interference suppression but at the same time reduce the throughput for a fixed chip rate. Some examples of the control action given the measured statistics are:

(1) s(l), e(l), i(l) ⇒ I(R), D(S), D(G)

(2) s(l), e(h), i(l) ⇒ ND(R), NI(S), N(G)

(3) s(l), e(h), i(h) ⇒ ND(R), NI(S), I(G)

Extension of the control rules to other combinations of the measured statistics is straightforward.
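
The example rules above can be held in a small lookup table. The sketch below is only an illustration: the dictionary layout and the function name are hypothetical, the PDSQ level in rule (2) is read as low, and the policy for deciding which single parameter to change in a given interval is not modelled.

```python
# Keys are the (PDSQ, SER, AGC) levels, "l" for low and "h" for high.
# Values give, per parameter, the ordered choices: I = increase, D = decrease,
# N = no change. Two letters mean first choice followed by second choice.
RULES = {
    ("l", "l", "l"): {"R": "I", "S": "D", "G": "D"},
    ("l", "h", "l"): {"R": "ND", "S": "NI", "G": "N"},
    ("l", "h", "h"): {"R": "ND", "S": "NI", "G": "I"},
}


def action(stats, param, first_choice_ok=True):
    """Return the action for one parameter (R, S or G) given the measured levels.

    The second choice is returned only when one exists and the first choice
    did not give satisfactory performance.
    """
    options = RULES[stats][param]
    if len(options) == 1 or first_choice_ok:
        return options[0]
    return options[1]


# Rule (2): power is first left unchanged, then increased if that was not enough.
first = action(("l", "h", "l"), "S")
second = action(("l", "h", "l"), "S", first_choice_ok=False)
```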

The results shown in this section are for a communication system with binary differential
phase-shift keying (PSK) data modulation and binary DS-SS. Error correction is achieved with
binary convolutional coding and soft-decision Viterbi decoding. Convolutional codes of 
rates 1/2 and 3/4 are available to the adaptive-transmission protocol. The code of rate 1/2 
has constraint length 7 and minimum free distance 10. 

For performance results, the spreading sequence is a maximal-length linear-feedback shift-register sequence (m-sequence) of period 1023. An interfering signal, if present, uses a different m-sequence of length 1023. The spreading factors available to the adaptive-transmission protocol are 32, 64 and 128, and the number of information bits per packet is fixed at 1000. For the results, each simulation concludes after 50 000 packets are demodulated and decoded correctly. The sampling interval for the PDSQ statistic is $t_0 = T_c$ (chip interval).

The channel model incorporates range attenuation, shadow loss, MAI, and multipath propagation. Each of these four characteristics may be fixed or time-varying. For our purposes, a Markov channel is a communication channel for which at least one characteristic is time-varying, and each time-varying characteristic is modeled as a discrete-time Markov process. The Markov model for range attenuation is illustrated in Figure 4.9 [35], and the Markov model for the loss due to shadowing is illustrated in Figure 4.10. The model for the interference caused by other transmissions in the network consists of the two Markov chains, illustrated in Figure 4.11. Finally, each multipath component is modeled by the Markov chain in Figure 4.12. Each Markov chain may change states from one packet transmission to the next, but the state is assumed to be constant over the duration of the packet transmission.

Figure 4.9 Markov chain for range attenuation. All transition probabilities are $p = 0.05$.

Figure 4.10 Markov chain for shadowing.

Figure 4.11 Markov chains for the interfering signal. The Markov chain in (a) determines the received power of the interferer, and the chain in (b) determines the spreading factor of the interferer.

Figure 4.12 Markov chain for a single multipath component.

Table 4.2 Channel class parameters

Because a Markov channel is represented by one or more finite-state Markov chains, each of which has multiple states, it is actually a collection of a number of different channels. Such a collection is referred to as a channel class. The channel class denoted by $C_1$ consists of the collection of all channels for which there is no shadow loss, multipath or MAI. For class $C_1$, the only channel characteristic that has multiple states is the range loss, which takes values from $D_{min}$ to $D_{max}$ in steps of 1.5 dB, as illustrated in Figure 4.9. The values for $D_{min}$ and $D_{max}$ for class $C_1$ are 89 and 137 dB, respectively. Each of the other five channel classes also has multiple states for the range loss, but the values for $D_{min}$ and $D_{max}$ are different (see Table 4.2). The transmitter has six power levels arranged in steps of 6 dB.
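
A per-packet range-loss process for this class can be simulated with a few lines of code. The sketch below is a rough illustration, assuming that the chain of Figure 4.9 is a birth-death chain that moves one 1.5 dB step up or down with probability 0.05 each and otherwise stays put, and that it simply stays in place at the two end states; the exact structure of the figure is not reproduced in the text, and all names are hypothetical.

```python
import random

D_MIN, D_MAX = 89.0, 137.0  # range loss limits for the first channel class, in dB
STEP = 1.5                  # spacing between adjacent states, in dB
P = 0.05                    # probability of each move (assumed birth-death structure)


def next_range_loss(loss_db, rng):
    """Advance the chain by one packet transmission."""
    u = rng.random()
    if u < P and loss_db + STEP <= D_MAX:
        return loss_db + STEP
    if P <= u < 2 * P and loss_db - STEP >= D_MIN:
        return loss_db - STEP
    return loss_db


def simulate(n_packets, seed=0, start=D_MIN):
    """Return the range loss seen by each of n_packets consecutive packets."""
    rng = random.Random(seed)
    loss, trace = start, []
    for _ in range(n_packets):
        trace.append(loss)  # the state is constant for the whole packet
        loss = next_range_loss(loss, rng)
    return trace


trace = simulate(10000)
```

The same pattern extends to the shadowing, interference and multipath chains by running one independent chain per characteristic and combining the states packet by packet.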

In the model, the minimum power level provides an SNR of $E_c/N_0 = 0$ dB at the receiver if the propagation loss is 90 dB. The parameter $E_c$ is the received energy per chip for the desired signal. The value 0 dB is referred to as the reference value for $E_c/N_0$, and 90 dB is the reference value for the propagation loss. If the chip rate is 16 Mchips/s, $E_c/N_0 = 0$ dB gives $S/N_0 \approx 72$ dB, where $S$ is the received power for the desired signal. If the transmitter uses its maximum power level and the propagation loss is 137 dB, the maximum value for class $C_1$, then the SNR at the receiver is $E_c/N_0 = -17$ dB. The maximum power level is 30 dB above the minimum power level, and the maximum propagation loss is 47 dB greater than its reference level, so $E_c/N_0$ is 17 dB less than its reference value. Class $C_2$ has variable shadow loss in addition to variable range attenuation. Each state is identified by its corresponding shadow loss, $S_i$, $0 \le i \le 2$, as illustrated in Figure 4.10. The value of $S_0$ is 0 dB, and the values of $S_1$ and $S_2$ for class $C_2$ are 10 and 20 dB, respectively, as shown in Table 4.2. Shadow losses of 20 dB or more have been observed in urban environments (see Chapter 3). The transition probability from state $S_i$ to state $S_j$ is denoted by $P_{ij}$. The values of the transition probabilities are also given in Table 4.2.
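
The link-budget arithmetic in this paragraph can be reproduced with a short calculation. This is a minimal sketch; the function names and argument layout are hypothetical, while the reference values (0 dB at 90 dB loss, 6 dB power steps, 16 Mchips/s) are those quoted in the text.

```python
from math import log10


def ec_n0_db(power_step, loss_db, ref_ec_n0_db=0.0, ref_loss_db=90.0, step_db=6.0):
    """Chip-energy-to-noise ratio in dB.

    power_step: index of the transmitter power level, 0 for the minimum level.
    loss_db: total propagation loss in dB.
    """
    return ref_ec_n0_db + power_step * step_db - (loss_db - ref_loss_db)


def s_n0_db(ec_n0, chip_rate_hz):
    """Received-power-to-noise-density ratio: received power is chip energy times chip rate."""
    return ec_n0 + 10 * log10(chip_rate_hz)


worst_case = ec_n0_db(5, 137.0)  # maximum power (30 dB up) against 47 dB of extra loss
reference = s_n0_db(0.0, 16e6)   # about 72 dB at the reference point
```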

Notice that $D_{max}$, the maximum range loss, is reduced from 137 to 117 dB in going from
class $C_1$ to class $C_2$. The reason for this is to have the same maximum value for the total
propagation loss for the two classes. In this way, the same set of transmitter power levels is 
appropriate for both classes. If we were to keep the maximum range attenuation at 137 dB 
and allow a maximum shadow loss of 20 dB, then the maximum transmitter power would 
have to be increased by 20 dB to provide the same SNR for the worst channel in the class. 
Because the minimum power level should remain the same in order to provide the same 
SNR for the best channel in the class, either the step size or the number of steps would 
have to be increased. It is more meaningful to keep the transmitter capability the same for 
different channel classes. 

Channel class $C_3$ has intermittent MAI. The strength and spreading factor for the interference are governed by the Markov chains illustrated in Figure 4.11, where $E_{c,I}$ denotes the energy per chip for the interference signal. The model for the interfering signal includes the use of a rate-1/2 code, so the spreading factor determines the duration of the interference. The maximum range attenuation is adjusted so that the maximum transmitter power can provide an adequate packet-error probability for the worst channel in the class, which corresponds to the maximum MAI and a range attenuation of 129 dB, the value of $D_{max}$ shown in Table 4.2.

Multipath is present in channel class $C_4$ in addition to the primary signal. The strengths
of the different multipath components relative to the primary component are controlled by 
independent Markov chains, each having the form shown in Figure 4.12. 

The delays of the multipath components relative to the primary component are chosen
randomly and without replacement from the set $\{nT_c : 1 \le n \le 150\}$ (i.e. no two components
have the same delay). The adaptive protocol is compared with a protocol that has fixed
transmission parameters and a protocol with perfect CSI (channel state information). As the 
name implies, the latter protocol is always told the state of the channel for previous packet 
receptions, and so it is a model for an adaptive-transmission protocol that is perfect in its 
channel measurements and estimates of the channel state. Note that perfect knowledge of the
past states does not imply perfect knowledge of the channel state for the next transmission;
thus, the results for the protocol with perfect CSI do not provide an upper bound on the
performance of the adaptive-transmission protocol. The protocol with perfect CSI chooses the
transmission parameters according to the channel state that was in effect during the most 
recent reception. 

Each of the three protocols attempts to maximize the throughput efficiency, subject to 
constraints on the packet-error probability. The throughput efficiency, a measure of the 
throughput per unit energy, is defined as the ratio of the number of packets successfully 
received to the total energy expended. The constraint on average packet-error probability
is $P_E \le 0.05$. In some circumstances, the protocols may be able to improve the throughput
efficiency by tolerating large numbers of errors in very poor channel conditions, while still
meeting the constraint on $P_E$. However, this is unfair, in the sense that the protocol would not
provide an average packet-error probability of 0.05 over a time interval in which the channel 
state was very poor. To avoid this, it is required that each protocol adapt the parameters in 
order to provide a packet-error probability of 0.05 or less for each channel in the class. Note 
that meeting the latter constraint does not ensure that the former constraint is also met. For 
example, after a change in channel state, the error rate may be high until the protocol has 
adapted all parameters to the new state. The performance comparisons are for two scenarios. 
In the first scenario, each of the three transmission protocols knows the channel class but 
not the current channel state. The protocol with perfect CSI is told the state of the channel 
for the previous transmission, but the other two protocols are not given any information 
about the previous channel state. Each protocol chooses its transmission parameters in the 
best way for the given channel class. For example, the protocol with fixed transmission 
parameters uses the minimum power level, minimum spreading factor, and maximum code 
rate that provide the required packet-error probability for the worst channel in the given 
class. Similarly, the adaptive-transmission protocol uses the adaptation thresholds that are 
best suited for the given class. 

In Figure 4.13, the throughput efficiency of each protocol is shown for each class. As expected, the protocol with fixed transmission parameters performs substantially worse than the protocols that adapt the transmission parameters. The protocol with perfect CSI has slightly better performance than the adaptive-transmission protocol for classes $C_3$, $C_4$ and $C_5$. For the remaining classes, the two protocols give nearly the same performance. Even though classes $C_1$ and $C_2$ have the same worst-case propagation loss, the results in Figure 4.13 show that the throughput efficiency is much higher for class $C_2$, in spite of the possibility of shadowing that is included in this class. This difference in throughput efficiency is due only to the choice of channel class parameters. For the parameters given in Table 4.2, class $C_1$ is much more likely than class $C_2$ to exhibit maximum or near-maximum propagation loss. In Figure 4.14, the throughput efficiencies are shown for a range of values of $P_{10}$ and $P_{20}$ from the Markov chain of Figure 4.10. As these probabilities are increased from 0.01 to 0.1, the probability of having no loss due to shadowing increases, so the throughput efficiency also increases for each of the adaptive-transmission protocols. Increases in $P_{10}$ and $P_{20}$ are offset by corresponding decreases in $P_{11}$ and $P_{22}$, which are the probabilities of staying in states 1 and 2, respectively.

Figure 4.13 Throughput efficiency for protocols with a known channel class. (Reproduced by permission of IEEE [35].)

Figure 4.14 Throughput efficiency for channel class $C_2$ as parameters $P_{10} = P_{20}$ of the Markov chain of Figure 4.10 are varied. Adaptation thresholds are constant. (Reproduced by permission of IEEE [35].)

Figure 4.15 Throughput efficiency for protocols with an unknown channel class. (Reproduced by permission of IEEE [35].)

In the second scenario, the transmission protocols do not even know the channel class. 
The design of the protocols must account for the full range of channel conditions that 
can be experienced for all six classes. The performance of a protocol in this scenario is a 
good measure of its robustness. In Figure 4.15, the throughput efficiency for each protocol 
is shown for the six channel classes. If an adaptive protocol is used, the average energy 
needed to meet the requirement on the error probability is substantially less than for fixed 
transmissions. The protocol with perfect CSI gives the best performance, but the
adaptive-transmission protocol does nearly as well.



4.3 ADAPTIVE HYBRID ARQ SCHEMES FOR WIRELESS LINKS 

This section considers the problem of using error-control coding in the data link layer to 
achieve reliable communication over a wireless link. Broadly speaking, there are two types 
of error control: forward error correction (FEC), considered so far in this chapter, and ARQ. 
ARQ is efficient when the channel condition is good or moderately good, but as the channel 
condition deteriorates, ARQ throughput performance becomes unacceptably poor. In this 
chapter we consider: (1) an adaptive hybrid of FEC and ARQ using the Reed-Solomon 
(RS) code; and (2) adaptive frame-length control. RS codes are known to provide excellent 
error-correction capability, especially in terms of bursty error suppression. The RS code 
rate and frame length are chosen adaptively based on the estimated channel condition to 
maximize the throughput performance. We also consider error control on the MAC header
since: (1) without proper error protection of the header, it would be of no help to apply any
error control on the user data; and (2) the MAC header is also part of the data link layer. More
discussion on adaptive frame length and coding can be found in References [36-39]. Three
error-control schemes (referred to as FEC1, FEC2 and FEC3) are considered depending
on: (1) how many RS code segments are used for each packet; and (2) how a packet 
with uncorrectable errors is retransmitted. The computation complexity of an error-control 
scheme is closely related to battery power consumption. Encoding/decoding processes with 
RS codes are known to consume substantial battery power. In this chapter we use the central 
processing unit (CPU) time for encoding/decoding of an RS codec implementation as the 
measure of computational complexity. While the actual complexity of the RS code will 
depend on a particular implementation, it is believed that the CPU time measurement can 
be a good reference. The three schemes are compared in terms of throughput performance 
and computation complexity. 

4.3.1 RS codes 

For the error control of user data, $(N, K, q)$ RS codes over GF($q$) are used, in which the codeword size $N \le q - 1$ and the number of information symbols $K < N$. A $q$-ary symbol is mapped to $b$ bits, so $q = 2^b$. RS codes are known to have the maximum error-correction capability for given redundancy, i.e. they are maximum distance separable (MDS) codes. For an $(N, K, q)$ MDS code, the minimum distance $d_{min}$ is determined as $d_{min} = N - K + 1$, where the error correction capability $t = \lfloor (d_{min} - 1)/2 \rfloor = \lfloor (N - K)/2 \rfloor$, i.e. any combination of $t$ symbol errors within $N$ symbols can be corrected. The code rate $r_c$ is defined as $r_c = K/N$. One can easily see that the more parity symbols (i.e. the larger $N - K$), the better the error-correction capability. RS codes are also known to be efficient for handling bursty errors. For example, with an $(N, K, 2^b)$ RS code with error correction capability $t$, as many as $bt$ bit errors can be corrected in the best case, when all $b$ bits in each of $t$ $b$-bit symbols are erroneous (i.e. bursty errors). However, only $t$ bit errors can be corrected in the worst case, when only one bit in each of $t$ $b$-bit symbols is erroneous (i.e. nonbursty errors). Originally, the codeword size of an $(N, K, q)$ RS code is determined to be $q - 1$. However, a shorter codeword can be obtained via code shortening. For example, given an $(N, K, q)$ code, $K - s$ information symbols are appended by $s$ zero symbols. These $K$ symbols are then encoded to make an $N$ symbol-long codeword. By deleting all $s$ zero symbols from the codeword, we can obtain an $(N - s, K - s)$ code. For decoding this shortened code, the original $(N, K)$ decoder can still be used by appending zero symbols between the $K - s$ information symbols and the $N - K$ parity symbols. Shortened RS codes are also MDS codes. Code shortening is especially useful for transmitting information with less than $K$ symbols.
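
The code parameters discussed above follow directly from the codeword and information lengths. The sketch below only does this bookkeeping and does not encode or decode anything; the function names and the returned dictionary keys are hypothetical.

```python
def rs_params(n, k, b):
    """Derived parameters of an RS code with n-symbol codewords over b-bit symbols."""
    q = 2 ** b
    if not 0 < k < n <= q - 1:
        raise ValueError("need 0 < K < N <= q - 1")
    t = (n - k) // 2  # correctable symbol errors per codeword
    return {
        "d_min": n - k + 1,      # MDS minimum distance
        "t": t,
        "rate": k / n,
        "best_case_bits": b * t,  # bursty errors: every bit of t symbols is wrong
        "worst_case_bits": t,     # scattered errors: one wrong bit in each of t symbols
    }


def shorten(n, k, s):
    """Code lengths after deleting s zero information symbols."""
    if not 0 <= s < k:
        raise ValueError("need 0 <= s < K")
    return n - s, k - s


params = rs_params(255, 229, 8)
short_n, short_k = shorten(255, 229, 100)
```

Shortening removes the same number of symbols from both lengths, so the number of parity symbols, and with it the correction capability, is unchanged while the code rate drops.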

4.3.2 PHY and MAC frame structures 

In this segment, parameters of the popular WaveLAN modem are used. PHY and MAC overheads of the WaveLAN are shown in Figure 4.16 [40]. In WaveLAN, no error-correction coding is implemented; only the CRC code for error detection is used. In this section the MAC frame structure is modified as shown in Figure 4.17 for error-control schemes. First, the MAC header size is increased by one byte. This additional byte is used to give the error-control scheme information, like the RS-code rate used for the subsequent user data, which can be adaptively changed. For the MAC frame header, the binary Bose-Chaudhuri-Hocquenghem (BCH) code is adopted, so 116 parity bits (equivalently, 14.5 bytes) are appended to the header. Using this code, up to $t = 15$ bit errors can be corrected. Note that this BCH coding is fixed for every MAC frame so that the receiver can receive/decode it without any extra information to check if the received MAC frame is destined for itself.

Figure 4.17 The newly modified MAC frame structure.

The user data plus CRC is divided into M segments, and each segment is encoded by an 
RS code. The segmentation and encoding procedures are detailed in the next section. The 
user data is assumed to be an IP packet with a 20-byte-long header and, throughout this 
section, the terms 'user data' and 'packet' are used interchangeably. In order to examine the 
performance of error-control schemes, we specify various overheads of interest as follows: 

• H - the length of MAC header overhead including BCH redundancy = 31.5(= 17 + 
14.5) bytes = 252 b; 

• O - the length of PHY overhead = 420(= 120 + 292 + 8) b (assuming n = 0); 



• C - the length of CRC and IP header overheads = 24(= 4 + 20) bytes = 192 b.

Table 4.3 Nine RS codes considered for adaptation

4.3.3 Error-control schemes

Nine RS codes as shown in Table 4.3 are considered. The reason for limiting the number 
of codes is that a different encoder-decoder pair is needed for each (N, K, q) RS code. 
Described below are error-control schemes for the link layer. 

4.3.3.1 A hybrid of FEC and ARQ

A hybrid of FEC and ARQ is used. The receiver attempts to correct errors first and, if 
the errors cannot be corrected, retransmission of the packet is requested. When errors are 
successfully corrected, an acknowledgment is transmitted to the sender and, when errors 
are detected but not correctable, a negative acknowledgment (NAK) is sent. In this section 
the selective-repeat (SR) ARQ is assumed. The sender keeps transmitting packets without 
waiting for ACK/NAK of those packets already transmitted. If NAK of a transmitted packet 
is received, or if neither ACK nor NAK of a packet is received within a timeout interval, the 
packet is retransmitted. The timeout interval is determined based on the roundtrip time. As 
will be clear below, the schemes considered here do not depend on ARQ, and hence they 
can be used in conjunction with any other ARQ schemes, such as stop-and-wait (SW) and
go-back-N (GBN).

Throughput performance and complexity may depend on the underlying ARQ scheme. 
The SR-ARQ scheme is known to achieve better throughput performance than SW and 
GBN-ARQ at the expense of higher complexity. 

An ACK/NAK packet consists of four bytes: the first two bytes carry the frame
number of the packet associated with the ACK/NAK, the third byte indicates (1) whether
it is an ACK or a NAK, and (2) the adapted code rate, and the last byte is a checksum. As
discussed in Section 4.1, the code rate is mainly adapted by the receiver depending on the 
estimated channel state/condition; the code-rate information is fed back to the sender within 
each ACK/NAK packet. These four bytes are encoded by (148, 32) BCH code, which is a 
shortened code from (255, 139) BCH code adopted for the MAC header error protection. 

4.3.3.2 Sender side 

An IP packet is fragmented into a number of segments, referred to as user data, depending on 
the maximum user data size that can be accommodated in a MAC frame, or the maximum 
transmission unit (MTU). The MTU of the WaveLAN is originally 1.5 kbytes. In this 
segment, MTU is adapted dynamically as required by the underlying error-control scheme. 
The CRC is calculated for both the user data and MAC header. The combined user data and
CRC are then divided into $M$ segments, where each of the first $M - 1$ segments is $Kb$ bits
long and the length of the last segment is $\le Kb$ bits. Note that the MTU can be determined as
$MKb - 32$ bits, where $-32$ represents the CRC overhead. Each segment is encoded with
an $(N, K, 2^b)$ RS code. For the last segment, code shortening might be needed. The MAC
header is then encoded by (255, 139) BCH code. Note that one byte of the header is used to 
represent the information of the error-control code such as N, K, and M. For the schemes 
discussed in this segment, only a set of codes is used, so one byte suffices to specify all the 
necessary information. The MAC frame forms the structure, as shown in Figure 4.17. 
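
The segmentation step on the sender side amounts to simple length arithmetic. The following sketch assumes a 32-bit CRC, as stated above, and uses hypothetical function names; it computes segment lengths only and does not perform the RS encoding itself.

```python
def segment_lengths(user_data_bits, k, b, crc_bits=32):
    """Split user data plus CRC into segments of k*b bits; the last may be shorter."""
    total = user_data_bits + crc_bits
    full, rest = divmod(total, k * b)
    return [k * b] * full + ([rest] if rest else [])


def mtu_bits(m, k, b, crc_bits=32):
    """Largest user data size that fits in m full segments once the CRC is added."""
    return m * k * b - crc_bits


# A 1500-byte packet protected by a code with 229 information symbols of 8 bits.
segments = segment_lengths(1500 * 8, 229, 8)
```

In this example the packet needs six full segments plus a shorter seventh one, which is the case where a shortened code would be applied to the last segment.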

4.3.3.3 Receiver side 

The error control at the receiver works as follows: 

(1) The header of the received frame is decoded at the MAC sublayer. If the header is 
successfully decoded and, if the frame is found to be destined for itself, the user data 
with $M$ RS-coded segments as well as the RS code information from the header are
sent upward to the link layer. 

(2) At the link layer, each of M RS-coded segments is first decoded by the RS decoder, 
then the entire frame (including the header and user data) is checked by the CRC 
decoder. 

As described in the next section, the RS decoder at the receiver first attempts to correct 
errors. If the errors are uncorrectable, only their presence will be detected. Note that errors 
can be detected in three different places in the receiver: the MAC sublayer by the BCH 
decoder, the link layer by the RS decoder, and finally by the CRC decoder. This three-level 
error detection detects virtually all uncorrected errors. 

4.3.3.4 Three error-control schemes 

The following three possible error-control schemes using the above error-control facilities 
are considered. 

• FEC1 - $M = 1$, where a long RS code with a large $N$ is used for each packet.

• FEC2 - $M > 1$ and the entire packet is retransmitted if any RS code segment has uncorrectable errors.

• FEC3 - $M > 1$ and each code segment with uncorrectable errors is requested to be retransmitted.

Then, a set of those code segments received in error is retransmitted by the sender in one 
MAC frame. Note that, for FEC3, a simple form of ACK and NAK is not enough; more on 
this issue will be discussed later. A packet decoded without any RS decoding failure can 
have undetected errors as explained in the next section. These errors are most likely to be 
detected by the CRC. In this case, the sender is requested to retransmit the entire packet for 
all three schemes.

4.3.3.5 Adaptation rule

Analysis in Choi and Shin [36] has shown that FEC2 provides the best compromise between performance and complexity. Here we consider how to adapt the error-control code based on adaptive FEC2. First, we define the set of codes for adaptive use as $c_0, c_1, c_2, c_3$, where

$c_0 \Rightarrow$ no coding
$c_1 \Rightarrow (255, 229, 256)$
$c_2 \Rightarrow (255, 153, 256)$
$c_3 \Rightarrow (255, 77, 256)$

According to Choi and Shin [36], adaptation points (or symbol error probabilities at which the code rate would be adapted) are determined by $P_s = 7.5 \times 10^{-5}$, 0.034 and 0.17. We use a modified set of the adaptation points:

$$P_s^{0,1} = 6.5 \times 10^{-5}, \qquad P_s^{1,2} = 0.025, \qquad P_s^{2,3} = 0.15$$



The reasons for choosing values lower than the original ones include: (1) adaptation at a 
higher P s than the original adaptation point can result in a very low throughput while adap- 
tation at a lower P s would not; and (2) an adaptation rule based on the original probabilities 
might result in too frequent adaptations (for the first reason). 

To develop an adaptation rule, we need to divide time-varying error patterns into two 
categories. One is the long-term variation in which error characteristics vary over several 
seconds to 10 seconds. A time-varying distance between the sender and receiver and shadowing
can cause this type of variation. The other is the short-term variation in which error
characteristics vary over several milliseconds or less. The multipath fading environment 
resulting from multipath propagation of the transmitted signal and the mobile's relative 
movement can cause this type of variation. 

Similar to Section 4.1, error-control code adaptation is based on channel state estimation, which is done using the decoding results at the receiver. That is, the receiver determines the desired code rate based on the decoding result of a received packet, then reports it to the sender through the ACK/NAK of each received packet. Because our adaptation is not based on any prediction but on observation and feedback, it might not be able to handle the short-term variation of the channel condition well, depending on the actual channel behaviour. However, it is effective for long-term variations. The code currently set at the receiver is denoted by $c$.

4.3.3.6 Increase of code rate 

When a packet is successfully received (i.e. without any decoding failure), the receiver 
considers if it is time to change the code rate. Based on the currently set code at the receiver, 
the code rate is increased adaptively as follows: 

• When $c = c_i$, for $i = 2, 3$, the code is adapted to $c_{i-1}$ if the number $e_M$ of symbol errors
out of $N_{m,L}$ RS symbols in the last $L$ frames is smaller than $P_s^{i-1,i} N_{m,L}$ for the smallest
$L$ such that $P_s^{i-1,i} N_{m,L} > 1$.

• When $c = c_1$, the code is adapted to $c_0$ if there was only a single or no symbol error during
the last $L$ frames received for the smallest $L$ such that $P_s^{0,1} N_{m,L} > 1$, where $N_{m,L}$ is the
total number of RS symbols in the last $L$ frames. Note that the number of symbol errors
in an RS code segment is readily available from the decoder unless the decoding fails.
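
The increase rule can be sketched in a few lines of Python. This is only an illustration under stated assumptions: every frame is taken to carry six RS segments of 255 symbols (the full-length frame of the link model), the first adaptation point is taken as 6.5 x 10^-5, and the function and variable names are hypothetical rather than taken from the original scheme.

```python
# Adaptation points P_s^{i-1,i}, keyed by the index i of the code currently in use.
# The value for i = 1 is an assumption (6.5e-5).
ADAPTATION_POINTS = {1: 6.5e-5, 2: 0.025, 3: 0.15}

SYMBOLS_PER_FRAME = 6 * 255  # assumption: six full RS segments per frame


def observation_window(p_adapt, symbols_per_frame=SYMBOLS_PER_FRAME):
    """Smallest L such that p_adapt * N_{m,L} > 1, with N_{m,L} = L * symbols_per_frame."""
    frames = 1
    while p_adapt * frames * symbols_per_frame <= 1:
        frames += 1
    return frames


def may_increase(code_index, errors_in_window, symbols_per_frame=SYMBOLS_PER_FRAME):
    """True if the code rate may be raised from c_i to c_{i-1}.

    errors_in_window is the number of symbol errors observed over the last L
    frames, where L is the window returned by observation_window().
    """
    p_adapt = ADAPTATION_POINTS[code_index]
    frames = observation_window(p_adapt, symbols_per_frame)
    total_symbols = frames * symbols_per_frame
    if code_index == 1:
        # c_1 -> c_0: at most a single symbol error in the window
        return errors_in_window <= 1
    # c_i -> c_{i-1} for i = 2, 3: fewer errors than the adaptation threshold
    return errors_in_window < p_adapt * total_symbols
```

Under these assumptions the window is 11 frames for $c_1$ and a single frame for $c_2$ and $c_3$, so the lowest-rate codes can step up quickly while the step to $c_0$ requires a longer clean observation.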

4.3.3.7 Decrease in code rate 

When a packet needs to be retransmitted due to a CRC error for $c \neq c_0$, the code rate is not
adapted, in order to defer the adaptation decision until the next packet is received, since a
CRC error is very rare and will not usually happen over consecutive packets. On the other
hand, when $c = c_i$ for $i = 1, 2$ and a packet needs to be retransmitted due to RS decoding
failures, the code rate is decreased by one step, i.e. $c_i \Rightarrow c_{i+1}$ for $i = 1, 2$, if any of the
following conditions occurs:

• when all RS segments of a packet result in decoding failures; 

• when more than a half of all RS segments in the last two packets result in decoding 
failures; 

• when three of the last 10 received packets result in decoding failures. 

The above adaptation rule for code-rate decrease may appear very ad hoc but, with the third 
condition, we try to limit the retransmission probability to under 0.3 since a similar or better 
throughput efficiency can otherwise be achieved with the next lower rate code. The first and 
second conditions, on the other hand, render prompt adaptation for an abrupt change of the 
channel condition. 
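
The three decrease conditions can be checked against a short history of per-packet decoding results, as in the sketch below. It assumes six RS segments per packet and treats a packet as a "decoding failure" when at least one of its segments fails; both the function name and that interpretation are assumptions made for illustration.

```python
def should_decrease(failed_segments_history, segments_per_packet=6):
    """Decide whether to step from c_i to c_{i+1} (i = 1, 2).

    failed_segments_history lists, oldest first, the number of RS segments
    that failed to decode in each received packet; the last entry is the
    packet that has just triggered a retransmission.
    """
    if not failed_segments_history:
        return False

    # Condition 1: every RS segment of the latest packet failed.
    if failed_segments_history[-1] == segments_per_packet:
        return True

    # Condition 2: more than half of all segments in the last two packets failed.
    last_two = failed_segments_history[-2:]
    if len(last_two) == 2 and sum(last_two) > segments_per_packet:
        return True

    # Condition 3: three (or more) of the last 10 packets had decoding failures.
    last_ten = failed_segments_history[-10:]
    return sum(1 for failed in last_ten if failed > 0) >= 3
```

For example, `should_decrease([0, 3, 4])` is true through the second condition (7 of 12 segments failed), while `should_decrease([0, 3, 3])` is false because exactly half of the segments failed and only two packets were affected.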

Now, when $c = c_0$, the code is adapted to $c_1$ if more than one CRC error was detected
in the last 10 received packets, in order to keep the retransmission probability
under 0.2. When the code rate is decreased and the sender has been informed of it,
the sender encodes packets with the adapted code rate. The packets which need to be
retransmitted are also re-encoded. Note that, for example, 229 symbols encoded by the (255,
229, 256) RS code cannot be re-encoded by the (255, 153, 256) RS code. For a packet encoded
by code $c_i$ to be re-encoded by code $c_{i+1}$, the user data is first divided into two equal-sized
packets (i.e. IP fragmentation) if it cannot be accommodated in $c_{i+1}$, and each packet is
then encoded by $c_{i+1}$.

Finally, the code rate can also be decreased by the sender's decision. That is, upon 
expiration of the retransmission time-out, the code is set to $c_3$. The rationale behind this
adaptation is that a time-out happens due to the corruption of the MAC header or the loss of 
ACK/NAK while both the header and ACK/NAK are protected by very strong codes, thus 
implying that the channel is very bad. 

4.3.4 Performance of adaptive FEC2 
4.3.4.1 Link model 

A very simple wireless network environment is used to evaluate the proposed adaptive error-control
scheme. There is only one sender and one receiver in the network, where the sender
is assumed to have an infinite amount of information to send. Therefore, all the packets are
transmitted in a full-length MAC frame composed of six RS code segments. Immediately
after receiving/decoding a packet, the receiver sends an ACK/NAK for the packet. Upon
receiving the ACK/NAK, the sender determines whether to transmit a new packet or retransmit
the previously sent one. When the retransmission time-out happens consecutively, the
retransmissions are backed off exponentially. That is, for 2, 3, 4, . . . consecutive time-outs,
the retransmission is deferred for the time of 1, 2, 4, . . . full-length packet transmissions,
because consecutive time-out expirations imply that the channel is too bad for a packet to
go through successfully.

Figure 4.18 Two-state Markov chain model for the channel.
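
The exponential back-off schedule of the link model reduces to a one-line rule. The sketch below is a hypothetical helper (the name and the choice of returning zero for a first time-out are assumptions) that returns the deferral in units of full-length packet transmission times.

```python
def backoff_packets(consecutive_timeouts):
    """Deferral, in full-length packet times, after k consecutive time-outs.

    The text gives 1, 2, 4, ... packet times for 2, 3, 4, ... consecutive
    time-outs; a single time-out is assumed to cause no deferral.
    """
    if consecutive_timeouts < 2:
        return 0
    return 2 ** (consecutive_timeouts - 2)
```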

4.3.4.2 Channel model 

The channel is modelled by a two-state Markov chain, which is widely accepted to model the
multipath fading channel, as shown in Figure 4.18. The channel state is either good or bad
and can change on bit boundaries, that is, the channel condition stays in a state during one bit
duration. Following the widely accepted Rayleigh fading model, which corresponds to the
case of no line-of-sight path between the sender and receiver, we can derive the transition
probabilities $t_{0,1}$ and $t_{1,0}$ as follows. First, Rappaport [41] provides the level-crossing rate,
defined as the expected rate at which the Rayleigh fading envelope, normalized to the
local root mean square (rms) signal level $R_{rms}$, crosses a threshold level $R$ in a positive-going
direction:

$$N_R = \sqrt{2\pi}\, f_m\, \rho\, e^{-\rho^2} \qquad (4.32)$$

where the maximum Doppler frequency is given by $f_m = v/\lambda$ for the mobile speed $v$ and
the wavelength $\lambda$ of the carrier, and the normalized threshold fading envelope is given by
$\rho = R/R_{rms}$. Next, the average fade duration, defined as the average period of time for
which the received signal is below the threshold level, is given by [41]

$$T_f = \frac{e^{\rho^2} - 1}{\rho f_m \sqrt{2\pi}} \qquad (4.33)$$

Using the above formulas and assuming steady-state conditions, the probabilities $\mu_0$ and
$\mu_1$ that the channel is in good and bad states, respectively, are given by [38]

$$\mu_0 = \frac{1/N_R - T_f}{1/N_R} \quad \text{and} \quad \mu_1 = \frac{T_f}{1/N_R} \qquad (4.34)$$

Finally, the state transition probabilities can be approximated by [42]

$$t_{0,1} = \frac{N_R}{R_t^{(0)}} \quad \text{and} \quad t_{1,0} = \frac{N_R}{R_t^{(1)}} \qquad (4.35)$$

where $R_t^{(k)} = R_t \mu_k$, and $R_t$ is the symbol transmission rate. With the transmission rate $R_t =
1$ Mbps, the mobile speed $v = 2$ km/h (pedestrian speed), the carrier frequency $f = 900$
MHz ($\lambda = c/f = 1/3$ m), and the normalized threshold fading envelope $\rho = 0.3$, we obtain

$$\mu_0 = 0.914, \quad \mu_1 = 0.0861$$

and

$$t_{0,1} = 1.253 \times 10^{-6}, \quad t_{0,0} = 1 - t_{0,1}, \quad t_{1,0} = 1.331 \times 10^{-5}, \quad t_{1,1} = 1 - t_{1,0}$$
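
The numerical values quoted above can be reproduced with a short calculation. The sketch below uses the standard Rayleigh expressions for the level-crossing rate, $N_R = \sqrt{2\pi} f_m \rho e^{-\rho^2}$, and the average fade duration, $T_f = (e^{\rho^2} - 1)/(\rho f_m \sqrt{2\pi})$, together with the approximation $t_{i,j} = N_R/(R_t \mu_i)$; the function name and argument order are illustrative choices.

```python
import math


def markov_channel_params(speed_mps, wavelength_m, rho, symbol_rate):
    """Return (mu0, mu1, t01, t10) for the two-state Rayleigh channel model."""
    f_m = speed_mps / wavelength_m                       # maximum Doppler frequency
    n_r = math.sqrt(2 * math.pi) * f_m * rho * math.exp(-rho ** 2)
    t_f = (math.exp(rho ** 2) - 1) / (rho * f_m * math.sqrt(2 * math.pi))
    mu0 = (1 / n_r - t_f) / (1 / n_r)                    # steady-state P(good)
    mu1 = t_f / (1 / n_r)                                # steady-state P(bad)
    t01 = n_r / (symbol_rate * mu0)                      # good -> bad per symbol
    t10 = n_r / (symbol_rate * mu1)                      # bad -> good per symbol
    return mu0, mu1, t01, t10


# Parameters from the text: 2 km/h, 900 MHz carrier (wavelength 1/3 m),
# rho = 0.3 and a 1 Mbps symbol rate.
mu0, mu1, t01, t10 = markov_channel_params(2000 / 3600, 1 / 3, 0.3, 1e6)
```

With these inputs the function returns values that round to the figures stated in the text; note that $\mu_0$ simplifies to $e^{-\rho^2}$, so the steady-state probabilities depend only on the threshold, while the mobile speed affects only how quickly the chain moves between states.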

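Assuming binary phase shift keying (BPSK) modulation, the bit error rates (BERs) $P_{b,0}$
and $P_{b,1}$ when the channel is in good and bad states, respectively, can be calculated as [43]

$$P_{b,i} = \int_{\gamma_i}^{\gamma_{i-1}} P_{b|\gamma}\, f_i(\gamma)\, d\gamma \qquad (4.36)$$

where the BER for a given SNR $\gamma$ is

$$P_{b|\gamma} = Q(\sqrt{2\gamma}) \qquad (4.37)$$

and the conditional distribution of the instantaneous SNR $\gamma$ in a given state with the mean SNR
$\bar{\gamma}$ is

$$f_i(\gamma) = \frac{(1/\bar{\gamma})\, e^{-\gamma/\bar{\gamma}}}{e^{-\gamma_i/\bar{\gamma}} - e^{-\gamma_{i-1}/\bar{\gamma}}} \qquad (4.38)$$

for $\gamma_i < \gamma < \gamma_{i-1}$ and the set $\{\gamma_{-1}, \gamma_0, \gamma_1\} = \{\infty, \rho^2 \bar{\gamma}, 0\}$. Note that the mean SNR $\bar{\gamma}$ depends
on the transmitted power, signal attenuation over the channel, and others.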
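
The per-state BERs have no simple closed form, but they are easy to evaluate numerically. The sketch below is one possible way to do so, assuming the integration limits $\{\infty, \rho^2\bar{\gamma}, 0\}$ given above; the composite Simpson rule, the truncation of the infinite upper limit and all function names are choices made for this illustration.

```python
import math


def q_function(x):
    """Gaussian tail probability Q(x)."""
    return 0.5 * math.erfc(x / math.sqrt(2.0))


def simpson(func, lo, hi, steps=20000):
    """Composite Simpson rule; steps must be even."""
    h = (hi - lo) / steps
    total = func(lo) + func(hi)
    for k in range(1, steps):
        total += (4 if k % 2 else 2) * func(lo + k * h)
    return total * h / 3.0


def state_ber(mean_snr, rho, state):
    """BPSK bit error rate in the good (state 0) or bad (state 1) channel state.

    mean_snr is the linear (not dB) mean SNR.
    """
    threshold = rho ** 2 * mean_snr
    if state == 0:
        # Good state: SNR above the threshold. Q(sqrt(2g)) <= 0.5*exp(-g), so
        # truncating the infinite upper limit 60 units above the threshold is safe.
        lo, hi = threshold, threshold + 60.0
        norm = math.exp(-threshold / mean_snr)
    else:
        # Bad state: SNR between 0 and the threshold.
        lo, hi = 0.0, threshold
        norm = 1.0 - math.exp(-threshold / mean_snr)

    def integrand(g):
        pdf = math.exp(-g / mean_snr) / (mean_snr * norm)
        return q_function(math.sqrt(2.0 * g)) * pdf

    return simpson(integrand, lo, hi)
```

A useful sanity check is that the state BERs, weighted by the steady-state probabilities $e^{-\rho^2}$ and $1 - e^{-\rho^2}$, recover the well-known average BPSK error rate over a Rayleigh channel, $\frac{1}{2}\left(1 - \sqrt{\bar{\gamma}/(1 + \bar{\gamma})}\right)$.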



4.3.5 Simulation results 

Figure 4.19 shows the throughput efficiencies as the mean SNR increases for five different
cases of adaptive FEC2: (1) two marked with 'fixed w/K' are the nonadaptive versions of
FEC2 with (255, K, 256) codes; (2) one marked with 'w/exp backoff' is the adaptive FEC2
we are considering; (3) one marked with 'w/o exp backoff' is the adaptive FEC2 without
exponential back-off; and (4) one marked with 'ideal' is an ideal version of FEC2, in which
the code rate is determined after seeing errors in the received packet; that is, if these errors
are uncorrectable even with the $r_c = 0.3$ code, the packet is assumed not to be transmitted at all
(i.e. the transmission is deferred), while otherwise the code with the maximum code rate that
can correct all these errors is selected. Basically, there is no retransmission in the ideal
version. This ideal and unrealistic version can be used as a reference (or an upper bound)
of the throughput efficiency. The figure demonstrates in which region and how much the
adaptive scheme offers better performance.

Figure 4.19 Throughput efficiency vs mean SNR.

4.4 STOCHASTIC LEARNING LINK LAYER PROTOCOL 

Link layer ARQ protocols such as SW ARQ and SR ARQ, considered so far, recover from 
errors by retransmitting the erroneously transmitted packets regardless of the channel state. 
Such persistent retransmission schemes may lead to wasted energy consumption due to 
unnecessary packet losses and therefore additional retransmissions [44]. Discussions on 
this problem can be found in References [44-47]. In this section, we discuss a link layer 
algorithm that can produce soft predictions of the next network state given the past states, 
i.e. probabilities of the next state being any of the possible states are estimated. 

The goal is to predict the wireless network state without a priori knowledge of the state 
transition probabilities and accordingly change the link layer transmission control for energy 
efficiency. Towards this objective a stochastic iterative learning control technique is used 
based on a stochastic learning automaton [48, 49]. The state of the wireless network is learnt 
on the fly by the transmitter via simple adaptive stochastic learning based on feedback from 
the receiver regarding the quality of the wireless link. Then a link layer control algorithm 
optimizes the transmission policy accordingly. That is, in a time slot the link layer control 
protocol either chooses to transmit a packet or not based on the channel state prediction 
produced by the stochastic learning algorithm. Energy efficiency is traded off for delay 
during this process. 

4.4.1 Stochastic learning control 

A stochastic learning control algorithm based on a variable structure learning automaton
(VSLA) [48, 49] is in general characterized by a 5-tuple $\{\alpha, \beta, p, \tau, C\}$, where $\alpha =
\{\alpha_1, \alpha_2, \ldots, \alpha_r\}$ is the set of $r$ actions from which the automaton can choose any action at
time $n$, denoted by $\alpha(n)$; $\beta = \{0, 1\}$ is the set of binary responses (feedback) generated by an
unknown random environment for the chosen action, where $\beta = 0$ corresponds to a reward to the
automaton for the chosen action while $\beta = 1$ corresponds to a penalty; $p = \{p_1, p_2, \ldots, p_r\}$
contains the set of probabilities with which the corresponding actions are chosen; $\tau$ is the
stochastic iterative learning algorithm according to which the elements of the set $p$ are
updated over time; and $C = \{c_1, c_2, \ldots, c_r\}$ is the set of penalty probabilities conditioned on
the chosen action, i.e. $c_i = \Pr[\beta = 1 \mid \alpha_i]$, which characterizes the unknown environment. It
is reasonable to assume that arg min $C$ is unique for a given random environment. The
VSLA attempts to learn the optimal action (policy) that results in the minimum value in $C$.
The VSLA can be characterized by the following equation:

$$p(n + 1) = \tau\{\alpha(n), \beta(n), p(n)\} \qquad (4.39)$$

where the $i$th element of the set $p(n)$ at time $n$ is

$$p_i(n) = \Pr(\alpha(n) = \alpha_i), \quad i = 1, 2, \ldots, r$$

$$\sum_{i=1}^{r} p_i(n) = 1, \ \forall n \quad \text{and} \quad p_i(1) = 1/r, \ \forall i \qquad (4.40)$$



4.4.2 Adaptive link layer protocol 

The two-state Markov chain model (Gilbert-Elliott model) from Figure 4.18 is also used
in this section. The transmission energy ($E_t$), mean number of retransmissions ($\bar{R}$) and the
packet error rate ($P_{ep}$) for an ARQ scheme are related as [45]

$$E_t = (\bar{R} + 1) P / C$$

$$\bar{R} = (1 - P_{ep}) \sum_{i=1}^{R-1} i P_{ep}^{i} + R P_{ep}^{R} \qquad (4.41)$$

$$P_{ep} = P_G [1 - (1 - P_{eG})^L] + P_B [1 - (1 - P_{eB})^L]$$

In Equation (4.41) $E_t$ is the total transmission energy, $P$ is the transmit power, $C$ is the
bit rate of the channel, $P_{eG}$ and $P_{eB}$ are the bit error rates in states G (good) and B (bad),
respectively, $L$ is the packet length in bits, $R$ is the maximum number of retransmissions
allowed, $P_{ep}$ is the packet error probability, and $P_G$ and $P_B$ are the steady state probabilities
of the G and B states. As seen from Equation (4.41), the total energy consumed grows
linearly with the mean number of retransmissions, indicating that considerable energy
can be saved if the number of retransmissions is reduced significantly. Therefore, smart
link layer protocols that recognize the wireless channel state and adaptively control the
transmission/retransmission policy could result in significant energy savings.
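
The relations in Equation (4.41) translate directly into code. The following sketch is illustrative only: the function names are hypothetical, the mean number of retransmissions is kept separate from the maximum number allowed, and the energy expression is used in the form $(\bar{R} + 1)P/C$ as printed.

```python
def packet_error_prob(p_good, p_bad, ber_good, ber_bad, packet_bits):
    """Packet error probability averaged over the good and bad channel states."""
    return (p_good * (1 - (1 - ber_good) ** packet_bits)
            + p_bad * (1 - (1 - ber_bad) ** packet_bits))


def mean_retransmissions(p_ep, max_retx):
    """Mean number of retransmissions when at most max_retx are allowed."""
    partial = sum(i * p_ep ** i for i in range(1, max_retx))
    return (1 - p_ep) * partial + max_retx * p_ep ** max_retx


def transmission_energy(p_ep, max_retx, power, bit_rate):
    """Transmission energy E_t = (mean retransmissions + 1) * P / C."""
    return (mean_retransmissions(p_ep, max_retx) + 1) * power / bit_rate
```

For instance, with a packet error probability of 0.5 and at most three retransmissions, `mean_retransmissions(0.5, 3)` gives 0.875, which equals $P_{ep} + P_{ep}^2 + P_{ep}^3$, the sum of the probabilities that at least one, two and three retransmissions are needed.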

In this section an adaptive link layer protocol is presented that adjusts itself over time
to arrive at the optimal policy for a fixed but unknown finite state wireless channel model.
The link layer protocol employs a VSLA with two actions in its action set: $\alpha_1$, transmit a
packet; and $\alpha_2$, do not transmit a packet.

(1) If the transmitter sends a packet, it waits for a fixed time period to receive an
acknowledgement. If an ACK is received within this period then the action $\alpha_1$ is
rewarded ($\beta = 0$); if not, a time-out occurs and $\alpha_1$ is penalized ($\beta = 1$).

(2) Then, a reward-penalty learning algorithm, as discussed later in this section [see
Equation (4.42)], is used to update the action probabilities of the set $p(n) =
\{p_1(n), p_2(n)\}$.

(3) The next transmission decision is chosen according to the probability distribution
$p(n)$.

This process continues until the probability vector $p(n)$ converges, when the state of the
channel is predicted accurately one time unit ahead. We assume that the reverse channel
or the feedback channel is error free. The effects of an erroneous feedback channel and of
feedback delays are similar to those discussed in Section 4.1. It is also assumed that an
ACK is generated for every packet transmitted. Depending on the needs of the situation, the
feedback and updating can also be done after every few packets with some minor changes
to the above algorithm.



4.4.2.1 Reward-penalty algorithm

When $\alpha_1$ is chosen, $p_1(n)$ and $p_2(n)$ are updated linearly depending on $\beta(n)$; however,
when $\alpha_2$ is chosen, both $p_1(n)$ and $p_2(n)$ are left unchanged. This is because, when the
link layer chooses not to transmit a packet, there will be no feedback from the receiver.
This stochastic learning control algorithm, which predicts the channel state and controls the
retransmission policy defined as $p(n + 1) = \tau\{\alpha(n), \beta(n), p(n)\}$, can be described as

$$
\begin{aligned}
&\text{if } \alpha(n) = \alpha_1 \text{ and } \beta(n) = 0: \\
&\qquad p_1(n + 1) = p_1(n) + a\,(1 - p_1(n) - A) \\
&\qquad p_2(n + 1) = p_2(n) - a\,(p_2(n) - A) \\
&\text{if } \alpha(n) = \alpha_1 \text{ and } \beta(n) = 1: \\
&\qquad p_1(n + 1) = p_1(n) - b\,(p_1(n) - B) \\
&\qquad p_2(n + 1) = p_2(n) + b\,(1 - p_2(n) - B) \\
&\text{if } \alpha(n) = \alpha_2: \\
&\qquad p_1(n + 1) = p_1(n) \\
&\qquad p_2(n + 1) = p_2(n)
\end{aligned}
\qquad (4.42)
$$

In Equation (4.42), $a$ and $b$ are reward and penalty parameters that control the speed and
accuracy with which the VSLA predicts and tracks the time-varying wireless channel state.
Parameters $A$ and $B$ control the probability with which the link layer can explore both
actions even while deviating from the currently most probable action. They also determine
the amount of exploration vs exploitation during learning.
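
A minimal sketch of the reward-penalty update of Equation (4.42) is given below. The default parameter values $a = b = 0.3$ and $A = B = 0.02$ are those used in the simulations reported in this section; the function names and the encoding of the actions as the integers 1 and 2 are assumptions made for illustration.

```python
import random


def vsla_update(p1, p2, action, beta, a=0.3, b=0.3, A=0.02, B=0.02):
    """One step of the two-action reward-penalty update.

    action is 1 (transmit) or 2 (do not transmit); beta is 0 for a reward
    (ACK received) and 1 for a penalty (time-out).
    """
    if action == 2:
        # No transmission means no feedback, so nothing is learnt.
        return p1, p2
    if beta == 0:
        return p1 + a * (1 - p1 - A), p2 - a * (p2 - A)
    return p1 - b * (p1 - B), p2 + b * (1 - p2 - B)


def choose_action(p1, rng=random):
    """Pick action 1 with probability p1, otherwise action 2."""
    return 1 if rng.random() < p1 else 2
```

Both branches preserve $p_1 + p_2 = 1$. Under repeated rewards $p_1$ approaches $1 - A$ rather than 1, and under repeated penalties it approaches $B$ rather than 0, which is how $A$ and $B$ keep both actions explorable.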



4.4.2.2 Performance results 

The simulation scenario consists of two nodes, with one node acting as a transmitter while
the other is a receiver. The channel between the two nodes is simulated as a two-state
Markov model (Figure 4.18). With probability 1 packets are dropped in the B state. Various
values for the packet loss rate in the G state were chosen to observe the performance.
A transmission rate of 11 Mbps was assumed as it is typical in a present-day wireless
network such as a wireless LAN. The packet size $L$ was fixed at a constant 1000 bytes.
No upper limit on the number of retransmissions was set. In each simulation the number
of packets transmitted by the sending node to the receiving node was 5000 and the results
were averaged. The parameters $A = B = 0.02$ were fixed throughout the simulations and
$a = b = 0.3$ unless otherwise stated. If the link layer transmitted a packet then a timer was
set equal to 2.5 times the time taken by a packet to reach the destination. If an ACK was
not received before the timer expired (time-out) then the packet was considered lost and
the action probabilities were updated. If the chosen action was not to transmit, then a back-off
timer with a time period $S$ was used. After the expiry of the time period $S$ the transmitting
node again decided the next action according to the action probabilities in
$p(n)$.

Figure 4.20 Percentage energy savings vs channel burstiness when packet error rate in G
state is 0 %. Performance for different choices for the delay period S. (Horizontal axis:
correlation coefficient of the channel, $\rho$.)

Figure 4.20 shows the percentage savings in transmission energy (pi) obtained using 
the adaptive link layer protocol in comparison with an SW ARQ protocol for 0 % packet 
loss rate in the G state. Recall that the packet loss rate in the B state was assumed to 
be 100 %. With respect to the notation from Figure 4.18, the correlation coefficient of the
channel, which for $t_{0,0} = t_{1,1}$ is $\rho = t_{0,0} - t_{0,1} = t_{1,1} - t_{1,0}$, is a measure of the burstiness of the
channel. In these simulations the parameter $S$ was varied in multiples of $v$, where $1/v$ is
the transmission rate. As seen from the figure, the energy savings are substantial (nearly
50 %) when the correlation coefficient of the channel ($\rho$) is high. This is because when
the correlation coefficient is high the learning algorithm is able to predict the state of
the channel more accurately as it utilizes the higher 'memory' in the channel. When the
correlation coefficient is small, the energy savings reduce (to nearly 10 %) as the behavior
of the channel becomes less predictable. Nevertheless, for both high and low values of $\rho$
the adaptive link layer protocol conserves energy when compared with the traditional ARQ
scheme.

The tradeoff in higher energy savings is an increased mean delay per packet transmission 
A, as seen in Figure 4.21. We observe that the difference in additional mean packet delay 
(w.r.t. the SW ARQ) between $S = v$ and $S = 10v$ is reasonably small. We reiterate that
$S$ is a user-defined parameter which can be tuned to a specific application's mean delay
requirements. Finally, we note that similar performances can be observed when the proposed
method is also compared with other traditional ARQ protocols.

Figure 4.21 Mean delay per packet vs channel burstiness when G state packet loss rate is
equal to 0 %.

4.5 INFRARED LINK ACCESS PROTOCOL 

Recent growth in laptop computers and portable devices, such as personal digital assistants
(PDAs) and digital cameras, has led to an increasing demand for information transfer
from or between portable devices [52]. Digital representation of information is expanding
to new devices such as video and photo cameras. New devices, such as mobile phones and
portable information gathering appliances, have 'computer-like' capabilities for storing and
retrieving information. Computer manufacturers have adopted the Infrared Data Association
(IrDA) standard [53], and almost every portable computer and every Windows CE device on
the market today contains an infrared (IR) port according to standards developed by IrDA.
Laptop computers, PDAs, digital cameras, mobile phones and printers are examples of
devices with IrDA links. More than 40 000 000 devices are shipped each year with IrDA
ports [54] capable of using the unregulated IR spectrum for their cable-less communication
needs. The IrDA standard addresses low-cost, indoor, short-range, half-duplex, point-to-point
links [55]. The IrDA physical layer specification (Ir-PHY) [56] supports optical links
from 0 to at least 1 m, within an angle of ±15°, at a BER of less than $10^{-8}$. The Ir-PHY version 1.0
serial IR (SIR) specification [57] supported data rates up to 115.2 kb/s using standard serial
hardware, Ir-PHY version 1.1 fast IR (FIR) [58] extended the data rate to 4 Mb/s, and finally
the Ir-PHY version 1.3 very fast IR (VFIR) specification [59] added the 16 Mb/s link rate.

The IrDA hardware is controlled by a link-layer protocol, the IrDA link access protocol
(IrLAP) [60, 61]. IrLAP is based on the widely used high-level data link control (HDLC)
protocol operating in normal response mode (NRM) (see Chapter 1). The performance of IrDA
optical wireless links may be measured by the throughput which can be drawn at the IrLAP
layer. Vitsas and Boucouvalas [62, 63] present a performance evaluation of the IrLAP protocol
and derive optimum values for link layer parameters that maximize throughput. In the literature, a
mathematical model for the IrLAP throughput using the concept of a frame's 'virtual transmission
time' is presented [52, 53] based on the HDLC analysis model presented in Williams
[54]. However, this model does not lead to a simple formula for the IrLAP throughput. In
Vitsas and Boucouvalas [57, 58], a new mathematical model using the average window
transmission time (WTT) is developed. By taking advantage of IrLAP half-duplex operation,
this model leads to a simple closed-form formula for IrLAP throughput. The formula
relates throughput to physical layer parameters, such as link BER, link data rate and minimum
turnaround time, and to link layer parameters such as frame size, window size and
frame overhead. As this equation gives an intuitive understanding of the performance of
IrDA links, it is very valuable for designers and implementers of such links. By setting the
first derivative equal to zero, the optimum values for window size and frame length that
maximize throughput are derived. Formulas for all IrLAP time-consuming tasks are also
presented, allowing the effect of link parameter values on throughput performance to be
evaluated. IrLAP performance is examined for various link parameters, such as BER, data rate
and window size, and compared with the optimum performance achieved using optimum window
size and frame length values. Optimum window and frame-size values can be easily implemented
and result in a significant throughput increase, especially for links experiencing high BERs.
However, implementing optimum frame-size values on retransmissions requires buffer
reorganization at a low level.

4.5.1 The IrLAP layer 

IrLAP is the IrDA data link layer. It is designed based on the pre-existing HDLC and 
synchronous data link control protocols (see Chapter 1, also Reference [60]). IrLAP stations
operate in two modes: the normal disconnect mode during the contention period and the 
NRM during the connection period. In the contention period, a station advertises its existence 
to the neighboring stations along with the link parameters it supports and wishes to employ 
during the connection period. One of the participating stations becomes the primary station. 
Any station may claim to become the primary station but, at the end of the contention period, 
only one station is granted the primary role and all other stations are assigned the secondary 
role. All data traffic during the connection period is sent to or from the primary station. A 
secondary station wishing to communicate to another secondary station does so through the 
primary station. The parameters negotiated and agreed on during the contention period are 
given below: 

(1) Data rate ($C$) - this parameter specifies the station's transmission rate.

(2) Maximum turnaround time ($T_{max}$) - this parameter specifies the maximum time
interval a station can hold transmission control. For data rates less than 115.2 kb/s,
the maximum turnaround time must be 500 ms. A smaller value may be agreed
between the two stations for 115.2 kb/s or higher data rates.

(3) Data size ($l$) - this is the maximum length allowed for the data field in any received
information frame (I frame). This parameter has an upper limit of 2048 bytes
(16 kb).

(4) Window size ($W_{max}$) - this is the maximum number of unacknowledged I frames
a station can receive before it has to acknowledge the number of frames received
correctly. An acknowledgement may be requested by the transmitting station before
the window size is reached. This parameter has an upper limit of 7 for data rates up
to 4 Mb/s and 127 for 4 and 16 Mb/s [60, 61].

(5) Minimum turnaround time ($t_{ta}$) - this is the time required by the station's receive
circuit to recover after the end of a transmission initiated from the same station
(turnaround latency). Each station must wait a minimum turnaround time delay
when moving from receive mode to transmit mode to ensure that the receive circuit
of the station that was transmitting is given enough time to recover. This is the time
required to change link direction.

Figure 4.22 IrDA SIR and FIR frame structure.



Both stations must use the same data rate. However, parameters (2)-(5) are negotiated and 
agreed independently for each station. The IrDA frame structure is shown in Figure 4.22. 
The FCS contains a 16 b CRC for data rates up to 4 Mb/s and a 32 b CRC for 4 Mb/s and 
higher rates. IrLAP employs the following frame types: 



(1) Unnumbered frames (U frames) are used for link management. 'U frame' functions 
include discovering and initializing secondary stations, reporting procedural errors 
not recoverable by retransmissions, etc. 

(2) I frames carry information data across the link during the connection period. The
I-frame control field contains send and receive frame counts to ensure ordered frame
reception.

(3) Supervisory frames (S frames) assist in information data transfer, although S frames
never carry information data themselves. They are used to acknowledge correctly
received frames, request an acknowledgement from the communicating station, convey
station conditions, etc.

The control field contains an identifier, which determines the frame type. Depending on
frame type, the control field may contain a send sequence number $N_s$, used to number the
transmitted frames. It may also contain a receive sequence number $N_r$, used to indicate
the expected sequence number of the next I frame. SIR and FIR specifications employ an
8 b-long control field (Figure 4.22). $N_s$ and $N_r$ occupy 3 b each in the control field; thus,
$N_s$ and $N_r$ cycle through values from 0 to 7 and the maximum window size is 7. The very fast
infrared (VFIR) specification extended the control field to 16 b for the 4 and 16 Mb/s data
rate IrDA links. In this case, $N_s$ and $N_r$ occupy 7 b each, cycling through values 0 to 127,
and a maximum window size of 127 is allowed.

Within the control field, the P/F bit implements token passing between stations. When it 
is set by the primary station, it is the poll (P) bit. When it is set by the secondary station, it 
is the final (F) bit. The primary uses the P bit to reverse link direction and solicit a response 
from the secondary. The secondary responds by transmitting one or more frames and by 
setting the F bit of the last frame it transmits, thus, reversing link direction and returning 
transmission control to the primary. IrLAP primary and secondary stations also employ the 
P timer. The P timer is assigned the maximum turnaround time ($T_{max}$) agreed between
stations during the contention period and represents the maximum time a station can hold
transmission control. Each station starts the P timer upon reception of a frame with the P/F
bit set and stops the P timer when it transmits a frame with the P/F bit set. If the P timer
expires, meaning that the station has held transmission control longer than allowed, the station
immediately sends a receive ready (RR) S frame with the P/F bit set to pass transmission
control. The primary station also employs an F timer to limit the time a secondary station
can hold transmission control. The primary starts the F timer upon transmission of a frame
with the P bit set and stops the F timer upon reception of a frame with the F bit set. F-timer
expiration means that the secondary failed to return transmission control within the agreed
time period. Since the secondary's P-timer operation guarantees that this never happens,
F-timer expiration can only be explained by the loss of either the frame containing the P
bit or the frame containing the F bit. The primary resolves this situation by transmitting an
RR frame with the P bit set when the F timer expires.

4.5.2 IrLAP functional model description 

In this section, the transmission of a large amount of information data from the primary 
to the secondary station is considered. The saturation case is assumed, where the primary 
station always has information data ready for transmission. 

The parameters used in the model are shown in Table 4.4. In the contention period, the
primary station determines the window size $N$ it will employ. $N$ represents the maximum
number of I frames the primary can transmit before soliciting an acknowledgement. The
maximum window-size parameter $W_{max}$ is negotiated and agreed between the two stations
during the contention period. However, the maximum time a station can hold transmission
control, $T_{max}$, must always be obeyed and, according to the IrLAP specification [60], $T_{max}$
combined with frame length and link data rate may limit the window size applied. In other
words, if the time needed for transmitting $W_{max}$ frames carrying 'frame length' information
bytes at the link data rate exceeds $T_{max}$, then a smaller window size must be employed.
Thus, $N$ is given by

$$N = \min\{W_{max}, \mathrm{floor}(T_{max}/t_I)\} \qquad (4.43)$$

where min is 'the lesser of' and floor is 'the largest integer not exceeding'. In this section,
$T_{max}$ is always fixed to 500 ms. The information transfer procedure used in the model
is presented in Figure 4.23. Each node holds three variables: $V_s$ for counting I frames
transmitted, $V_r$ for counting I frames received, and $w$ indicating the number of the remaining
I frames the station can transmit before reversing link direction. The primary also employs
an F timer for limiting the secondary's transmission period. When the primary station
sends an I frame, the $N_s$ subfield of the frame's control field is assigned the current $V_s$ value
and $V_s$ is increased by one (modulo 8 or 128 depending on the control field size employed).
The primary also makes a buffer copy of the frame for possible retransmissions. Since the
primary always has information ready for transmission, it immediately checks the $w$ value.
If $w$ is not equal to 1, the primary reduces $w$ by 1, transmits the I frame with the P bit not
set and the actions previously described are repeated. When $w$ reaches 1, indicating that the
next frame should be the last frame in the window transmission, the primary sets the P bit
to poll the secondary and transmits the I frame. The primary also assigns $N$ to $w$ for the
next $N$ window frame transmission and starts the F timer.

Table 4.4 System parameters

Parameter     Description                                               Unit
$C$           Link data bit rate                                        b/s
$p_b$         Link bit error rate                                       -
$p$           Frame error probability                                   -
$l$           I-frame data length                                       b
$l'$          S-frame length/I-frame overhead                           b
$t_I$         Transmission time of an I frame                           s
$t_{Imax}$    Transmission time of an I frame with 16 kb user data      s
$t_S$         Transmission time of an S frame                           s
$t_{ta}$      Minimum turnaround time                                   s
$t_{ack}$     Acknowledgement time                                      s
$T_{max}$     Maximum turnaround time                                   s
$t_{Fout}$    F-timer time-out period                                   s
$W_{max}$     Maximum window size                                       Frames
$N$           Window size                                               Frames
$D_f$         Frame throughput                                          Frames/s
$D_b$         Data throughput                                           b/s
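
Equation (4.43) is simple enough to check by hand, and the sketch below does so in code. It assumes that the I-frame transmission time is the data length plus the frame overhead divided by the link rate, $t_I = (l + l')/C$; that relation, the 48-bit overhead used in the example and the function names are assumptions made for illustration rather than values taken from the IrLAP specification.

```python
import math


def i_frame_time(data_bits, overhead_bits, rate_bps):
    """Assumed I-frame transmission time t_I = (l + l') / C, in seconds."""
    return (data_bits + overhead_bits) / rate_bps


def window_size(w_max, t_max, frame_time):
    """Window size N = min{W_max, floor(T_max / t_I)}."""
    return min(w_max, math.floor(t_max / frame_time))


# Example: 16 kb frames with a hypothetical 48-bit overhead and T_max = 500 ms.
n_sir = window_size(7, 0.5, i_frame_time(16384, 48, 115200))
n_vfir = window_size(127, 0.5, i_frame_time(16384, 48, 16e6))
```

Under these assumptions only three maximum-length frames fit into 500 ms at 115.2 kb/s, so the turnaround limit rather than $W_{max} = 7$ determines the window, whereas at 16 Mb/s the full window of 127 frames can be used.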

When the secondary station receives an I frame, it compares the received frame sequence
N_s value with the expected V_r value. If N_s equals V_r (the received frame is in sequence),
V_r is increased by 1 (modulo 8 or 128) and information data is extracted and passed to the
upper layer. If the received frame is not in sequence (one of the previous I frames in the current
window transmission was lost due to a CRC error), the frame is discarded and V_r remains
unchanged. The secondary station also checks the P bit. If the P bit is set then, as the current
model assumes that the secondary station never has information for transmission, it waits a
minimum turnaround time t_ta to allow for the hardware recovery latency and transmits an S
frame with the F bit set. The S frame's N_r field contains V_r, a value informing the primary
of the number of I frames received correctly and in sequence in the previous window
transmission. When the primary receives the S frame, it resumes I-frame transmission, as
transmission control was returned to the primary by means of the F bit. The primary first
compares the received S frame's N_r with its current V_s value. If N_r equals V_s (all frames in
the previous window transmission were received correctly by the secondary), the primary
transmits I frames containing new information data to the secondary. If N_r is not equal
to V_s, one or more I frames in the previous window transmission were lost. The primary
retransmits buffered frames starting from the indicated N_r position before new data can be
transmitted.

Figure 4.23 Information transfer procedure.

If the last I frame, which carries the P bit, is lost, the secondary station fails to respond as
it does not realize that it has transmission control. The situation is resolved by the expiration
of the primary's F timer. The primary realizes that the secondary failed to respond within the
agreed time period and transmits an S frame forcing the secondary to respond. In the current
model, S frames are considered small enough to always be received error-free.
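
The window exchange described above can be illustrated with a short sketch. It is a minimal model under stated assumptions, not the procedure of Figure 4.23 itself: the function name, its arguments and the representation of CRC errors as a set of lost positions are all hypothetical, and timing (t_ta, the F-timer period) is not modeled.

```python
def window_exchange(v_s, v_r, n, lost, modulus=8):
    """Simulate one window of n I frames and the secondary's S-frame reply.

    v_s  -- primary's send counter V_s at the start of the window
    v_r  -- secondary's receive counter V_r (next expected N_s)
    lost -- set of positions (0..n-1) in the window hit by a CRC error
    Returns (n_r, all_received, f_timer_expired).
    """
    p_bit_seen = False
    for k in range(n):
        n_s = (v_s + k) % modulus          # N_s carried by the k-th frame
        if k in lost:
            continue                       # frame never reaches the secondary
        if n_s == v_r:
            v_r = (v_r + 1) % modulus      # in sequence: accept, pass data up
        # otherwise the frame is out of sequence and is discarded
        if k == n - 1:
            p_bit_seen = True              # only the last frame carries the P bit
    v_s_end = (v_s + n) % modulus          # V_s after the whole window was sent
    # If the P bit was lost, the secondary stays silent until the primary's
    # F timer expires and an S frame with the P bit set forces a reply.
    f_timer_expired = not p_bit_seen
    n_r = v_r                              # N_r reported in the S frame
    return n_r, n_r == v_s_end, f_timer_expired
```

For example, `window_exchange(0, 0, 4, {2})` reports N_r = 2, so the primary would retransmit from frame 2 onwards, while `window_exchange(0, 0, 4, {3})` loses the polling frame and relies on the F timer.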

The saturation case model considered in this section can be summarized as follows. The
transmitting station always has information ready for transmission. As a result, it transmits
a window of N consecutive I frames and reverses the link direction by setting the P bit in
the last I frame. The receiver waits a minimum turnaround time and responds with an RR
S frame indicating the expected sequence number of the next frame. RR frames always have
the F bit set. The transmitter determines the number of frames correctly received before
any error(s) occurred and repeats the erroneous frame and the frames following it, in the
next window, followed by new frames to form a complete N-frame transmission. If the last
frame in a window transmission is lost, the receiver fails to respond as the P bit is lost.
When the F timer expires, the primary station sends an RR S frame with the P bit set, forcing
the secondary station to acknowledge correctly received frames. Additional discussion on
the infrared link access protocol can be found in References [62-65].
Introductory material on MAC is presented in Appendix B (please go to
www.wiley.com/go/glisic). Within this chapter we focus only on a few specific solutions in
WLAN, ad hoc and sensor networks.

5.1 WLAN ENHANCED DISTRIBUTED COORDINATION FUNCTION 

The last few years have witnessed an explosive growth in 802.11 WLAN [1]. Unfortunately,
the current 802.11 MAC does not possess any effective service differentiation capability,
because it treats all the upper-layer traffic in the same fashion. Hence, a special working
group, IEEE 802.11e [2-12], was established to enhance the 802.11 MAC to meet QoS
requirements for a wide variety of applications. The 802.11e EDCF (enhanced distributed
coordination function) is an extension of the basic DCF mechanism of current 802.11
(Chapter 1). Unlike DCF, EDCF is not a separate coordination function, but a part of a
single coordination function of 802.11e called the hybrid coordination function (HCF). The
HCF combines both DCF and PCF (point coordination function) from the current 802.11
specification with new QoS-specific enhancements. It uses EDCF and a polling mechanism
for contention-based and contention-free channel access, respectively. In EDCF, each
station can have multiple queues that buffer packets of different priorities. Each frame from
the upper layers bears a priority value which is passed down to the MAC layer. Up to eight
priorities are supported in an 802.11e station and they are mapped into four different access
categories (AC) at the MAC layer [3]. A set of EDCF parameters, namely the arbitration
interframe space (AIFS[AC]), minimum contention window size (CWMin[AC]) and
maximum contention window size (CWMax[AC]), is associated with each access category to
differentiate the channel access. AIFS[AC] is the number of time slots a packet of a given
AC has to wait after the end of a time interval equal to a short interframe spacing (SIFS)
duration before it can start the backoff process or transmit. After i (i > 0) collisions, the
backoff counter in 802.11e is selected uniformly from [1, 2^i × CWMin[AC]], until it reaches
the backoff stage i such that 2^i × CWMin[AC] = CWMax[AC]. At that point, the packet will
still be retransmitted, if a collision occurs, until the total number of retransmissions equals
the maximum number of allowable retransmissions (RetryLimit[AC]) specified in IST WSI
[13], with the backoff counters always chosen from the range [1, CWMax[AC]]. Since
multiple priorities exist within a single station, it is likely that they will collide with each other
when their backoff counters decrement to zero simultaneously. This phenomenon is called
an internal collision in 802.11e and is resolved by letting the highest priority involved in
the collision win the contention. Of course, it is still possible for this winning priority to
collide with packets from other station(s).
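
The backoff rule and the internal collision resolution described above can be sketched as follows. This is a minimal illustration, not the 802.11e reference behavior: the function names are hypothetical, the window for i = 0 is assumed to be CWMin[AC], and the convention that a lower index denotes a higher priority is an assumption made only for this example.

```python
import random

def contention_window(i, cw_min, cw_max):
    """Upper end of the backoff range after i collisions.

    The window doubles with every collision and is capped at cw_max.
    """
    return min((2 ** i) * cw_min, cw_max)

def draw_backoff(i, cw_min, cw_max, rng=random):
    """Draw a backoff counter uniformly from [1, contention window]."""
    return rng.randint(1, contention_window(i, cw_min, cw_max))

def resolve_internal_collision(counters):
    """Pick the queue that transmits when several counters hit zero.

    counters -- dict mapping priority index to its current backoff counter.
    Assumption: a lower index means a higher priority.
    Returns the winning priority, or None if no counter has expired.
    """
    expired = [prio for prio, count in counters.items() if count == 0]
    return min(expired) if expired else None
```

With CWMin/CWMax = 8/16, `contention_window(1, 8, 16)` already reaches the cap of 16, so every later retransmission draws from [1, 16].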

Performance example - the basic parameters used in simulation are: 

Packet payload size    8184 bits at 11 Mbps
MAC header             272 bits at 11 Mbps
PHY header             192 bits at 1 Mbps
ACK                    112 bits + PHY header
Propagation delay      1 μs
Slot time              20 μs
SIFS                   10 μs
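
As a rough worked example, the parameters listed above can be turned into channel times. The sketch below is only illustrative: the function names are hypothetical, and the AIFS duration is computed as SIFS plus AIFS[AC] slots, following the definition of AIFS[AC] given earlier in this section.

```python
SLOT_US = 20.0   # slot time in microseconds
SIFS_US = 10.0   # short interframe spacing in microseconds

def airtime_us(bits, rate_mbps):
    """Time on air in microseconds: bits divided by rate in Mbit/s."""
    return bits / rate_mbps

def data_frame_time_us():
    """Payload and MAC header at 11 Mbps, PHY header at 1 Mbps."""
    return airtime_us(8184, 11) + airtime_us(272, 11) + airtime_us(192, 1)

def aifs_time_us(aifs_slots):
    """Waiting time before backoff: SIFS followed by AIFS[AC] slots."""
    return SIFS_US + aifs_slots * SLOT_US
```

Under these assumptions a data frame occupies the channel for roughly 961 μs, and a queue with AIFS = 3 waits one slot (20 μs) longer than a queue with AIFS = 2 before it may start its backoff.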

The QoS parameters, i.e. CWMin[AC], CWMax[AC] and AIFS[AC], used in the following
discussion are similar to the values specified by the IEEE 802.11e Working Group for voice
and video traffic [12].

Simulations with the same parameters are also presented, for example, in Tao and Panwar
[14]. In Figure 5.1, each station contains two priorities, which are only differentiated by
the internal collision resolution algorithm discussed before. As expected, internal collision
resolution by itself can provide some differentiation for channel access between different
priorities.

The two priorities in each station are further differentiated by AIFS and CWMin/CWMax
in Figures 5.2 and 5.3, respectively. It can be seen that AIFS may have a more marked
effect on service differentiation than CWMin/CWMax alone. When all QoS mechanisms in
802.11e EDCF are enabled, the resulting throughput is shown in Figure 5.4. Comparing
Figure 5.4 with Figure 5.2, we find that the QoS differentiation effect of AIFS is almost
identical to the aggregate impact of AIFS plus CWMin/CWMax.

All the figures reveal that, as the number of stations in the network grows, the throughput
for each priority as well as the total throughput drops fairly fast, especially when the
QoS-specific parameters are small. Under the heavy-load assumption, the throughput for low
priority often decreases to almost zero before the number of stations reaches 10. In this
region, the high priority traffic dominates.



5.2 ADAPTIVE MAC FOR WLAN WITH ADAPTIVE ANTENNAS 

Smart antennas (or adaptive array antennas) have some unique properties that enable us
to achieve high throughputs in ad hoc network scenarios. A transmitter equipped with a
smart antenna can form a directed beam towards its receiver and a receiver can similarly
form a directed beam towards the sender, resulting in very high gain. A receiver can also
identify the direction of multiple simultaneous transmitters by running direction-of-arrival
(DOA) algorithms and use this information to determine the directions in which it should
place the nulls. Placing nulls effectively cancels out the impact of interfering transmitters.
In this section we present two simple MAC protocols that explicitly use these three
properties of smart antennas (beamforming, DOA and nulling) to achieve high throughputs.
The two protocols are called Smart-Aloha [15, 16] and Smart-802.11b and, as the names
imply, they are modifications of the well-known Aloha and 802.11b protocols. In both cases,
functionality at the MAC layer is added to allow it to directly control the antenna: the MAC
layer controls the direction of the beam and the direction of the nulls. In addition, the
antenna provides the MAC layer with DOA information for all transmissions it can hear,
along with signal strength information. The main results are that these protocols show a
very high throughput while maintaining fairness. Table 5.1 summarizes the main throughput
results of the MAC protocols designed for directional antenna equipped nodes [15-26].

Figure 5.1 Two priorities differentiated only by internal collision resolution: CWMin/
Max[0] = 8/16, CWMin/Max[1] = 8/16, AIFS[0] = 2, AIFS[1] = 2.

Figure 5.2 Two priorities with different AIFS values: CWMin/Max[0] = 8/16, CWMin/
Max[1] = 8/16, AIFS[0] = 2, AIFS[1] = 3.

Figure 5.3 Two priorities with different CWMin and CWMax: AIFS[0] = 2, AIFS[1] = 2,
CWMin/Max[0] = 8/16, CWMin/Max[1] = 10/20.

Figure 5.4 Two priorities with different CWMin, CWMax and AIFS: AIFS[0] = 2,
AIFS[1] = 3, CWMin/Max[0] = 8/16, CWMin/Max[1] = 10/20.

(Figures 5.1-5.4 plot high priority, low priority and total curves, from both simulation and
analysis, against the number of nodes.)

5.2.1 Description of the protocols 

Consider the case when a node a needs to transmit a packet to node b which is its one-hop 
neighbor. It is assumed that a knows the angular direction of b and it can therefore form a 
beam in the direction of b. However, to maximize SINR, b should also form a beam towards 
a and form nulls in the direction of all other transmitters. In order to do this, b needs to know 
two things - first, that a is attempting to transmit to it, and second, the angular direction 
of all the other transmitters that interfere at b. The two protocols discussed in this section 
answer these two questions somewhat differently. 

Smart-Aloha is a slightly modified version of the standard Slotted-Aloha protocol. To 
transmit a packet, a transmitter forms a beam towards its receiver and begins transmission. 
However, it prefaces its packet transmission with the transmission of a short (8 byte) pure 
tone (this is a simple sinusoid). Idle nodes remain in an omnidirectional mode and receive 
a complex sum of all such tones (note that the tones are identical for all nodes and thus 
we cannot identify the nodes based on the tone) and run a DOA algorithm to identify the 
direction and strength of the various signals. An idle node then beamforms in the direction 
of the maximum received signal strength and forms nulls in other directions, and receives 
the transmitted packet. If the receiver node was the intended destination for the packet, it 
immediately sends an ACK using the already formed directed beam. On the other hand, if 
the packet was intended for some other node, then the receiver discards it. 

A sender waits for an ACK immediately after transmission of the packet and if it does
not receive the ACK, it enters backoff in the standard way. Thus, the Smart-Aloha protocol
follows a Tone/Packet/ACK sequence. The intuition behind the receiver beamforming in the
direction of the maximum signal is that, because of the directivity of the antenna, there is a
high probability that it is the intended recipient for the packet. However, in some cases, as
in Figure 5.5, the receiver d incorrectly beamforms towards a because a's signal is stronger
than b's. While this is not a serious problem in most cases, we can envision scenarios where
the b -> d transmission gets starved due to a large volume of a -> c traffic.

Figure 5.5 False beamforming. (Reproduced by permission of IEEE [27].)

A possible optimization is a single-entry cache scheme which works as follows:

• If a node beamforms incorrectly in a given timeslot, it remembers that direction in a 
single-entry cache. 

• In the next slot, if the maximum signal strength is again in the direction recorded in 
the single-entry cache, then the node ignores that direction and beamforms towards the 
second strongest signal. If the node receives a packet correctly (i.e. it was the intended 
recipient), it does not change the cache. If it receives a packet incorrectly, it updates the 
cache with this new direction. 

• If there is no packet in a slot from the direction recorded in the cache, the cache is reset. 
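
The three cache rules above can be sketched in a few lines. This is a minimal illustration under stated assumptions: the function names are hypothetical, directions are represented by arbitrary labels, and when the cached direction is the only signal present the node is assumed to stay idle for that slot.

```python
def choose_beam(signals, cache):
    """Pick the direction to beamform towards in the current slot.

    signals -- dict mapping direction to received tone strength
    cache   -- direction remembered from a false beamforming, or None
    """
    if not signals:
        return None
    ranked = sorted(signals, key=signals.get, reverse=True)
    if cache is not None and ranked[0] == cache:
        # Ignore the cached direction and try the second strongest signal.
        return ranked[1] if len(ranked) > 1 else None
    return ranked[0]

def update_cache(cache, signals, beam, was_intended):
    """Apply the cache rules after the slot has been received."""
    if cache is not None and cache not in signals:
        cache = None                 # no packet from the cached direction: reset
    if beam is not None and not was_intended:
        cache = beam                 # false beamforming: remember this direction
    return cache                     # correct reception leaves the cache as it is
```

In the situation of Figure 5.5, node d would first beamform towards a, cache that direction after discarding the packet, and in the next slot beamform towards b even though a's tone is still stronger.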

The Smart-802.11b protocol is based on the 802.11b standard. As in the case of the
Smart-Aloha protocol, transmitters beamform towards their receivers and transmit a short
sender-tone to initiate communication. However, unlike Smart-Aloha, the transmitter does not
immediately follow the tone with a packet. Instead, it waits for a receiver-tone and only
then transmits its packet. After transmission of a packet, it waits for the receipt of an ACK.
If there is no ACK, it enters backoff as in 802.11b. Figure 5.6 presents a state diagram of
the tone-based protocol. The behavior of the protocol in various states can be summarized as
follows.

5.2.1.1 Idle 

In case a node has no packet to send, it will remain in the idle state and set its antenna to 
operate in the omnidirectional mode. If it receives a sender-tone from some other node, it 
will move into the data receive wait state. On the other hand, if it wishes to send data, it will 
beamform in the direction of the receiver. It chooses a random number [0-CW] and sets
the CW (contention window) timer. When the CW timer expires, it sends a sender-tone
in the direction of the receiver and moves to the ACK wait state. If, before the CW timer 
expires, the node receives a sender-tone from another node, it will freeze its CW timer and 
move to data receive wait state. 

5.2.1.2 Data receive wait 

A node will move to this state in the event it receives a sender-tone. The node will beamform
towards the sender and then randomly defer transmitting the receiver-tone by choosing a
random waiting period of [0-32] x 20 μs. The reason for deferring the reply is to
minimize the chance of several receiver-tones colliding at the sender. After transmitting a
receiver-tone, the node remains in this state for 2τ (twice the maximum propagation delay + tone
transmission time). If it does not hear a transmission, it returns to the idle state. If it hears the
start of a transmission, it remains in this state and receives the packet. It then discards the
packet if the packet was meant for some other node. If, however, the packet was meant for
it, then it sends an ACK.

Figure 5.6 State diagram of the Smart-802.11b protocol.



5.2.1.3 ACK wait

If the sender node receives a receiver-tone before the tone RTT timer goes off (which is 
twice the tone transmission time plus propagation delay), it will transmit the data packet. 
Reception of a valid ACK will move the node to the idle state, and if there are packets in
the queue then it will schedule the one at the head of the queue. The node will move to the
backoff state under two conditions: (1) a receiver-tone did not arrive; (2) an ACK was not 
received following transmission of the data packet. 



5.2.1.4 Backoff 

The node computes a random backoff interval (as in 802.11) and remains in backoff for this
time period (it also resets its antenna to omnidirectional mode). If, however, a sender-tone
is received, it freezes the backoff timer and enters the data receive wait state. If the node
is in backoff, upon expiration of the timer, it retransmits the sender-tone, increments the
retransmit counter and enters the ACK wait state. A packet is discarded after the retransmit
counter exceeds Max Retransmit = 7, as in the IEEE 802.11 standard.
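
The four states described above can be collected into a small transition table. The sketch below is a simplified reading of the text, not a reproduction of Figure 5.6: the event names and the `Node` class are hypothetical, timers and antenna control are left out, and the return to the idle state after a packet has been received or dropped is an assumption.

```python
from enum import Enum, auto

class State(Enum):
    IDLE = auto()
    DATA_RECEIVE_WAIT = auto()
    ACK_WAIT = auto()
    BACKOFF = auto()

class Event(Enum):
    SENDER_TONE_HEARD = auto()
    CW_TIMER_EXPIRED = auto()       # node then sends its sender-tone
    RECEIVER_TONE_RECEIVED = auto() # node then transmits the data packet
    RECEIVER_TONE_TIMEOUT = auto()
    ACK_RECEIVED = auto()
    ACK_TIMEOUT = auto()
    NO_TRANSMISSION_HEARD = auto()
    PACKET_RECEIVED = auto()
    BACKOFF_EXPIRED = auto()        # node then retransmits its sender-tone

MAX_RETRANSMIT = 7

TRANSITIONS = {
    (State.IDLE, Event.SENDER_TONE_HEARD): State.DATA_RECEIVE_WAIT,
    (State.IDLE, Event.CW_TIMER_EXPIRED): State.ACK_WAIT,
    (State.DATA_RECEIVE_WAIT, Event.NO_TRANSMISSION_HEARD): State.IDLE,
    (State.DATA_RECEIVE_WAIT, Event.PACKET_RECEIVED): State.IDLE,  # assumption
    (State.ACK_WAIT, Event.RECEIVER_TONE_RECEIVED): State.ACK_WAIT,
    (State.ACK_WAIT, Event.RECEIVER_TONE_TIMEOUT): State.BACKOFF,
    (State.ACK_WAIT, Event.ACK_TIMEOUT): State.BACKOFF,
    (State.ACK_WAIT, Event.ACK_RECEIVED): State.IDLE,
    (State.BACKOFF, Event.SENDER_TONE_HEARD): State.DATA_RECEIVE_WAIT,
    (State.BACKOFF, Event.BACKOFF_EXPIRED): State.ACK_WAIT,
}

class Node:
    def __init__(self):
        self.state = State.IDLE
        self.retransmits = 0
        self.dropped = 0

    def handle(self, event):
        """Apply one event; events with no listed transition are ignored."""
        if self.state is State.BACKOFF and event is Event.BACKOFF_EXPIRED:
            self.retransmits += 1
            if self.retransmits > MAX_RETRANSMIT:
                # Packet is discarded; returning to idle is an assumption.
                self.dropped += 1
                self.retransmits = 0
                self.state = State.IDLE
                return self.state
        next_state = TRANSITIONS.get((self.state, event))
        if next_state is not None:
            self.state = next_state
            if event is Event.ACK_RECEIVED:
                self.retransmits = 0
        return self.state
```

A table-driven form keeps the protocol rules in one place, which makes it easy to compare them against the state descriptions one by one.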

The reception of a data packet by a node may be interfered with by transmissions of
sender-tones, receiver-tones or other data packets (since the protocol does not take care of
hidden terminals). A node engaged in receiving a data packet can dynamically form nulls
towards new interferers, but this process takes some time (we model this time as the length
of a sender-tone). Thus, the data packet will have errors due to this interference. This error
is mitigated by relying on FEC codes as used in IEEE 802.11e, where (224, 208) shortened
RS codes are used. In 802.11e, an MAC packet is split into blocks of 208 octets and each
block is separately coded using an RS encoder. A (48, 32) RS code, which is also a shortened
RS code, is used for the MAC header, and CRC-32 is used for the FCS.
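
The coding overhead implied by these block sizes can be estimated with a short calculation. The sketch below is only an approximation under a stated assumption: a final block shorter than 208 octets is taken to carry the same 16 parity octets as a full block. The function and constant names are hypothetical.

```python
import math

PARITY_PER_BLOCK = 224 - 208   # 16 parity octets added per (224, 208) block
CODED_HEADER_OCTETS = 48       # a 32-octet header becomes 48 octets

def coded_payload_octets(payload_octets):
    """Payload size after RS coding, one codeword per 208-octet block."""
    blocks = math.ceil(payload_octets / 208)
    return payload_octets + PARITY_PER_BLOCK * blocks
```

For instance, a 2048-octet payload needs ten blocks and grows to 2208 octets, an overhead of a little under 8 %.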

Performance example - the simulation parameters are:

Background noise + ambient noise    143 dB
Propagation model                   free space
Bandwidth                           1000 kHz
Min frequency                       2402 MHz
Data rate                           2000 kbps
Carrier sensing threshold           +3 dB
Minimum SINR                        9 dB
Bit error                           based on BPSK modulation curve
Maximum radio range                 250 m
Packet size                         16 kb
Simulation time                     200 s
Single hop                          number of nodes 20, area 100 x 100 m
Multihop                            number of nodes 100, area 1500 x 1500 m

The existing 802.11b implementation in OPNET is modified to create Smart-802.11b. The
modifications included adding the two tones (sender and receiver) as well as changing the
FEC to the 802.11e specification.

The performance of the protocol is presented for a single-hop case with 20 nodes and a
five-hop case with 100 nodes using 16 KB packets. Sixteen antenna elements (for an
effective beamwidth of 40°) were used. Figure 5.7 presents the aggregate one-hop throughput as
a function of arrival rate for the one-hop case. One can see that 802.11b achieves a maximum
throughput of 1 Mbps while Smart-802.11b achieves a high of 8.5 Mbps and Smart-Aloha
achieves a high of approximately 10.5 Mbps. In fact, the throughput of Smart-802.11b and
Smart-Aloha increases with arrival rate because of good spatial reuse of the channel. Figure
5.8 plots the aggregate throughput of the protocol for the 100-node five-hop case; 802.11b
reaches a maximum throughput of well below 0.5 Mbps while Smart-802.11b reaches a
maximum of 50 Mbps and Smart-Aloha reaches a maximum throughput of 60 Mbps. Again,
the better spatial reuse of the channel given the directivity of the antenna is the reason for
this performance improvement.

Figure 5.7 Single-hop case with 20 nodes. (Reproduced by permission of IEEE [27].)

Figure 5.8 Five-hop case with 100 nodes. (Reproduced by permission of IEEE [27].)

5.3 MAC FOR WIRELESS SENSOR NETWORKS

This section discusses an MAC protocol designed for wireless sensor networks (S-MAC). As
will be discussed in Chapter 14, wireless sensor networks use battery-operated computing
and sensing devices. A network of these devices will collaborate for a common application
such as environmental monitoring. Sensor networks are expected to be deployed in an ad
hoc fashion, with nodes remaining largely inactive for long periods, but becoming suddenly
active when something is detected. These characteristics of sensor networks and applications
motivate an MAC that is different from traditional wireless MACs such as IEEE 802.11,
described in previous sections, in several ways. Energy conservation and self-configuration
are primary goals, while per-node fairness and latency are less important. S-MAC uses
a few novel techniques to reduce energy consumption and support self-configuration.
It enables low-duty-cycle operation in a multihop network. Nodes form virtual clusters
based on common sleep schedules to reduce control overhead and enable traffic-adaptive
wake-up. S-MAC uses in-channel signaling to avoid overhearing unnecessary traffic.
Finally, S-MAC applies message passing to reduce contention latency for applications that
require in-network data processing.

Woo and Culler [28] examined different configurations of carrier sense multiple access 
(CSMA) and proposed an adaptive rate control mechanism, whose main goal is to achieve 
fair bandwidth allocation to all nodes in a multihop network. There is also some work
on the low-duty-cycle operation of nodes, which is closely related to S-MAC. The first
example is Piconet [29], which is an architecture designed for low-power ad hoc wireless 
networks. Piconet also puts nodes into periodic sleep for energy conservation. However, 
there is no coordination and synchronization among nodes about their sleep and listen 
time. The scheme to enable the communications among neighboring nodes is to let a node 
broadcast its address when it wakes up from sleeping. If a sender wants to talk to a neighbor, 
it must keep listening until it receives the neighbor's broadcast. In contrast, S-MAC tries 
to coordinate and synchronize neighbors' sleep schedules to reduce latency and control 
overhead. 

Perhaps the power-save (PS) mode in IEEE 802.11 DCF is the work most closely related to
the low-duty-cycle operation in S-MAC. Nodes in PS mode periodically listen and sleep,
just as in S-MAC. The sleep schedules of all nodes in the network are synchronized
together. The main difference from S-MAC is that the PS mode in 802.11 is designed for a
single-hop network, where all nodes can hear each other, simplifying the synchronization.

As observed by Woo and Culler [28], in multihop operation, the 802.11 PS mode may
have problems in clock synchronization, neighbor discovery and network partitioning. In
fact, the 802.11 MAC in general is designed for a single-hop network, and there are questions
about its performance in multihop networks [30]. In comparison, S-MAC is designed for
multihop networks, and does not assume that all nodes are synchronized together. Finally,
although 802.11 defines PS mode, it provides very limited policy about when to sleep,
whereas in S-MAC, a complete system is defined. Tseng et al. [31] proposed three sleep
schemes to improve the PS mode in IEEE 802.11 for its operation in multihop networks.
Among them the one named periodically fully awake interval is the closest to the scheme of
periodic listen and sleep in S-MAC. However, their scheme does not synchronize the sleep
schedules of any neighboring nodes. The control overhead and latency can be large. For
example, to send a broadcast packet, the sender has to explicitly wake up each individual
neighbor before it sends out the actual packet. Without synchronization, each node has to
send beacons more frequently to prevent long-term clock drift.



5.3.1 S-MAC protocol design 

S-MAC includes approaches to reducing energy consumption from all the sources of energy 
waste such as: (a) idle listening; (b) collision; and (c) overhearing and control overhead. 
Before describing the components in S-MAC, we first summarize assumptions about the 
wireless sensor network and its applications. 

Sensor networks will consist of large numbers of nodes to take advantage of short-range,
multihop communications to conserve energy (see Chapter 14). Most communications will
occur between nodes as peers, rather than to a single base station. In-network processing is
critical to network lifetime, and implies that data will be processed as whole messages in a
store-and-forward fashion. Packet or fragment-level interleaving from multiple sources only
increases overall latency. Finally, we expect that applications will have long idle periods
and can tolerate latency on the order of network messaging time.

Figure 5.9 Neighboring nodes A and B have different schedules. They synchronize with
nodes C and D respectively.



5.3.2 Periodic listen and sleep 

As stated above, in many sensor network applications, nodes are idle for a long time if no 
sensing event happens. Given the fact that the data rate is very low during this period, it is 
not necessary to keep nodes listening all the time. S-MAC reduces the listen time by putting 
nodes into periodic sleep state. Each node sleeps for some time, and then wakes up and 
listens to see if any other node wants to talk to it. During sleeping, the node turns off its 
radio, and sets a timer (alarm clock) to wake itself later. 

A complete cycle of listen and sleep is called a frame. The listen interval is normally 
fixed according to physical-layer and MAC-layer parameters, such as the radio bandwidth 
and the contention window size. The duty cycle is defined as the ratio of the listen interval 
to the frame length. The sleep interval can be changed according to different application 
requirements, which actually changes the duty cycle. For simplicity, these values are the 
same for all nodes. All nodes are free to choose their own listen/sleep schedules. However, 
to reduce control overhead, we prefer neighboring nodes to synchronize together. That is, 
they listen at the same time and go to sleep at the same time. It should be noted that not
all neighboring nodes can synchronize together in a multihop network. Two neighboring
nodes A and B may have different schedules if they must synchronize with different nodes,
C and D, respectively, as shown in Figure 5.9.
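
The relation between listen interval, sleep interval and duty cycle defined above can be written out directly. The sketch below is a small illustration with hypothetical function names; the numbers in the usage note are arbitrary and are not S-MAC defaults.

```python
def duty_cycle(listen_ms, sleep_ms):
    """Ratio of the listen interval to the frame length (listen + sleep)."""
    return listen_ms / (listen_ms + sleep_ms)

def sleep_for_duty_cycle(listen_ms, target):
    """Sleep interval that yields the target duty cycle for a fixed listen time."""
    return listen_ms * (1.0 - target) / target
```

With a listen interval fixed at 100 ms, a sleep interval of 900 ms gives a duty cycle of 0.1, and lengthening the sleep interval is the only way to lower it further.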

Nodes exchange their schedules by periodically broadcasting a SYNC packet to their 
immediate neighbors. A node talks to its neighbors at their scheduled listen time, thus 
ensuring that all neighboring nodes can communicate even if they have different schedules. 
In Figure 5.9, for example, if node A wants to talk to node B, it waits until B is listening. 
The period for a node to send a SYNC packet is called the synchronization period. One 
characteristic of S-MAC is that it forms nodes into a flat, peer-to-peer topology. Unlike 
clustering protocols, S-MAC does not require coordination through cluster heads. Instead, 
nodes form virtual clusters around common schedules, but communicate directly with peers. 
One advantage of this loose coordination is that it can be more robust to topology change 
than cluster-based approaches. The downside of the scheme is the increased latency due 
to the periodic sleeping. Furthermore, the delay can accumulate on each hop. Later on, a 
technique that is able to significantly reduce such latency will be presented. 



5.3.3 Collision avoidance 

If multiple neighbors want to talk to a node at the same time, they will try to send when
the node starts listening. In this case, they need to contend for the medium. Among
contention protocols, 802.11 does a very good job on collision avoidance. S-MAC follows
similar procedures, including virtual and physical carrier sense, and the RTS/CTS (request
to send/clear to send) exchange for the hidden terminal problem [32]. There is a duration
field in each transmitted packet that indicates how long the remaining transmission will be.
If a node receives a packet destined to another node, it knows how long to keep silent from
this field. The node records this value in a variable called the network allocation vector
(NAV) [33] and sets a timer for it. Every time the timer fires, the node decrements its NAV
until it reaches zero. Before initiating a transmission, a node first looks at its NAV. If its
value is not zero, the node determines that the medium is busy. This is called 'virtual carrier
sense'. Physical carrier sense is performed at the physical layer by listening to the channel
for possible transmissions. Carrier sense time is randomized within a contention window
to avoid collisions and starvation. The medium is determined as free if both virtual and
physical carrier sense indicate that it is free.
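
The NAV bookkeeping described above can be sketched as a small class. This is a minimal illustration with hypothetical names: time is counted in abstract timer ticks, and keeping the larger of the current NAV and a newly overheard duration is an assumption borrowed from common 802.11 practice rather than something stated in the text.

```python
class VirtualCarrierSense:
    """Tracks the network allocation vector (NAV) of one node."""

    def __init__(self):
        self.nav = 0

    def overhear(self, duration):
        """Record the duration field of a packet destined to another node."""
        self.nav = max(self.nav, duration)   # assumption: never shorten the NAV

    def tick(self, elapsed=1):
        """Timer fired: count the NAV down, stopping at zero."""
        self.nav = max(0, self.nav - elapsed)

    def medium_free(self, physical_busy):
        """Free only if both virtual and physical carrier sense agree."""
        return self.nav == 0 and not physical_busy
```

A node would call `medium_free` before every transmission attempt and go to sleep when it returns False.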

All senders perform carrier sense before initiating a transmission. If a node fails to get 
the medium, it goes to sleep and wakes up when the receiver is free and listening again. 
Broadcast packets are sent without using RTS/CTS. Unicast packets follow the sequence of 
RTS/CTS/DATA/ACK between the sender and the receiver. After the successful exchange 
of RTS and CTS, the two nodes will use their normal sleep time for data packet transmission. 
They do not follow their sleep schedules until they finish the transmission. With the
low-duty-cycle operation and the contention mechanism during each listen interval, S-MAC
effectively addresses the energy waste due to idle listening and collisions. In the next section, 
details of the periodic sleep coordinated among neighboring nodes will be presented. Two 
techniques will be presented that further reduce the energy waste due to overhearing and 
control overhead. 

5.3.4 Coordinated sleeping 

Periodic sleeping effectively reduces energy waste on idle listening. In S-MAC, nodes 
coordinate their sleep schedules rather than randomly sleep on their own. This section 
details the procedures that all nodes follow to set up and maintain their schedules. It also
presents a technique to reduce latency due to the periodic sleep on each node. 

5.3.5 Choosing and maintaining schedules 

Before each node starts its periodic listen and sleep, it needs to choose a schedule and 
exchange it with its neighbors. Each node maintains a schedule table that stores the schedules 
of all its known neighbors. It follows the steps below to choose its schedule and establish 
its schedule table. 

(1) A node first listens for a fixed amount of time, which is at least the synchronization 
period. If it does not hear a schedule from another node, it immediately chooses 
its own schedule and starts to follow it. Meanwhile, the node tries to announce the 
schedule by broadcasting a SYNC packet. Broadcasting a SYNC packet follows 
the normal contention procedure. The randomized carrier sense time reduces the 
chance of collisions on SYNC packets. 

(2) If the node receives a schedule from a neighbor before choosing or announcing its 
own schedule, it follows that schedule by setting its schedule to be the same. Then 
the node will try to announce its schedule at its next scheduled listen time. 
(3) There are two cases where a node receives a different schedule after it chooses and
announces its own schedule. If the node has no other neighbors, it will discard its 
current schedule and follow the new one. If the node already follows a schedule 
with one or more neighbors, it adopts both schedules by waking up at the listen 
intervals of the two schedules. 
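
The three steps above can be sketched as a small schedule table. This is an illustrative reading of the rules under stated assumptions, not the S-MAC implementation: the class and method names are hypothetical, a schedule is represented by an arbitrary value (for example the offset of its next listen time), SYNC broadcasting is not modeled, and 'no other neighbors' is interpreted as no known neighbor other than the sender of the new schedule.

```python
class ScheduleTable:
    """Schedules a node follows plus the schedules of its known neighbors."""

    def __init__(self):
        self.followed = []     # schedules this node wakes up for
        self.neighbors = {}    # neighbor id -> schedule announced by it

    def listen_timeout(self, own_schedule):
        """Step 1: nothing heard during initial listening, choose own schedule."""
        if not self.followed:
            self.followed = [own_schedule]

    def sync_received(self, sender, schedule):
        """Steps 2 and 3: react to a schedule carried in a SYNC packet."""
        had_other_neighbors = any(n != sender for n in self.neighbors)
        self.neighbors[sender] = schedule
        if not self.followed:
            self.followed = [schedule]           # step 2: follow the neighbor
        elif schedule not in self.followed:
            if had_other_neighbors:
                self.followed.append(schedule)   # step 3: adopt both schedules
            else:
                self.followed = [schedule]       # step 3: switch to the new one
```

A node that ends up with two entries in `followed` corresponds to a border node such as A or B in Figure 5.9.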

To illustrate this algorithm, consider a network where all nodes can hear each other. The 
node that starts first will pick up a schedule first, and its broadcast will synchronize all its 
peers on its schedule. If two or more nodes start first at the same time, they will finish initial 
listening at the same time, and will choose the same schedule independently. No matter 
which node sends out its SYNC packet first (wins the contention), it will synchronize the 
rest of the nodes. 

However, two nodes may independently assign schedules if they cannot hear each other 
in a multihop network. In this case, those nodes on the border of two schedules will adopt 
both. For example, nodes A and B in Figure 5.9 will wake up at the listen time of both 
schedules. In this way, when a border node sends a broadcast packet, it only needs to send 
it once. The disadvantage is that these border nodes have less time to sleep and consume 
more energy than others. 

Another option is to let a border node adopt only one schedule - the one it receives 
first. Since it knows that some other neighbors follow another schedule, it can still talk to 
them. However, for broadcasting, it needs to send twice to the two different schedules. The 
advantage is that the border nodes have the same simple pattern of periodic listen and sleep 
as other nodes. 

It is expected that nodes only rarely see multiple schedules, since each node tries to 
follow an existing schedule before choosing an independent one. However, a new node may 
still fail to discover an existing neighbor for several reasons. The SYNC packet from the 
neighbor could be corrupted by collisions or interference. The neighbor may have delayed 
sending a SYNC packet due to the busy medium. If the new node is on the border of two 
schedules, it may only discover the first one if the two schedules do not overlap. 

To prevent two neighbors from missing each other forever when they follow completely 
different schedules, S-MAC introduces periodic neighbor discovery, i.e. each node 
periodically listens for the whole synchronization period. The frequency with which a node 
performs neighbor discovery depends on the number of neighbors it has. If a node does 
not have any neighbors, it performs neighbor discovery more aggressively than in the case 
where it has many neighbors. Since the energy cost is high during the neighbor discovery, it 
should not be performed too often. In a typical implementation, the synchronization period 
is 10 s, and a node performs neighbor discovery every 2 min if it has at least one neighbor. 

5.3.6 Maintaining synchronization 

Since neighboring nodes coordinate their sleep schedules, the clock drift on each node can 
cause synchronization errors. Two techniques make S-MAC robust to such errors: 
(1) all exchanged timestamps are relative rather than absolute; and (2) the listen period is 
significantly longer than the clock error or drift. For example, the listen time of 0.5 s is 
more than 10 times longer than typical clock drift rates. Compared with TDMA schemes with 
very short time slots, S-MAC requires much looser time synchronization. Although the long 
listen time can tolerate fairly large clock drift, neighboring nodes still need to periodically 
update each other with their schedules to prevent long-term clock drift. The synchronization 
period can be quite long. The measurements show that the clock drift between two nodes 
does not exceed 0.2 ms/s. 

Figure 5.10 Timing relationship between a receiver and different senders. CS stands for 
carrier sense. 
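
A short calculation shows why a long synchronization period is tolerable. The sketch 
below combines the 0.2 ms/s drift bound with the 10 s synchronization period and the 
0.5 s listen time quoted in the text; pairing these three figures, and assuming that 
two neighbors resynchronize once per synchronization period, are assumptions made for 
illustration only. 

```python
DRIFT_RATE = 0.2e-3   # seconds of drift per second (the 0.2 ms/s bound from the text)
SYNC_PERIOD = 10.0    # seconds between schedule updates (assumed, from Section 5.3.5)
LISTEN_TIME = 0.5     # seconds, the example listen time


def worst_case_drift(rate: float, interval: float) -> float:
    """Largest clock offset that can build up between two schedule updates."""
    return rate * interval


drift = worst_case_drift(DRIFT_RATE, SYNC_PERIOD)   # about 2 ms
margin = LISTEN_TIME / drift                        # listen time is roughly 250x larger
```

Under these assumptions the accumulated offset stays at the millisecond level, far 
below the listen time, which is consistent with the claim that S-MAC needs much looser 
synchronization than TDMA schemes with short slots. 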

As mentioned earlier, schedule updating is accomplished by sending a SYNC packet. 
The SYNC packet is very short, and includes the address of the sender and the time of its next 
sleep. The next sleep time is relative to the moment that the sender starts transmitting the 
SYNC packet. When a receiver gets the time from the SYNC packet, it subtracts the packet 
transmission time and uses the new value to adjust its timer. In order for a node to receive 
both SYNC packets and data packets, its listen interval is divided into two parts. The first 
one is for SYNC packets, and the second one is for data packets, as shown in Figure 5.10. 
Each part has a contention window with many time slots for senders to perform carrier 
sense. For example, if a sender wants to send a SYNC packet, it starts carrier sense when 
the receiver begins listening. It randomly selects a time slot to finish its carrier sense. If it 
has not detected any transmission by the end of that time slot, it wins the contention and 
starts sending its SYNC packet. The same procedure is followed when sending data packets. 
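
The timer adjustment and the slotted contention described above can be sketched as 
follows. The function names, the time units and the rule that equal slot choices count 
as a collision are assumptions of this sketch, not details taken from the S-MAC code. 

```python
import random


def adjust_sleep_timer(next_sleep_in_sync: float, tx_time: float) -> float:
    """Time left until the sender sleeps, seen by the receiver after reception.

    The SYNC value is relative to the start of its transmission, so the receiver
    subtracts the packet transmission time.
    """
    return next_sleep_in_sync - tx_time


def pick_carrier_sense_slot(window_slots: int, rng=random) -> int:
    """Randomly select the slot at which a sender finishes its carrier sense."""
    return rng.randrange(window_slots)


def contention_winner(chosen_slots: dict):
    """Return the sender with the earliest slot, or None if the earliest slot is shared."""
    earliest = min(chosen_slots.values())
    winners = [name for name, slot in chosen_slots.items() if slot == earliest]
    return winners[0] if len(winners) == 1 else None


remaining = adjust_sleep_timer(next_sleep_in_sync=0.100, tx_time=0.004)
winner = contention_winner({"sender1": 3, "sender2": 7})
```

The sender that finishes carrier sense first finds the medium idle and transmits; the 
other sender detects that transmission before its own slot ends and defers. 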

Figure 5.10 shows the timing relationship for three possible situations in which a sender 
transmits to a receiver. Sender 1 only sends a SYNC packet. Sender 2 only sends a unicast 
data packet. Sender 3 sends both a SYNC and a data packet. 



5.3.7 Adaptive listening 

The scheme of periodic listen and sleep is able to significantly reduce the time spent on 
idle listening when traffic load is light. However, when a sensing event indeed happens, it is 
desirable that the sensing data can be passed through the network without too much delay. 
When each node strictly follows its sleep schedule, there is a potential delay on each hop, 
whose average value is proportional to the length of the frame. For this reason, a mechanism 
is introduced to switch the nodes from the low-duty-cycle mode to a more active mode in 
this case. 

S-MAC uses an important technique, called adaptive listen, to reduce the latency caused 
by the periodic sleep of each node in a multihop network. The basic idea is to let the node 
that overhears its neighbor's transmissions [ideally only request to send (RTS) or clear to 
send (CTS)] wake up for a short period of time at the end of the transmission. In this way, if 
the node is the next-hop node, its neighbor is able to immediately pass the data to it instead 
of waiting for its scheduled listen time. If the node does not receive anything during the 
adaptive listening, it will go back to sleep until its next scheduled listen time. 

Let us look at the timing diagram in Figure 5.10 again. If the next-hop node is a neighbor 
of the sender, it will receive the RTS packet. If it is only a neighbor of the receiver, it 
will receive the CTS packet from the receiver. Thus, both the neighbors of the sender and 
receiver will learn about how long the transmission is from the duration field in the RTS 
and CTS packets. So they are able to adaptively wake up when the transmission is over. 

The interval of the adaptive listening does not include the time for the SYNC packet as 
in the normal listen interval (see Figure 5.10). SYNC packets are only sent at scheduled 
listen times to ensure that all neighbors can receive them. To give priority to the SYNC packet, 
adaptive listen and transmission are not performed if the duration from the time the previous 
transmission is finished to the normally scheduled listen time is shorter than the adaptive 
listen interval. 
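
The wake-up rule for adaptive listening can be written as a small decision function. 
This is a sketch under stated assumptions: the function and parameter names are 
hypothetical, and all times are plain numbers in the same unit (for example ms). 

```python
from typing import Optional


def tx_end_from_duration(heard_at: float, duration_field: float) -> float:
    """End of the overheard transmission, taken from the RTS/CTS duration field."""
    return heard_at + duration_field


def adaptive_wakeup_time(tx_end: float,
                         next_scheduled_listen: float,
                         adaptive_listen_len: float) -> Optional[float]:
    """Return when to wake up adaptively, or None if the adaptive listen is skipped.

    The adaptive listen is skipped when the gap between the end of the transmission
    and the next scheduled listen time is shorter than the adaptive listen interval,
    so that the scheduled listen (and its SYNC part) keeps priority.
    """
    if next_scheduled_listen - tx_end < adaptive_listen_len:
        return None
    return tx_end


end = tx_end_from_duration(heard_at=120, duration_field=80)
wake = adaptive_wakeup_time(end, next_scheduled_listen=1000, adaptive_listen_len=100)
```

A node that wakes up adaptively and hears nothing simply goes back to sleep until its 
next scheduled listen time, as described earlier in this section. 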

One should note that not all next-hop nodes can overhear a packet from the previous 
transmission, especially when the previous transmission starts adaptively, i.e. not at the 
scheduled listen time. Therefore, if a sender starts a transmission by sending out an RTS 
packet during the adaptive listening, it might not get a CTS reply. In this case, it just goes 
back to sleep and will try again at the next normal listen time. 

5.3.8 Overhearing avoidance and message passing 

Collision avoidance is a basic task of MAC protocols. S-MAC adopts a contention-based 
scheme. It is common that any packet transmitted by a node is received by all its neighbors 
even though only one of them is the intended receiver. Overhearing makes contention-based 
protocols less efficient in energy than TDMA protocols. 

5.3.9 Overhearing avoidance 

In 802.11, each node keeps listening to all transmissions from its neighbors in order to 
perform effective virtual carrier sense. As a result, each node overhears many packets that 
are not directed to itself. It is a significant waste of energy, especially when node density is 
high and traffic load is heavy. S-MAC tries to avoid overhearing by letting interfering nodes 
go to sleep after they hear an RTS or CTS packet. Since DATA packets are normally much 
longer than control packets, the approach prevents neighboring nodes from overhearing 
long DATA packets and following ACKs. The question is which nodes should sleep when 
there is an active transmission in progress. 

In Figure 5.11, nodes A, B, C, D, E and F form a multihop network where each node 
can only hear the transmissions from its immediate neighbors. Suppose node A is currently 
transmitting a data packet to B. Which of the remaining nodes should go to sleep during 
this transmission? Remember that collision happens at the receiver. 

E   C   A   B   D   F 

Figure 5.11 Which nodes should sleep when node A is transmitting to B? 

It is clear that node D should sleep since its transmission interferes with B's reception. 
Nodes E and F do not produce interference, so they do not need to sleep. Should node C 
go to sleep? C is two hops away from B, and its transmission does not interfere with B's 
reception, so it is free to transmit to its other neighbors, like E. However, C is unable to get 
any reply from E, e.g. CTS or data, because E's transmission collides with A's transmission 
at node C. So C's transmission is simply a waste of energy. Moreover, after A sends to 
B, it may wait for an ACK from B, and C's transmission may corrupt the ACK packet. In 
summary, all immediate neighbors of both the sender and receiver should sleep after they 
hear the RTS or CTS until the current transmission is over, as indicated in Figure 5.11. Each 
node maintains a network allocation vector (NAV) to indicate the activity in its neighborhood. 
When a node receives a packet destined for other nodes, it updates its NAV using the duration 
field in the packet. 
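
The conclusion for the example of Figure 5.11 can be checked mechanically. The sketch 
below assumes that the six nodes form a chain in the order E, C, A, B, D, F, as the 
figure layout suggests, and that each node hears only its immediate chain neighbors; 
the function names are illustrative. 

```python
CHAIN = ["E", "C", "A", "B", "D", "F"]   # assumed left-to-right order in Figure 5.11


def neighbors(node: str, chain=CHAIN) -> set:
    """Immediate neighbors of a node on the chain."""
    i = chain.index(node)
    return {chain[j] for j in (i - 1, i + 1) if 0 <= j < len(chain)}


def nodes_to_sleep(sender: str, receiver: str, chain=CHAIN) -> set:
    """All immediate neighbors of the sender and the receiver, except the pair itself."""
    return (neighbors(sender, chain) | neighbors(receiver, chain)) - {sender, receiver}


sleepers = nodes_to_sleep("A", "B")   # C hears A's RTS, D hears B's CTS
```

For the transmission from A to B this yields nodes C and D, while E and F stay awake, 
in agreement with the discussion above. 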

A nonzero NAV value indicates that there is an active transmission in its neighborhood. 
The NAV value is decremented each time the NAV timer fires. Thus, a node should sleep 
to avoid overhearing if its NAV is not zero. It can wake up when its NAV becomes zero. We 
also note that in some cases overhearing is indeed desirable. Some algorithms may rely on 
overhearing to gather neighborhood information for network monitoring, reliable routing 
or distributed queries. If desired, S-MAC can be configured to allow application-specific 
overhearing to occur. However, it is suggested that algorithms that do not require overhearing 
may be a better match for energy-limited networks. For example, S-MAC uses explicit data 
acknowledgments rather than implicit ones [28]. 
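
The NAV-based sleep rule amounts to a simple countdown. The class below is a minimal 
sketch: the name Nav, the use of abstract timer ticks, and the choice to keep the 
larger of the current and the newly overheard duration are assumptions of this example. 

```python
class Nav:
    """Hypothetical network allocation vector counted in timer ticks."""

    def __init__(self) -> None:
        self.remaining = 0

    def overhear(self, duration_ticks: int) -> None:
        # Update from the duration field of a packet addressed to another node.
        # Keeping the maximum is an assumption; it avoids shortening a reservation.
        self.remaining = max(self.remaining, duration_ticks)

    def tick(self) -> None:
        # Called each time the NAV timer fires.
        self.remaining = max(0, self.remaining - 1)

    def should_sleep(self) -> bool:
        # A nonzero NAV means a transmission is active in the neighborhood.
        return self.remaining > 0


nav = Nav()
nav.overhear(5)        # e.g. an overheard RTS reserving five ticks
asleep_at_start = nav.should_sleep()
for _ in range(5):
    nav.tick()
awake_at_end = not nav.should_sleep()
```

An implementation that allows application-specific overhearing would simply bypass 
should_sleep() for the packets it wants to hear. 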

5.3.10 Message passing 

A message is the collection of meaningful, interrelated units of data. The receiver usually 
needs to obtain all the data units before it can perform in-network data processing or 
aggregation. The disadvantage of transmitting a long message as a single packet is the 
high cost of re-transmitting the long packet if only a few bits have been corrupted in the 
first transmission. However, if we fragment the long message into many independent small 
packets, we have to pay the penalty of large control overhead and longer delay. This is so 
because the RTS and CTS packets are used in contention for each independent packet. A 
possibility is to fragment the long message into many small fragments, and transmit them 
in a burst. Only one RTS and one CTS are used. They reserve the medium for transmitting 
all the fragments. Every time a data fragment is transmitted, the sender waits for an ACK 
from the receiver. If it fails to receive the ACK, it will extend the reserved transmission 
time for one more fragment, and re-transmit the current fragment immediately. As before, 
all packets have the duration field, which is now the time needed for transmitting all the 
remaining data fragments and ACK packets. If a neighboring node hears an RTS or CTS 
packet, it will go to sleep for the time that is needed to transmit all the fragments. Each data 
fragment or ACK also has the duration field. In this way, if a node wakes up or a new node 
joins in the middle of a transmission, it can properly go to sleep whether it is the neighbor 
of the sender or the receiver. If the sender extends the transmission time due to fragment 
losses or errors, the sleeping neighbors will not be aware of the extension immediately. 
However, they will learn it from the extended fragments or ACKs when they wake up. 
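
The burst transmission with extensions can be illustrated with a small simulation. It 
is a sketch under stated assumptions: the function name, the extension limit of three, 
and the exact content of the duration field (here, the time still needed after the 
current fragment, including its ACK) are choices made for the example rather than 
details given in the text. 

```python
def send_burst(num_fragments, frag_time, ack_time, ack_outcomes, max_extensions=3):
    """Simulate message passing for one message.

    ack_outcomes yields True when the ACK for an attempt arrives, False when it is lost.
    Returns (durations advertised per attempt, extensions used, delivered flag).
    """
    outcomes = iter(ack_outcomes)
    remaining, extensions, durations = num_fragments, 0, []
    while remaining > 0:
        # Duration field carried by this fragment (assumed definition, see lead-in).
        durations.append((remaining - 1) * (frag_time + ack_time) + ack_time)
        if next(outcomes, False):
            remaining -= 1            # fragment acknowledged, move to the next one
        elif extensions < max_extensions:
            extensions += 1           # extend the reservation and retransmit at once
        else:
            return durations, extensions, False   # give up after too many extensions
    return durations, extensions, True


durations, used, delivered = send_burst(3, frag_time=10, ack_time=2,
                                        ack_outcomes=[True, False, True, True])
```

In this run the second fragment is sent twice with the same advertised duration, so a 
neighbor that wakes up during the retransmission still learns how long to stay asleep. 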

The purpose of using ACK after each data fragment is to prevent the hidden terminal 
problem in the case that a neighboring node wakes up or a new node joins in the middle. 
If the node is only the neighbor of the receiver but not the sender, it will not hear the data 
fragments being sent by the sender. If the receiver does not send ACKs frequently, the 
new node may mistakenly infer from its carrier sense that the medium is clear. If it starts 
transmitting, the current transmission will be corrupted at the receiver. 

It is worth noting that IEEE 802.11 also has fragmentation support. In 802.11 the RTS 
and CTS only reserve the medium for the first data fragment and the first ACK. The first 
fragment and ACK then reserve the medium for the second fragment and ACK, and so forth. 
For each neighboring node, after it receives a fragment or an ACK, it knows that there is 
one more fragment to be sent. So it has to keep listening until all the fragments are sent. 
Again, for energy-constrained nodes, overhearing by all neighbors wastes a lot of energy. 

The 802.11 protocol is designed to promote fairness. If the sender fails to get an ACK 
for any fragment, it must give up the transmission and re-contend for the medium so that 
other nodes have a chance to transmit. This approach can cause a long delay if the receiver 
really needs the entire message to start processing. In contrast, message passing extends the 
transmission time and re-transmits the current fragment. It has less contention and a small 
latency. S-MAC sets a limit on how many extensions can be made for each message in case 
the receiver is really dead or the connection is lost during the transmission. However, for 
sensor networks, application-level performance is the goal as opposed to per-node fairness. 



5.3.10.1 Performance examples 

In Ye et al. [34], simulation results are obtained for a system with the following set 
of parameters: 

Radio bandwidth                 20 kb/s 
Channel coding                  Manchester 
Control packet length           10 bytes 
Data packet length              up to 250 bytes 
MAC header length               8 bytes 
Duty cycle                      1-99 % 
Duration of listen interval     115 ms 
Contention window for SYNC      15 slots 
Contention window for data      31 slots 

The modulation scheme is amplitude shift keying (ASK). The power consumptions 
of the radio in receiving, transmitting and sleep modes are 14.4 mW, 36 mW and 15 µW, 
respectively. The topology is a two-hop network with two sources and two sinks, as shown 
in Figure 5.12. Packets from source A flow through node C and end at sink D, while those 
from B also pass through C but end at E. The traffic load is changed by varying the 
inter-arrival period of messages. If the message inter-arrival period is 5 s, a message is 
generated every 5 s by each source node. In this experiment, the message inter-arrival 
period varies from 1 to 10 s. 

Figure 5.12 Two-hop network with two sources and two sinks. 

For the highest rate with a 1 s inter-arrival time, the wireless channel is nearly fully utilized 
due to its low bandwidth. For each traffic pattern, 10 independent tests are done when using 
different MAC protocols. In each test, each source periodically generates 10 messages, 
each of which is fragmented into 10 small data packets (40 bytes each). Thus, in each 
experiment, there are 200 data packets to be passed from their sources to their sinks. The 
energy consumption of the radio on each node to pass the fixed amount of data is measured. 
The actual time to finish the transmission is different for each MAC module. In the 
802.11-like MAC, the fragments of a message are sent in a burst, i.e. RTS and CTS are only 
used for the first fragment. 
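
The packet count quoted above follows directly from the experiment parameters, and the 
same numbers give a rough lower bound on the airtime. The calculation below is only a 
back-of-the-envelope sketch: it reads the radio bandwidth as 20 000 bit/s and ignores 
Manchester coding, headers, control packets and contention. 

```python
SOURCES = 2
MESSAGES_PER_SOURCE = 10
FRAGMENTS_PER_MESSAGE = 10
FRAGMENT_BYTES = 40
RADIO_BPS = 20_000           # assumed reading of the radio bandwidth entry

packets = SOURCES * MESSAGES_PER_SOURCE * FRAGMENTS_PER_MESSAGE   # 200 data packets
payload_bits = packets * FRAGMENT_BYTES * 8
airtime_per_hop = payload_bits / RADIO_BPS   # seconds of raw payload airtime per hop
airtime_two_hops = 2 * airtime_per_hop       # every packet crosses two hops
```

Even this optimistic estimate amounts to several seconds of channel time, which helps 
explain why the low-bandwidth channel is close to saturation at the highest message 
rate. 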

The 802.11-like MAC without fragmentation, which treats each fragment as an independent 
packet and uses RTS/CTS for each of them, is not measured, since it is obvious 
that this MAC consumes much more energy than the one with fragmentation. In S-MAC, 
message passing is used, and fragments of a message are always transmitted in a burst. In 
the S-MAC module with periodic sleep, each node is configured to operate at a 50 % duty 
cycle. 

Figure 5.13 shows the average energy consumption on the source nodes A and B. The 
traffic is heavy when the message inter-arrival time is less than 4 s. In this case, 802.11 MAC 
uses more than twice the energy used by S-MAC. Since idle listening rarely happens, energy 
savings from periodic sleeping are very limited. S-MAC achieves energy savings mainly by 
avoiding overhearing and efficiently transmitting long messages. When the message 
inter-arrival period is larger than 4 s, traffic load becomes light. In this case, the complete 
S-MAC protocol has the best energy performance, and far outperforms 802.11 MAC. Message 
passing with overhearing avoidance also performs better than 802.11 MAC. However, as 
shown in the figure, when idle listening dominates the total energy consumption, the periodic 
sleep plays a key role in energy savings. 

Compared with 802.11, message passing with overhearing avoidance saves almost the 
same amount of energy under all traffic conditions. This result is due to overhearing avoid- 
ance among neighboring nodes A, B and C. The number of packets sent by each of them is 
the same in all traffic conditions. 

Figure 5.13 Mean energy consumption on radios in each source node. (Reproduced by 
permission of IEEE [34].) 



5.4 MAC FOR AD HOC NETWORKS 

A key component in the development of single channel ad hoc wireless networks is the MAC 
protocol with which nodes share a common radio channel. Of necessity, such a protocol 
has to be distributed. It should provide an efficient use of the available bandwidth while 
satisfying the QoS requirements of both data and real-time applications. CSMA is one of the 
most pervasive MAC schemes in ad hoc wireless networks. CSMA is a simple distributed 
protocol whereby nodes regulate their packet transmission attempts based only on their 
local perception of the state, idle or busy, of the common radio channel. 

Packet collisions are intrinsic to CSMA. They occur because each node has only a 
delayed perception of the other nodes' activity. They also happen due to hidden nodes: 
two transmitting nodes outside the sensing range of each other may interfere at a common 
receiver. Many types of CSMA exist, but invariably the nodes that participate in a collision 
schedule the retransmission of their packets to a random time in the future, in the hope 
of avoiding another collision. This strategy, however, does not provide QoS guarantees for 
real-time traffic support. 

MAC schemes for ad hoc wireless networks have been proposed, aimed either at improving 
the throughput over that of CSMA or at providing QoS guarantees for real-time traffic 
support. Among the first group of schemes is the multiple access collision avoidance 
protocol (MACA) [35], which forms the basis of several other schemes. With MACA, a source 
with a packet ready for transmission first sends a request-to-send (RTS) minipacket, which 
if successful elicits a clear-to-send (CTS) minipacket from the destination. Upon reception 
of the CTS minipacket, the source sends its data packet. In environments without hidden 
nodes, MACA may improve the throughput of the network over that attained with CSMA 
because collisions involve only short RTS minipackets rather than normal data packets as 
in CSMA. MACA also alleviates the hidden node problem because the CTS sent by the 
destination serves to inhibit the nodes in its neighborhood, i.e. exactly those nodes that may 
interfere with the ensuing packet transmission from source to destination. The floor 
acquisition multiple access (FAMA) class of protocols [36] includes several variants of MACA, 
one of which is immune to hidden nodes [37]. These protocols, however, have not been 
designed for QoS: control minipackets are subject to collisions, and their retransmissions 
are randomly scheduled. 

The group allocation multiple access (GAMA) [38, 39] is an attempt to provide QoS 
guarantees to real-time traffic in a distributed wireless environment. In GAMA, there is a 
contention period where nodes use an RTS-CTS dialog to explicitly reserve bandwidth in 
the ensuing contention-free period. A packet transmitted in the contention-free period may 
maintain the reservation for the next cycle. The scheme is asynchronous and developed for 
wireless networks where all nodes can sense, and indeed receive, the communications from 
their peers. MACA/packet reservation (MACA/PR) [40] is a protocol similar to GAMA, 
but an acknowledgment follows every packet sent in contention-free periods to inform the 
nodes in the neighborhood of the receiver whether or not another packet is expected in the 
next contention-free cycle. These schemes deviate from pure carrier sensing methods in 
that every node has to construct channel-state information based on reservation requests 
carried in packets sent onto the channel. 

In this section, we elaborate on the black-burst (BB) contention mechanism presented 
in Sobrinho and Krishnakumar [41]. With this mechanism, real-time nodes contend for 
access to the common radio channel with pulses of energy, BBs, the lengths of which are 
proportional to the time that the nodes have been waiting for the channel to become idle. 
The scheme is distributed and is based only on carrier sensing. It gives priority access to 
real-time traffic and ensures collision-free transmission of real-time packets. When operated 
in an ad hoc wireless LAN, it further guarantees bounded real-time delays. In addition, the 
BB contention scheme can be overlaid on current CSMA implementations, notably that of 
the IEEE 802.11 standard for wireless LANs, with only minor modifications required to the 
real-time transceivers: the random retransmission scheme is turned off and, in its place, 
the possibility of sending BBs is provided. 



5.4.1 Carrier sense wireless networks 

Carrier sense wireless networks are designed in such a way that the distance from which a 
node can sense the carrier from a given transmitter is different and typically larger than the 
distance from which receivers are willing to accept a packet from that same transmitter. In 
addition, the carrier from a transmitter can usually be sensed at a range beyond the range at 
which the transmitter may cause interference. To account for these differences, a wireless 
network is modeled as a set of nodes N, interconnected by links of three different types. 
Node i has a communication link with node j, if and only if in the course of time, it has 
packets to send to node j. Node i has an interfering link with node j if and only if any 
packet transmission with destination j that overlaps in time at j with a transmission from 
i is lost. 

The lost packets are said to have collided with the transmission from i. Finally, node i 
has a sensing link with node j, if and only if a transmission by node i prevents node j from 
starting a new transmission, i.e. node i inhibits node j. The communication, interference 
and sensing graphs are denoted by $G_C = (N, L_C)$, $G_I = (N, L_I)$ and $G_S = (N, L_S)$, 
respectively, where $L_C$, $L_I$ and $L_S$ are the edge sets (the links). The communication 
graph is a directed graph, whereas the interfering and sensing graphs are undirected. We 
assume that if node i has a communication link with node j, then i and j also have an 
interfering link between them. Similarly, an interfering link is also a sensing link, but not 
conversely. That is, $L_I \subseteq L_S$: $G_I$ is a spanning subgraph of $G_S$. Any node has 
an interfering and sensing link with itself, since whenever a node transmits, it cannot 
simultaneously receive or start another transmission. As an example in the wireless network 
of Figure 5.14, node 9 has a communication link with node 10, and thus these nodes have both 
an interfering and a sensing link between them. Nodes 10 and 13 have an interfering link, and 
thus they also have a sensing link between them. Finally, nodes 9 and 13 have only a sensing 
link between them. The links from a node to itself are not explicitly represented. 

Figure 5.14 A wireless network without hidden nodes. The shaded nodes form the set 
$N_S(9)$. 

A path delay is associated with each sensing link to account for the propagation delay 
separating the nodes, the turn-around (round trip) time of the wireless transceivers, and 
the sensing delay. The path delay of link ij is denoted by $\tau_{ij}$. Since the sensing graph 
is undirected, $\tau_{ij} = \tau_{ji}$. The path delays further satisfy the two conditions 
$\tau_{ij} > 0$ and $\tau_{ik} + \tau_{kj} > \tau_{ij}$, for $ik, kj, ij \in L_S$. 

Let $\tau \triangleq \max(\tau_{ij})$. The sets $N_I(i)$ and $N_S(i)$ represent the nodes that 
are neighbors of i, i included, in the interfering and sensing graphs, respectively. In 
Figure 5.14, $N_I(10) = \{9, 10, 11, 12, 13\}$ and $N_S(9) = \{7, 8, 9, 10, 11, 12, 13\}$. For 
communication link ij, the set of nodes which are interfering neighbors of j but are not 
sensing neighbors of i, i.e. the set $N_I(j) \cap [N - N_S(i)]$, is the set of nodes hidden 
from ij. A node in this set will not sense an ongoing packet transmission from i to j and may 
initiate its own packet transmission that will collide at j. In a wireless network without 
hidden nodes, we have $N_I(j) \subseteq N_S(i)$ for every $ij \in L_C$. The network of 
Figure 5.14 does not have hidden nodes. Nevertheless, the common radio channel can be reused 
in space. For example, a packet transmission from node 9 to node 8 can coexist in time 
without collisions with a packet transmission from node 5 to node 7. We use the term 
'wireless LAN' for wireless networks in which $G_I = G_S$ forms a complete graph. In a 
wireless LAN, all nodes can sense each other's transmissions. 
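
The hidden-node set defined above is a plain set expression and can be evaluated for the 
example of Figure 5.14. In the sketch below the two neighbor sets are the ones quoted in 
the text; the assumption that the network consists of nodes 1 to 13, the function name 
and the final 'what if' variation are illustrative only. 

```python
def hidden_nodes(all_nodes: set, interfering_nbrs_of_j: set, sensing_nbrs_of_i: set) -> set:
    """Nodes that interfere at receiver j but cannot sense transmitter i."""
    return interfering_nbrs_of_j & (all_nodes - sensing_nbrs_of_i)


N = set(range(1, 14))                    # assumed node set of Figure 5.14
N_I_10 = {9, 10, 11, 12, 13}             # interfering neighbors of node 10 (from the text)
N_S_9 = {7, 8, 9, 10, 11, 12, 13}        # sensing neighbors of node 9 (from the text)

hidden_from_9_10 = hidden_nodes(N, N_I_10, N_S_9)

# Hypothetical variation: if node 13 could no longer sense node 9, it would be hidden.
hidden_if_13_unsensed = hidden_nodes(N, N_I_10, N_S_9 - {13})
```

For link 9 to 10 the result is the empty set, which matches the statement that the 
network of Figure 5.14 has no hidden nodes for this link. 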

The CSMA/CA protocol of the IEEE 802.11 standard defines three interframe spacings, 
$t_{short}$, $t_{med}$, $t_{med} > 2\tau + t_{short}$, and $t_{long}$, 
$t_{long} > 2\tau + t_{med}$. If a node with a packet that is ready for transmission has 
perceived that the channel is idle during a long interframe spacing of length $t_{long}$, 
the node immediately starts the transmission of the packet. Otherwise, it waits until that 
condition is satisfied and enters into backoff. Likewise, a node whose packet has experienced 
c consecutive collisions enters into backoff. In this mode, the node chooses a random number 
of slots s uniformly distributed between zero and $\min\{32 \times 2^{c} - 1, 255\}$ and sets 
a timer with an initial value of $s \times t_{slot}$ units of time, where $t_{slot}$, 
$t_{slot} > 2\tau$, is the length of a slot. The timer counts down only while the channel has 
been perceived idle for more than $t_{long}$ units of time - it is frozen during a medium 
busy condition - and the packet is (re)transmitted as soon as the timer reaches zero. 
A node learns of the success or failure of its transmission through a positive 
acknowledgment scheme; the recipient of a correctly received packet sends back an 
acknowledgment minipacket within an interval of time of length $t_{short}$. 
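
The backoff rule can be sketched as follows. The sketch reads the upper bound on the 
number of slots as min{32 x 2^c - 1, 255}, which is an assumption about the intended 
formula; the function names and the slot length used in the example are hypothetical. 

```python
import random


def max_backoff_slots(c: int) -> int:
    """Upper bound on the slot count after c consecutive collisions (assumed reading)."""
    return min(32 * 2 ** c - 1, 255)


def backoff_time(c: int, t_slot: float, rng=random) -> float:
    """Initial value of the backoff timer: a uniform slot count times the slot length."""
    s = rng.randint(0, max_backoff_slots(c))   # randint includes both end points
    return s * t_slot


bounds = [max_backoff_slots(c) for c in range(5)]   # window doubles, then saturates
timer = backoff_time(c=2, t_slot=20e-6)
```

The freezing of the timer while the medium is busy is not modeled here; only the choice 
of the initial timer value is shown. 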

BB contention is a MAC mechanism developed to provide QoS guarantees to real-time 
traffic over carrier sense wireless networks. The real-time applications considered are those 
like voice and video that require more or less periodic access to the common radio channel 
during long periods of time denominated sessions. The main performance requirement for 
these applications is bounded end-to-end delay, which implies a bounded packet delay 
at the MAC layer. This is the goal of BB contention. Real-time nodes contend for access 
to the channel after a medium interframe spacing of length $t_{med}$, rather than after the 
long interframe spacing of length $t_{long}$ used by data nodes. Thus, real-time nodes as a 
group have priority over data nodes. 

Instead of sending their packets when the channel becomes idle for $t_{med}$, real-time nodes 
first sort their access rights by jamming the channel with pulses of energy, denominated 
BBs. The length of a BB transmitted by a real-time node is an increasing function of the 
contention delay experienced by the node, measured from the instant when an attempt to 
access the channel has been scheduled until the channel becomes idle for $t_{med}$, i.e. until 
the node starts the transmission of its BB. To account for the path delays in the network, BBs 
are formed by an integral number of black slots, each of length $t_{bslot}$, with $t_{bslot}$ 
not smaller than the maximum round-trip path delay $2\tau$. Now, we would like the BBs sent by 
distinct real-time nodes when the channel becomes idle for $t_{med}$ to differ by at least one 
black slot. To this end, we assume that every real-time packet transmission lasts at least a 
certain time $t_{pkt}$ and that real-time nodes only schedule their next transmission 
attempts - to a time $t_{sch}$ in the future - when they start a packet transmission. If a 
node starts a packet transmission at time u and that transmission is successful, that means 
that no other real-time node started a packet transmission during an interval of length 
$2t_{pkt}$ around time u. Therefore, the next scheduled attempt made by the node in question 
is also staggered in time by $t_{pkt}$ from the scheduled access attempts made by the other 
nodes. Counting the number of black slots to be sent in a BB in units of $t_{pkt}$, we obtain 
the desired property that distinct nodes contend with BBs comprising different numbers of 
black slots. Following each BB transmission, a node senses the channel for an observation 
interval of length $t_{obs}$ to determine without ambiguity whether its BB was the longest of 
the contending BBs. The winning node will transmit its real-time packet successfully and 
schedule the next transmission attempt. On the other hand, the nodes that lost the BB 
contention wait for the channel to once again become idle for $t_{med}$, at which time they 
send new longer BBs. In conclusion, once the first real-time packet of a session is 
successfully transmitted, the mechanism ensures that succeeding real-time packets are also 
transmitted without collisions. In the end, real-time nodes appear to access a dynamic time 
division multiplexing (TDM) transmission structure without explicit slot assignments or slot 
synchronization. 

In the sequel, a detailed description of the access rules followed by every real-time node 
is presented. Every real-time packet lasts for at least a certain amount of time $t_{pkt}$, 
$t_{pkt} > 2\tau$, when transmitted on the channel. At the beginning of a session, a real-time 
node uses conventional CSMA/CA rules, possibly with a more expedited retransmission 
algorithm, to convey its first packet until it is successful. Subsequent packets are 
transmitted according to the mechanisms described below, until the session is dropped. 

Whenever a real-time node transmits a packet, it further schedules its next transmission 
attempt to a time $t_{sch}$ in the future, where $t_{sch}$ is the same for all nodes. Suppose, 
then, that a real-time node has scheduled an access attempt for the present time. If the 
channel has been idle during the past medium interframe interval of length $t_{med}$, the node 
starts the transmission of a BB. Otherwise, it waits until the channel becomes idle for 
$t_{med}$ and only 
then starts the transmission of its BB. The length b of the BB sent by the node is a direct 
function of the contention delay it incurred, d CO rA'- 



where fbsiot is the length of a black slot, the parameter r un i t is the unit of time used to convert 
contention delays into an integral number of black slots, and \_x\ is the floor of x, i.e. the 
largest integer not larger than x. Correct operation of the scheme requires that f un ; t < f p k t . 
After exhausting its BB transmission, the node waits for an observation interval f 0 b S , the 
length of which has to satisfy f Q b s < fbsiot an d fobs < fmed. to see if any other node transmitted 
a longer BB, implying that it would have been waiting longer for access to the channel. If 
the channel is perceived idle after f D b s , then the node (successfully) transmits its packet. On 
the other hand, if the channel is busy during the observation interval, the node waits again 
for the channel to be idle for r mea and repeats the algorithm. 

The start of packet transmissions from different nodes is shifted in time by at least $t_{pkt}$.
Since it is only when a node initiates the transmission of a packet that it schedules its next
transmission attempt to a time $t_{sch}$ in the future, the contention delays of different nodes
will likewise differ by at least $t_{pkt}$. Therefore, taking $t_{unit} < t_{pkt}$, the BBs of different nodes
differ by at least one black slot, and thus every BB contention period produces a unique
winner. That winner is the node that has been waiting the longest for access to the channel.
The observation interval $t_{obs}$ cannot last longer than the black slot time, i.e. $t_{obs} < t_{bslot}$, so
that a node always recognizes when its BB is shorter than that of another contending node.
It also has to be shorter than the medium interframe spacing, i.e. $t_{obs} < t_{med}$, to prevent
real-time nodes from sending BBs by the time that a real-time packet transmission is expected.
Overall, the BB contention scheme gives priority to real-time traffic, enforces a round-robin
discipline among real-time nodes, and results in bounded access delays to real-time packets.
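
The mapping from contention delay to BB length, and the resulting unique winner, can be
sketched in a few lines of Python. This is only an illustration: it assumes the BB length rule
$b = t_{bslot}(1 + \lfloor d_{cont}/t_{unit} \rfloor)$, and the function names and timing values
are hypothetical, not taken from any standard.

```python
import math


def bb_length(d_cont, t_unit, t_bslot):
    """BB length: t_bslot * (1 + floor(d_cont / t_unit)) (assumed rule)."""
    return t_bslot * (1 + math.floor(d_cont / t_unit))


def bb_winner(contention_delays, t_unit, t_bslot):
    """Index of the node whose BB is the longest, i.e. the longest-waiting node."""
    lengths = [bb_length(d, t_unit, t_bslot) for d in contention_delays]
    longest = max(lengths)
    winners = [i for i, b in enumerate(lengths) if b == longest]
    if len(winners) != 1:
        # Can only happen if two delays differ by less than t_unit.
        raise ValueError("tie between black bursts")
    return winners[0]


# Hypothetical timing values (microseconds); delays differ by more than t_unit.
T_UNIT, T_BSLOT = 200.0, 20.0
delays = [130.0, 610.0, 370.0]
print([bb_length(d, T_UNIT, T_BSLOT) for d in delays])
print("winner:", bb_winner(delays, T_UNIT, T_BSLOT))
```

Because the contention delays differ by at least $t_{pkt} > t_{unit}$, the floor operation maps them
to distinct slot counts, which is why the tie branch is never reached in correct operation.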

BB contention can also be used to support real-time sessions with different bandwidth
requirements, which might be useful for multimedia traffic. On the one hand, distinct
real-time sessions may have the corresponding nodes send packets of different sizes when they
acquire access rights to the channel. On the other hand, the BB mechanism can be enhanced
to accommodate real-time sessions with different scheduling intervals as long as the set
of values allowed for the scheduling interval $t_{sch}$ is finite and small. In the latter case,
BB contention proceeds in two phases. Real-time nodes first sort their access rights based
on contention delays as before. However, it is now possible for two nodes with different
scheduling intervals to compute BBs with the same number of black slots. Hence, after this
first phase, a real-time node contends again with a new BB, the length of which uniquely
identifies the scheduling interval being used by the node.
5.4.2 Interaction with upper layers

5.4.2.1 Operation with feedback

If a real-time node were alone in the network, two consecutive real-time packet transmissions
belonging to the same session would be separated in time by exactly $t_{acc}$,
$t_{acc} = t_{sch} + t_{bslot} + t_{obs}$. The access delays measure the deviation from this ideal situation.
Specifically, an access delay is the time that elapses from the moment an access attempt occurs
until the node is able to transmit the corresponding real-time packet, corrected for
$t_{bslot} + t_{obs}$. For $n \ge 2$, the $n$th access delay associated with a session is denoted by
$d^{(n)}$ and is given by $d^{(n)} = u^{(n)} - u^{(n-1)} - t_{acc}$, where $u^{(n)}$ is the instant of time
when the node started the transmission of its $n$th packet. Given the maximum length of data
packets, the rate of real-time sessions and the number of real-time nodes, the BB mechanism
guarantees that the access delays are bounded, usually by a very small value $d_{max}$.

When a node is the source node of a session, the contents of its real-time packets can
reflect the access delays incurred in contending for access to the channel. Typically, a
real-time application generates blocks of information bits at regular intervals of time, of length
much smaller than $t_{acc}$. The block delay is the time interval that elapses from the moment
an information block is made available by the application until it is successfully transmitted
at the MAC layer (corrected for $t_{bslot} + t_{obs}$ and neglecting processing delays). The relation
between access and block delays depends on how the application blocks of information are
packetized for transmission at the MAC layer. One possibility is to have the MAC layer
convey in a packet all the information blocks generated up to the instant when the node is
about to start a packet transmission. The length of a real-time packet would thus grow with
the access delay incurred by the node. The block delay of the oldest block conveyed in the
packet would consist of $t_{acc}$ plus the corresponding access delay: the block delay would
never exceed $t_{acc} + d_{max}$. In general, however, it is not feasible to assemble a packet at the
time that its transmission should start, and further, the MAC layer usually contains a single
buffer that we must ensure is filled with a packet by the time access to the channel is granted.

For a realistic alternative within the spirit of this section, consider a simplified
communication architecture in which a real-time application puts its generated blocks of information
into an application buffer. Whenever the node successfully transmits a packet it signals
the application, which will assemble the next packet with all the blocks of information
currently queued at the application buffer, plus the blocks that will be generated during
the next interaccess interval of length $t_{acc}$. At this later time, the packet is delivered to the
MAC layer for transmission. With this procedure, the MAC layer always has a packet ready
for transmission by the time it acquires undisputed access to the channel. When a node
transmits its $n$th packet at time $u^{(n)}$, it leaves in the application buffer the blocks of
information generated during the previous $d^{(n)}$ units of time; they will be part of the contents of the
$(n+1)$th packet. The latter packet further incurs an access delay of $d^{(n+1)}$ at the MAC layer.
Therefore, the block delay of the oldest block conveyed in the $(n+1)$th packet is not greater
than $d^{(n)} + t_{acc} + d^{(n+1)}$: the block delay during a session never exceeds $t_{acc} + 2d_{max}$.

5.4.2.2 Operation without feedback 

In the previous section, the contents of a real-time packet depended on the access delays
incurred by a node. There is a direct coupling between the MAC layer and the real-time
application. A simpler communication architecture may be desired in which already assembled
packets are passed onto the MAC layer for transmission one by one. This is also the
situation encountered when a node is simply relaying real-time packets arriving from a distant
source.

Suppose that real-time packets are presented to the MAC layer periodically, one every
$t_{rdy}$ units of time. The packet delay is the time that elapses from the moment a packet is
available for transmission until it is successfully transmitted at the MAC layer (corrected for
$t_{bslot} + t_{obs}$). The packet delay of the $n$th packet, $\tilde{d}^{(n)}$, is given by
$\tilde{d}^{(n)} = u^{(n)} - t^{(n)} - t_{bslot} - t_{obs}$, where $t^{(n)}$ is the instant of time when the
$n$th packet becomes ready for transmission, $t^{(n)} = t^{(1)} + (n - 1)\,t_{rdy}$.

Clearly, we should not choose $t_{sch} + t_{bslot} + t_{obs} = t_{rdy}$. If that choice were made, the
instants when the node accesses the channel would start drifting in relation to the arrival
times of new packets, and the node would not keep up with the packet arrival rate. Indeed,
the packet delay of the $n$th packet would be

$$\tilde{d}^{(n)} = \tilde{d}^{(1)} + \sum_{k=2}^{n} d^{(k)}$$

which would grow monotonically with the number of packets already transmitted.

Consider instead a preventive approach whereby a real-time node schedules its next
transmission attempt short of the inter-arrival time for packets $t_{rdy}$. Specifically, when a
real-time node transmits a packet, it schedules the next transmission attempt to time $t_{sch}$ in
the future, now with $t_{sch} = t_{rdy} - t_{bslot} - t_{obs} - \delta$, where $\delta > 0$ is called the slack time.
At a scheduled access attempt, a real-time node will only start contending for access to the
channel if a real-time packet is available for transmission. Otherwise, it waits for a ready
packet and only then starts to contend for access to the channel. The correctness of the BB
contention mechanism is preserved as long as the contention delays used to compute the
lengths of BBs are always counted from the scheduled access attempts up to the time when
the channel becomes idle for $t_{med}$.
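
The effect of the slack time can be illustrated with a deliberately simplified single-node model.
The sketch below is an assumption-laden toy, not the scheme itself: the list `extra` holds
hypothetical additional waits caused by other traffic, the first packet is assumed to contend as
soon as it arrives, and the function name and numbers are illustrative only.

```python
def packet_delays(extra, t_rdy, t_bslot, t_obs, slack):
    """Packet delays of one node in a simplified model.

    extra[n] is an assumed additional wait (caused by other traffic) before
    packet n goes out; slack = 0 reproduces the drifting case.
    """
    t_sch = t_rdy - t_bslot - t_obs - slack
    delays = []
    u_prev = None
    for n, x in enumerate(extra):
        ready = n * t_rdy                       # instant packet n becomes ready
        if u_prev is None:
            start = ready                       # first packet: contend on arrival
        else:
            start = max(u_prev + t_sch, ready)  # wait for a packet if none is ready
        u = start + t_bslot + t_obs + x         # instant the packet is transmitted
        delays.append(u - ready - t_bslot - t_obs)
        u_prev = u
    return delays


extra = [0.3, 0.1, 0.4, 0.0, 0.2, 0.1]          # hypothetical waits, in ms
print(packet_delays(extra, 20.0, 0.02, 0.01, slack=0.0))   # delays accumulate
print(packet_delays(extra, 20.0, 0.02, 0.01, slack=0.5))   # delays stay bounded
```

With zero slack every extra wait is carried over to all later packets, whereas a positive slack
lets the node absorb each wait before the next packet arrives.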
6

Teletraffic Modeling and Analysis



Traditional traffic models have been developed for wireline networks. These models predict 
the aggregate traffic going through telephone switches. Queueing theory is the tool which has 
been traditionally used in the analysis of such systems. A summary of the main results from 
the queueing theory is included in Appendix C (please go to www.wiley.com/go/glisic). 
These traditional models do not include subscriber mobility or callee distributions and 
therefore need modifications to be applicable for modeling the traffic in wireless networks. 



6.1 CHANNEL HOLDING TIME IN PCS NETWORKS

Channel holding (occupancy) time is an important quantity in teletraffic analysis of PCS
networks. It corresponds to service time in conventional queueing theory. This quantity is
needed to derive key network design parameters such as the new call blocking probability
and the handoff call blocking probability [1]. The cell residence time is a nonnegative
random variable, so a good distribution model for the random variable will be sufficient
for characterizing the users' mobility. In this section we use the hyper-Erlang distribution
model [2] for such purposes.

The hyper-Erlang distribution has the following probability density function and Laplace
transform:

$$f_{he}(t) = \sum_{i=1}^{M} \alpha_i \frac{(m_i \eta_i)^{m_i}\, t^{m_i - 1}}{(m_i - 1)!}\, e^{-m_i \eta_i t}, \quad t \ge 0$$

$$f_{he}^*(s) = \sum_{i=1}^{M} \alpha_i \left( \frac{m_i \eta_i}{s + m_i \eta_i} \right)^{m_i} \tag{6.1}$$
where $\alpha_i \ge 0$ and $\sum_{i=1}^{M} \alpha_i = 1$. $M, m_1, m_2, \ldots, m_M$ are nonnegative integers and
$\eta_1, \eta_2, \ldots, \eta_M$ are positive numbers. These distribution functions provide sufficiently general
models, i.e. hyper-Erlang distributions are universal approximations.

It can be shown [3] that for a given cumulative distribution function $G(t)$ of a nonnegative
random variable we can choose a sequence of distribution functions $G_m(t)$, each of which
corresponds to a mixture of Erlang distributions, so that $\lim_{m \to \infty} G_m(t) = G(t)$ for all $t$ at
which $G(t)$ is continuous. $G_m(t)$ can be chosen as

$$G_m(t) = \sum_{k=1}^{\infty} \left[ G\!\left(\frac{k}{m}\right) - G\!\left(\frac{k-1}{m}\right) \right] G_m^k(t), \quad t \ge 0 \tag{6.2}$$

where $G_m^k(t)$ is the distribution function of an Erlang distribution with mean $k/m$ and
variance $k/m^2$ (the distribution of the sum of $k$ exponential random variables each with
mean $1/m$). If $g_m(t)$ and $g_m^*(s)$ are the density function and Laplace transform of $G_m(t)$, and
$g_m^k(t)$ is the density function of $G_m^k(t)$, then we have



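$$g_m(t) = \sum_{k=1}^{\infty} \left[ G\!\left(\frac{k}{m}\right) - G\!\left(\frac{k-1}{m}\right) \right] g_m^k(t)$$

$$g_m^*(s) = \sum_{k=1}^{\infty} \left[ G\!\left(\frac{k}{m}\right) - G\!\left(\frac{k-1}{m}\right) \right] \left( \frac{m}{s + m} \right)^{k} \tag{6.3}$$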



where the asterisk is used to denote the Laplace transformation. 

The resulting distribution is called the 'mixed Erlang distribution'. Its coefficients can
be determined from the experimental data. If a finite number of terms is used to
approximate the distribution function, the resulting distribution approximates the hyper-Erlang
distribution.

To illustrate why the distribution $G_m(t)$ provides the universal approximation to general
distribution models, we show the Erlang distribution

$$f_e(t) = \frac{(m\eta)^m\, t^{m-1}}{(m-1)!}\, e^{-m\eta t}, \quad t \ge 0 \tag{6.4}$$

in Figure 6.1 by varying the shape parameter $m$ (Appendix C; please go to
www.wiley.com/go/glisic). We observe that, as the shape parameter $m$ becomes sufficiently large,
the density function $f_e(t)$ approaches the Dirac $\delta$ function (concentrated at the mean $1/\eta$).
From signal processing theory [4], we know that the $\delta$ function can be
used to sample a function and reconstruct the function from the sampled data (the sampling
theorem). We can replace the $\delta$ function by the Erlang density function with sufficiently large
$m$, and the resulting approximation is exactly in the form of the hyper-Erlang distribution.
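
The concentration of the Erlang density around its mean as $m$ grows can be checked numerically.
The following sketch evaluates the density of Equation (6.4) in the log domain and integrates it
over a small window around the mean $1/\eta$; the helper names, window width and step count are
illustrative choices only.

```python
import math


def erlang_pdf(t, m, eta):
    """Erlang density (m*eta)^m t^(m-1) exp(-m*eta*t) / (m-1)! with mean 1/eta."""
    if t <= 0.0:
        return 0.0
    rate = m * eta
    # Log-domain evaluation avoids overflow for large shape parameters.
    log_f = m * math.log(rate) + (m - 1) * math.log(t) - rate * t - math.lgamma(m)
    return math.exp(log_f)


def mass_near_mean(m, eta, half_width, steps=20000):
    """Probability mass within +/- half_width of the mean (trapezoidal rule)."""
    a = max(1.0 / eta - half_width, 0.0)
    b = 1.0 / eta + half_width
    h = (b - a) / steps
    total = 0.5 * (erlang_pdf(a, m, eta) + erlang_pdf(b, m, eta))
    total += sum(erlang_pdf(a + i * h, m, eta) for i in range(1, steps))
    return total * h


for m in (1, 10, 100, 1000):
    print(m, round(mass_near_mean(m, eta=1.0, half_width=0.1), 3))
```

The printed mass grows towards 1 as $m$ increases, which is the numerical counterpart of the
density approaching a $\delta$ function.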

If the cell residence time $t$ is modeled by the hyper-Erlang distribution as in
Equation (6.1), its $k$th moment is given as

$$E[t^k] = (-1)^k f_{he}^{*(k)}(0) = \sum_{i=1}^{M} \alpha_i \frac{(m_i + k - 1)!}{(m_i - 1)!}\, (m_i \eta_i)^{-k} \tag{6.5}$$



The parameters $\alpha_i$, $m_i$ and $\eta_i$ ($i = 1, 2, \ldots, M$) can be found by fitting a number of
moments from field data. Moreover, if the number of moments exceeds the number of variables,
then the least-square method can be used to find the best fit to minimize the least-square
error.

Figure 6.1 Probability density function for Erlang distribution.

Figure 6.2 The call holding and cell residence times.
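
As a small illustration of the moment formula, the sketch below evaluates
$E[t^k] = \sum_i \alpha_i (m_i + k - 1)! / [(m_i - 1)!\,(m_i \eta_i)^k]$ for a hyper-Erlang mixture.
The function name and the two-term parameter set are hypothetical; in a fitting procedure these
moments would be matched against moments estimated from field data.

```python
import math


def hyper_erlang_moment(k, alphas, shapes, etas):
    """k-th moment of a hyper-Erlang mixture with weights alphas."""
    if abs(sum(alphas) - 1.0) > 1e-9:
        raise ValueError("mixing weights must sum to 1")
    total = 0.0
    for a, m, eta in zip(alphas, shapes, etas):
        ratio = math.factorial(m + k - 1) / math.factorial(m - 1)
        total += a * ratio / (m * eta) ** k
    return total


# Hypothetical two-term mixture.
alphas, shapes, etas = [0.4, 0.6], [2, 5], [1.0, 0.5]
mean = hyper_erlang_moment(1, alphas, shapes, etas)
second = hyper_erlang_moment(2, alphas, shapes, etas)
print("mean:", mean, "variance:", second - mean ** 2)
```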

The channel holding time distribution depends on the mobility of users, which can be
characterized by the cell residence time [1-30, 5-31]. In the sequel we use the following
notation: $t_c$, call holding time (exponentially distributed with parameter $\mu$); $t_m$, cell residence
time; $r_1$, time between the instant a new call is initiated and the instant the new call moves
out of the cell if the new call is not completed; $r_m$ ($m > 1$), residual life of the
call holding time when the call finishes the $m$th handoff successfully; and $t_{nh}$ and $t_{hh}$, the
channel holding times for a new call and a handoff call, respectively. Then, from Figure 6.2,
the channel holding time for a new call will be

$$t_{nh} = \min\{t_c, r_1\} \tag{6.6}$$

and the channel holding time for a handoff call is

$$t_{hh} = \min\{r_m, t_m\} \tag{6.7}$$
Let $t_c$, $t_m$, $r_1$, $t_{hh}$ and $t_{nh}$ have density functions $f_c(t)$, $f(t)$, $f_r(t)$, $f_{hh}(t)$ and $f_{nh}(t)$ with
their corresponding Laplace transforms $f_c^*(s)$, $f^*(s)$, $f_r^*(s)$, $f_{hh}^*(s)$ and $f_{nh}^*(s)$,
respectively, and with cumulative distribution functions $F_c(t)$, $F(t)$, $F_r(t)$, $F_{hh}(t)$ and $F_{nh}(t)$,
respectively. From Equation (6.7) we obtain the probability

$$\begin{aligned}
\Pr(t_{hh} \le t) &= \Pr(r_m \le t \text{ or } t_m \le t) \\
&= \Pr(r_m \le t) + \Pr(t_m \le t) - \Pr(r_m \le t,\ t_m \le t) \\
&= \Pr(r_m \le t) + \Pr(t_m \le t) - \Pr(r_m \le t)\Pr(t_m \le t) \\
&= \Pr(t_c \le t) + \Pr(t_m \le t) - \Pr(t_c \le t)\Pr(t_m \le t)
\end{aligned} \tag{6.8}$$

which is based on the independence of $r_m$ and $t_m$, and on the memoryless property of the
exponential distribution, from which it follows that $r_m$ has the same distribution
as $t_c$. Differentiating Equation (6.8) gives

$$\begin{aligned}
f_{hh}(t) &= f_c(t) + f(t) - f_c(t)\Pr(t_m \le t) - \Pr(t_c \le t) f(t) \\
&= f_c(t) \int_t^{\infty} f(\tau)\,d\tau + f(t) \int_t^{\infty} f_c(\tau)\,d\tau
\end{aligned} \tag{6.9}$$

and

$$\begin{aligned}
f_{hh}^*(s) &= f_c^*(s) + f^*(s) - \int_0^{\infty} e^{-st} f(t) \int_0^{t} f_c(\tau)\,d\tau\,dt - \int_0^{\infty} e^{-st} f_c(t) \int_0^{t} f(\tau)\,d\tau\,dt \\
&= \frac{\mu}{s + \mu} + f^*(s) - \mu \int_0^{\infty} e^{-(s+\mu)t} \int_0^{t} f(\tau)\,d\tau\,dt - \int_0^{\infty} e^{-st}\left(1 - e^{-\mu t}\right) f(t)\,dt \\
&= \frac{\mu}{s + \mu} + \frac{s}{s + \mu}\, f^*(s + \mu)
\end{aligned} \tag{6.10}$$

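From the above equations, we obtain the expected handoff call channel holding time as

$$E[t_{hh}] = -f_{hh}^{*(1)}(0) = \frac{1}{\mu}\left[1 - f^*(\mu)\right] \tag{6.11}$$

where $f_{hh}^{*(1)}$ is the first derivative of the function $f_{hh}^*$. Starting from Equation (6.6) and using a
similar argument, we obtain

$$f_{nh}(t) = f_c(t) \int_t^{\infty} f_r(\tau)\,d\tau + f_r(t) \int_t^{\infty} f_c(\tau)\,d\tau$$

$$f_{nh}^*(s) = \frac{\mu}{s + \mu} + \frac{s}{s + \mu}\, f_r^*(s + \mu) \tag{6.12}$$

$$E[t_{nh}] = -f_{nh}^{*(1)}(0) = \frac{1}{\mu}\left[1 - f_r^*(\mu)\right]$$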
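
The closed form $E[t_{hh}] = [1 - f^*(\mu)]/\mu$ can be cross-checked by simulation. The sketch below
assumes, purely for illustration, an Erlang cell residence time with shape $m$ and mean $1/\eta$,
for which $f^*(s) = [m\eta/(s + m\eta)]^m$, and averages $\min\{t_c, t_m\}$ over random draws; the
function names and parameter values are hypothetical.

```python
import random


def expected_handoff_holding_time(mu, m, eta):
    """Closed form (1/mu) * [1 - f*(mu)] for an Erlang(m, mean 1/eta) residence time."""
    f_star_mu = (m * eta / (mu + m * eta)) ** m
    return (1.0 - f_star_mu) / mu


def simulate_handoff_holding_time(mu, m, eta, samples=200_000, seed=1):
    """Monte Carlo average of min(call holding time, cell residence time)."""
    rng = random.Random(seed)
    total = 0.0
    for _ in range(samples):
        t_c = rng.expovariate(mu)                    # exponential, mean 1/mu
        t_m = rng.gammavariate(m, 1.0 / (m * eta))   # Erlang, mean 1/eta
        total += min(t_c, t_m)
    return total / samples


print("closed form:", expected_handoff_holding_time(1.0, 3, 2.0))
print("simulation :", simulate_handoff_holding_time(1.0, 3, 2.0))
```

The two printed values should agree to within the Monte Carlo sampling error.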

We need to consider the channel holding time distribution for any call (either new call or
handoff call), i.e. the channel holding time for the merged traffic of new calls and handoff
calls. We will simply call this the channel holding time, using no modifiers such as new
call or handoff call. If $t_h$ is the channel holding time, $\lambda_h$ the handoff call arrival rate
and $\lambda$ the new call arrival rate, then $t_h = t_{nh}$ with probability $\lambda/(\lambda + \lambda_h)$ and $t_h = t_{hh}$ with
probability $\lambda_h/(\lambda + \lambda_h)$. Let $f_h(t)$ and $f_h^*(s)$ be its density function and the corresponding
Laplace transform. It is easy to obtain

$$f_h(t) = \frac{\lambda}{\lambda + \lambda_h} f_{nh}(t) + \frac{\lambda_h}{\lambda + \lambda_h} f_{hh}(t)$$

$$f_h^*(s) = \frac{\lambda}{\lambda + \lambda_h} f_{nh}^*(s) + \frac{\lambda_h}{\lambda + \lambda_h} f_{hh}^*(s) \tag{6.13}$$

When the residual lifetime $r_1$ of $t_1$ is exponentially distributed with parameter $\mu_r$, then
its Laplace transform $f_r^*(s)$ is $\mu_r/(s + \mu_r)$. Using this in Equation (6.12) results in

$$f_{nh}^*(s) = \frac{\mu}{s + \mu} + \frac{s\,\mu_r}{(s + \mu)(s + \mu + \mu_r)} = \frac{\mu + \mu_r}{s + \mu + \mu_r} \tag{6.14}$$

which implies that the new call channel holding time is exponentially distributed with
parameter $\mu + \mu_r$. Similarly, if the cell residence time $t_m$ is exponentially distributed
with parameter $\eta$, then the handoff call channel holding time is also exponentially
distributed with parameter $\mu + \eta$. In this case, the channel holding time is hyperexponentially
distributed. If $\mu_r = \eta$, then the channel holding time [see Equation (6.12)] is exponentially
distributed with parameter $\mu + \eta$. In fact, since $r_1$ is the residual life of $t_1$, from the Residual
Life Theorem [21], we have

$$f_r^*(s) = \frac{\eta\left[1 - f^*(s)\right]}{s} = \frac{\eta}{s + \eta} = f^*(s) \tag{6.15}$$

Hence, the channel holding time is exponentially distributed with parameter $\mu + \eta$ when
the cell residence time is exponentially distributed.

Simple results for the conditional distribution for channel holding time when the cell
residence time is generally distributed are presented next. Let $f_{cnh}(t)$, $f_{chh}(t)$ and $f_{ch}(t)$
denote the conditional density functions for the new call channel holding time, the handoff call
channel holding time and the channel holding time, respectively, with Laplace transforms
$f_{cnh}^*(s)$, $f_{chh}^*(s)$ and $f_{ch}^*(s)$, and with cumulative distribution functions $F_{cnh}(t)$, $F_{chh}(t)$
and $F_{ch}(t)$. Let us start with the conditional distribution for the handoff call channel holding
time. We have

$$F_{chh}(h) = \Pr(t_{hh} \le h \mid r_m \le t_m) = \frac{\Pr(r_m \le h,\ r_m \le t_m)}{\Pr(r_m \le t_m)}
= \frac{\int_0^h f_c(t) \int_t^{\infty} f(\tau)\,d\tau\,dt}{\Pr(r_m \le t_m)}
= \frac{\int_0^h f_c(t)\left[1 - F(t)\right] dt}{\Pr(r_m \le t_m)} \tag{6.16}$$

Differentiation of both sides gives the conditional density function

$$f_{chh}(h) = \frac{f_c(h)\left[1 - F(h)\right]}{\Pr(r_m \le t_m)} \tag{6.17}$$

with

$$\Pr(r_m \le t_m) = \int_0^{\infty} f(t) \int_0^{t} f_c(\tau)\,d\tau\,dt = \int_0^{\infty} f(t)\left[1 - e^{-\mu t}\right] dt
= 1 - \int_0^{\infty} f(t)\,e^{-\mu t}\,dt = 1 - f^*(\mu) \tag{6.18}$$
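Using this in Equation (6.16) results in

$$f_{chh}(h) = \frac{\left[1 - F(h)\right] \mu e^{-\mu h}}{1 - f^*(\mu)} \tag{6.19}$$

and

$$f_{chh}^*(s) = \frac{\mu \int_0^{\infty} e^{-(s+\mu)h} \left[1 - F(h)\right] dh}{1 - f^*(\mu)} = \frac{\mu}{s + \mu} \cdot \frac{1 - f^*(s + \mu)}{1 - f^*(\mu)} \tag{6.20}$$

Similarly, we obtain

$$f_{cnh}(h) = \frac{\left[1 - F_r(h)\right] \mu e^{-\mu h}}{1 - f_r^*(\mu)}, \qquad
f_{cnh}^*(s) = \frac{\mu}{s + \mu} \cdot \frac{1 - f_r^*(s + \mu)}{1 - f_r^*(\mu)} \tag{6.21}$$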



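The conditional channel holding time distribution $f_{ch}(t)$, $f_{ch}^*(s)$ is the weighted average of the
conditional new call channel holding time distribution and handoff call channel holding time
distribution. In summary, we therefore have

$$\begin{aligned}
f_{cnh}^*(s) &= \frac{\mu}{s + \mu} \cdot \frac{1 - f_r^*(s + \mu)}{1 - f_r^*(\mu)} \\
f_{chh}^*(s) &= \frac{\mu}{s + \mu} \cdot \frac{1 - f^*(s + \mu)}{1 - f^*(\mu)} \\
f_{ch}^*(s) &= \frac{\lambda}{\lambda + \lambda_h} \cdot \frac{\mu}{s + \mu} \cdot \frac{1 - f_r^*(s + \mu)}{1 - f_r^*(\mu)}
 + \frac{\lambda_h}{\lambda + \lambda_h} \cdot \frac{\mu}{s + \mu} \cdot \frac{1 - f^*(s + \mu)}{1 - f^*(\mu)}
\end{aligned} \tag{6.22}$$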



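If $T_{cnh}$, $T_{chh}$ and $T_{ch}$ are the expected conditional new call channel holding time, the expected
conditional handoff call channel holding time, and the expected conditional channel holding
time, respectively, then we have

$$\begin{aligned}
T_{cnh} &= \frac{1}{\mu} + \frac{f_r^{*(1)}(\mu)}{1 - f_r^*(\mu)} \\
T_{chh} &= \frac{1}{\mu} + \frac{f^{*(1)}(\mu)}{1 - f^*(\mu)} \\
T_{ch} &= \frac{1}{\mu} + \frac{\lambda}{\lambda + \lambda_h} \cdot \frac{f_r^{*(1)}(\mu)}{1 - f_r^*(\mu)}
 + \frac{\lambda_h}{\lambda + \lambda_h} \cdot \frac{f^{*(1)}(\mu)}{1 - f^*(\mu)}
\end{aligned} \tag{6.23}$$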



In order to be able to use these results, we need to find the handoff call arrival rate $\lambda_h$. This
parameter depends on the new call arrival rate, the new call blocking probability and the
handoff call blocking probability. If $p_{bn}$ and $p_{bh}$ are the new call and handoff call blocking
probabilities, respectively, and $H$ is the number of handoffs for a call (its expectation $E[H]$
is also called the handoff rate), then using a procedure similar to the one in Fang et al. [12]
gives

$$E[H] = -(1 - p_{bn}) \sum_{p \in \sigma_c} \operatorname{Res}_{s=p} \frac{f_r^*(s)\, f_c^*(-s)}{s\left[1 - (1 - p_{bh}) f^*(s)\right]} \tag{6.24}$$

where $\sigma_c$ denotes the set of poles of $f_c^*(-s)$ in the right half of the complex plane and
$\operatorname{Res}_{s=p}$ denotes the residue at the pole $s = p$. Since $t_c$ is exponentially distributed with
parameter $\mu$, and hence $f_c^*(s) = \mu/(s + \mu)$, from the above we obtain

$$E[H] = \frac{(1 - p_{bn})\, f_r^*(\mu)}{1 - (1 - p_{bh})\, f^*(\mu)} \tag{6.25}$$

Since each unblocked call initiates $E[H]$ handoff calls on average, the handoff call arrival
rate can be obtained:

$$\lambda_h = \lambda E[H] = \frac{(1 - p_{bn})\, \lambda f_r^*(\mu)}{1 - (1 - p_{bh})\, f^*(\mu)} \tag{6.26}$$

As long as $f_r^*(s)$ and $f^*(s)$ are proper rational functions, the Laplace transforms of the
distribution functions of all channel holding times (either conditional or unconditional) are
all rational functions. To find the corresponding density functions, we only need to find
the inverse Laplace transforms. This can be accomplished by using the partial fraction
expansion [32].
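
As a numerical illustration of the handoff arrival rate
$\lambda_h = \lambda (1 - p_{bn}) f_r^*(\mu) / [1 - (1 - p_{bh}) f^*(\mu)]$, the sketch below assumes an
Erlang cell residence time with shape $m$ and mean $1/\eta$ and takes the residual-life transform
as $f_r^*(s) = \eta[1 - f^*(s)]/s$. The function names and parameter values are hypothetical, and
the blocking probabilities are treated as given inputs rather than solved for.

```python
def erlang_lt(s, m, eta):
    """Laplace transform of an Erlang density with shape m and mean 1/eta."""
    return (m * eta / (s + m * eta)) ** m


def residual_lt(s, m, eta):
    """Residual-life transform eta * [1 - f*(s)] / s (mean residence time 1/eta)."""
    return eta * (1.0 - erlang_lt(s, m, eta)) / s


def handoff_arrival_rate(lam, mu, m, eta, p_bn, p_bh):
    """lambda_h = lam * (1 - p_bn) * f_r*(mu) / (1 - (1 - p_bh) * f*(mu))."""
    num = (1.0 - p_bn) * residual_lt(mu, m, eta)
    den = 1.0 - (1.0 - p_bh) * erlang_lt(mu, m, eta)
    return lam * num / den


# Exponential residence time (m = 1) without blocking gives lam * eta / mu.
print(handoff_arrival_rate(2.0, 1.0, 1, 3.0, 0.0, 0.0))
# Erlang residence time with some blocking (illustrative values).
print(handoff_arrival_rate(2.0, 1.0, 4, 3.0, 0.02, 0.01))
```

In a complete analysis the blocking probabilities themselves depend on $\lambda_h$, so this
expression would typically sit inside a fixed-point iteration.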

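As an illustration, suppose that $g(s)$ is a proper rational function with poles at $-p_1, -p_2, \ldots,
-p_k$ with multiplicities $n_1, n_2, \ldots, n_k$. Then $g(s)$ can be expanded as

$$g(s) = \sum_{i=1}^{k} \sum_{j=0}^{n_i - 1} \frac{A_{ij}}{(s + p_i)^{n_i - j}}$$

where the constants can be found easily by the formula

$$A_{ij} = \frac{1}{j!} \frac{d^j}{ds^j}\left[(s + p_i)^{n_i} g(s)\right]\Big|_{s = -p_i}, \quad j = 0, 1, \ldots, n_i - 1, \quad i = 1, 2, \ldots, k$$

The inverse Laplace transform $L^{-1}$ gives

$$L^{-1}\left[\frac{1}{(s + \beta)^h}\right] = \frac{t^{h-1} e^{-\beta t}}{(h - 1)!}$$

and

$$L^{-1}[g(s)] = \sum_{i=1}^{k} \sum_{j=0}^{n_i - 1} A_{ij}\, \frac{t^{n_i - j - 1}\, e^{-p_i t}}{(n_i - j - 1)!}$$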

At this stage it is useful to remember that the inverse Laplace transform of a rational
function is in fact the impulse response of a linear system whose transfer function is that
rational function [32], and the cumulative distribution
function is the step response of the linear system. In Matlab, the commands impulse and
step can be used to find the density function and the distribution function. When applying
the hyper-Erlang distribution model for cell residence time, we can in fact reduce the
computation further. Substituting $f^*(s)$ in Equation (6.10) with $f_{he}^*(s)$ given by Equation (6.1)
gives

$$f_{hh}^*(s) = \sum_{i=1}^{M} \alpha_i \left[ \frac{\mu}{s + \mu} + \frac{s}{s + \mu} \left( \frac{m_i \eta_i}{s + \mu + m_i \eta_i} \right)^{m_i} \right]
= \sum_{i=1}^{M} \alpha_i f_e^*(s; m_i, \eta_i) \tag{6.27}$$

where $f_e^*(s; m_i, \eta_i)$ corresponds to the handoff call channel holding time when the cell
residence time is Erlang distributed with parameters $(m_i, \eta_i)$. Thus, the problem reduces
to finding the algorithm for computing the channel holding time for the case when the cell
residence time is Erlang distributed.

Performance examples: if the cell residence time is Erlang distributed we have

$$f(t) = \frac{\beta^m t^{m-1} e^{-\beta t}}{(m - 1)!}, \qquad f^*(s) = \left[\frac{\beta}{s + \beta}\right]^m$$

where $\beta = m\eta$ is the scale parameter and $m$ is the shape parameter. The mean of this Erlang
distribution is $1/\eta$ and its variance is $1/(m\eta^2)$. When the mean $1/\eta$ is fixed, varying the value $m$
is equivalent to varying the variance: a larger $m$ means a smaller variance and less spread
of the cell residence time.

Figure 6.3 Probability density function of handoff call channel holding time (solid line) and
its exponential fitting (dashed line) when cell residence time is Erlang distributed
with parameter $(m, \eta)$.

The handoff call channel holding time probability density functions for different variances
of the cell residence time, distributed according to an Erlang distribution with the same mean,
are shown in Figure 6.3. It can be seen that, when the cell residence time becomes less
spread, the handoff call channel holding time shows severe mismatch to the exponential
distribution.
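
The mismatch can be quantified with a short numerical sketch. It assumes an Erlang cell
residence time with rate $\beta = m\eta$, uses the density
$f_{hh}(t) = f_c(t)[1 - F(t)] + f(t)[1 - F_c(t)]$ with exponential call holding time, and takes
as the 'exponential fitting' an exponential density with the same mean; that choice of fit, the
function names and the integration grid are assumptions made only for this illustration.

```python
import math


def erlang_density(t, m, beta):
    return beta ** m * t ** (m - 1) * math.exp(-beta * t) / math.factorial(m - 1)


def erlang_survival(t, m, beta):
    """1 - F(t) for an Erlang variable with shape m and rate beta."""
    return math.exp(-beta * t) * sum((beta * t) ** j / math.factorial(j) for j in range(m))


def handoff_holding_pdf(t, mu, m, eta):
    """f_hh(t) = exp(-mu t) * (mu * [1 - F(t)] + f(t))."""
    beta = m * eta
    return math.exp(-mu * t) * (mu * erlang_survival(t, m, beta) + erlang_density(t, m, beta))


def exponential_fit_pdf(t, mu, m, eta):
    """Exponential density with the same mean as the handoff holding time."""
    mean = (1.0 - (m * eta / (mu + m * eta)) ** m) / mu
    return math.exp(-t / mean) / mean


def l1_mismatch(mu, m, eta, t_max=10.0, steps=4000):
    """Integrated absolute difference between f_hh and its fit (midpoint rule)."""
    h = t_max / steps
    return h * sum(
        abs(handoff_holding_pdf((i + 0.5) * h, mu, m, eta)
            - exponential_fit_pdf((i + 0.5) * h, mu, m, eta))
        for i in range(steps)
    )


for m in (1, 2, 20):
    print(m, round(l1_mismatch(1.0, m, 1.0), 3))
```

For $m = 1$ the residence time is exponential and the mismatch vanishes; it grows as $m$
increases and the residence time becomes less spread.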

In the simple case when the cell residence time is hyper-Erlang distributed with two
terms,

$$f^*(s) = \alpha_1 \left( \frac{m_1 \eta}{s + m_1 \eta} \right)^{m_1} + \alpha_2 \left( \frac{m_2 \eta}{s + m_2 \eta} \right)^{m_2}$$

$$f_{hh}^*(s) = \alpha_1 \left\{ \frac{\mu}{s + \mu} + \frac{s}{s + \mu} \left( \frac{m_1 \eta}{s + \mu + m_1 \eta} \right)^{m_1} \right\}
+ \alpha_2 \left\{ \frac{\mu}{s + \mu} + \frac{s}{s + \mu} \left( \frac{m_2 \eta}{s + \mu + m_2 \eta} \right)^{m_2} \right\}$$

the results are shown in Figure 6.4. When $m_1$ and $m_2$ have different values, the variances
of cell residence time are different and the handoff call channel holding time is no longer
exponentially distributed.

Figure 6.4 Probability density function of handoff call channel holding time (solid line)
and its exponential fitting (dashed line) when cell residence time is hyper-Erlang
distributed with parameter $(m_1, m_2, \eta)$. (Horizontal axis: time.)
7

Adaptive Network Layer 



7.1 GRAPHS AND ROUTING PROTOCOLS 

The most important function of the network layer is routing. A tool used in the design and 
analysis of routing protocols is graph theory. Networks can be represented by graphs where 
mobile nodes are vertices and communication links are edges. Routing protocols often use 
shortest path algorithms. In this section we provide a simple review of the most important 
principles in the field which provides a background to study the routing algorithms. 

7.1.1 Elementary concepts

A graph $G(V, E)$ consists of two sets of objects: vertices (or nodes), set $V$, and edges, set $E$. A graph
is represented by dots or circles (vertices) interconnected by lines (edges). The magnitude
of graph $G$ is characterized by the number of vertices $|V|$ (called the order of $G$) and the number
of edges $|E|$ (called the size of $G$). The running times of algorithms are measured in terms of the order
and size.

7.1.2 Directed graph

An edge $e \in E$ of a directed graph is represented as an ordered pair $(u, v)$, where $u, v \in V$.
Here $u$ is the initial vertex and $v$ is the terminal vertex. Also assume here that $u \ne v$. An
example with

$V = \{1, 2, 3, 4, 5, 6\}$, $|V| = 6$

$E = \{(1, 2), (2, 3), (2, 4), (4, 1), (4, 2), (4, 5), (4, 6)\}$, $|E| = 7$

is shown in Figure 7.1.

Figure 7.1 Directed graph.

7.1.3 Undirected graph

An edge $e \in E$ of an undirected graph is represented as an unordered pair $(u, v) = (v, u)$,
where $u, v \in V$. Also assume that $u \ne v$. An example with

$V = \{1, 2, 3, 4, 5, 6\}$, $|V| = 6$

$E = \{(1, 2), (2, 3), (2, 4), (4, 1), (4, 5), (4, 6)\}$, $|E| = 6$

is shown in Figure 7.2.

7.1.4 Degree of a vertex

The degree of a vertex in an undirected graph is the number of edges incident on it. In a directed
graph, the out degree of a vertex is the number of edges leaving it and the in degree is the
number of edges entering it. In Figure 7.2 the degree of vertex 2 is 3. In Figure 7.1 the in
degree of vertex 2 is 2 and the in degree of vertex 4 is 1.
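
The two example graphs and the degree counts quoted above can be reproduced with a short
Python sketch. The edge lists are the ones given for Figures 7.1 and 7.2; the variable and
function names are illustrative.

```python
from collections import Counter

# Edge lists of the example graphs (Figure 7.1 directed, Figure 7.2 undirected).
DIRECTED_EDGES = [(1, 2), (2, 3), (2, 4), (4, 1), (4, 2), (4, 5), (4, 6)]
UNDIRECTED_EDGES = [(1, 2), (2, 3), (2, 4), (4, 1), (4, 5), (4, 6)]


def in_out_degrees(edges):
    """In and out degree of every vertex of a directed graph."""
    out_deg = Counter(u for u, _ in edges)
    in_deg = Counter(v for _, v in edges)
    return in_deg, out_deg


def degrees(edges):
    """Degree of every vertex of an undirected graph."""
    deg = Counter()
    for u, v in edges:
        deg[u] += 1
        deg[v] += 1
    return deg


in_deg, out_deg = in_out_degrees(DIRECTED_EDGES)
print("in degree of 2:", in_deg[2], "in degree of 4:", in_deg[4])
print("degree of 2 (undirected):", degrees(UNDIRECTED_EDGES)[2])
```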




Figure 7.2 Undirected graph. 
Figure 7.3 Weighted graphs.

7.1.5 Weighted graph

In a weighted graph each edge has an associated weight, usually given by a weight function
$w: E \to \mathbb{R}$. Weighted graphs from Figures 7.1 and 7.2 are shown in Figure 7.3. In the
analysis of the routing problems, these weights represent the cost of using the link. Most of
the time this cost would be the delay that a packet would experience if using that link.

7.1.6 Walks and paths

A walk is a sequence of nodes $(v_1, v_2, \ldots, v_L)$ such that $\{(v_1, v_2), (v_2, v_3), \ldots,
(v_{L-1}, v_L)\} \subseteq E$, e.g. (V2, V3, V6, V5, V3) in Figure 7.4. A simple path is a walk with no
repeated nodes, e.g. (V1, V4, V5, V6, V3). A cycle is a walk $(v_1, v_2, \ldots, v_L)$ where
$v_1 = v_L$ with no other nodes repeated and $L > 3$, e.g. (V1, V2, V3, V5, V4, V1). A graph
is called cyclic if it contains a cycle; otherwise it is called acyclic. A complete graph is an
undirected/directed graph in which every pair of vertices is adjacent. An example is given
in Figure 7.5. If $(u, v)$ is an edge in a graph $G$, we say that vertex $v$ is adjacent to vertex $u$.

Figure 7.4 Illustration of a walk.

Figure 7.5 Complete graphs: (a) [$V$ nodes and $V(V - 1)$ edges] 3 nodes and 3 × 2 edges;
(b) [$V$ nodes and $V(V - 1)/2$ edges] 4 nodes and 4 × 3/2 edges.

7.1.7 Connected graphs

An undirected graph is connected if you can get from any node to any other by following
a sequence of edges, i.e. any two nodes are connected by a path, as shown in Figure 7.6. A
directed graph is strongly connected if there is a directed path from any node to any other
node. A graph is sparse if $|E| \approx |V|$. A graph is dense if $|E| \approx |V|^2$.

A bipartite graph is an undirected graph $G = (V, E)$ in which $V$ can be partitioned into
two sets, $V_1$ and $V_2$, such that $(u, v) \in E$ implies either $u \in V_1$ and $v \in V_2$ or $v \in V_1$ and
$u \in V_2$, see Figure 7.7.

Figure 7.6 Connected graphs.

Figure 7.7 Bipartite graph.

7.1.8 Trees

Let $G = (V, E)$ be an undirected graph. The following statements are equivalent:

(1) G is a tree;

(2) any two vertices in G are connected by a unique simple path;

(3) G is connected, but if any edge is removed from E, the resulting graph is
disconnected;

(4) G is connected, and $|E| = |V| - 1$;

(5) G is acyclic, and $|E| = |V| - 1$;

(6) G is acyclic, but if any edge is added to E, the resulting graph contains a cycle.

For an illustration see Figure 7.8.
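
Characterization (4) gives a direct test for whether an undirected graph is a tree: check
connectivity with a breadth-first search and compare the number of edges with $|V| - 1$. The
sketch below is one possible way to code this; the function names are illustrative and the
edge list is the undirected example of Section 7.1.3.

```python
from collections import deque


def is_connected(vertices, edges):
    """True if every vertex can be reached from an arbitrary start vertex."""
    vertices = list(vertices)
    if not vertices:
        return True
    adj = {v: [] for v in vertices}
    for u, v in edges:
        adj[u].append(v)
        adj[v].append(u)
    seen = {vertices[0]}
    queue = deque([vertices[0]])
    while queue:
        u = queue.popleft()
        for v in adj[u]:
            if v not in seen:
                seen.add(v)
                queue.append(v)
    return len(seen) == len(vertices)


def is_tree(vertices, edges):
    """Statement (4): G is connected and |E| = |V| - 1."""
    vertices = list(vertices)
    return is_connected(vertices, edges) and len(edges) == len(vertices) - 1


V = range(1, 7)
E = [(1, 2), (2, 3), (2, 4), (4, 1), (4, 5), (4, 6)]
print(is_tree(V, E))                                  # contains the cycle 1-2-4
print(is_tree(V, [e for e in E if e != (4, 1)]))      # cycle broken
```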

7.1.9 Spanning tree

A tree (T) is said to span $G = (V, E)$ if $T = (V, E')$ and $E' \subseteq E$. For the graph shown in
Figure 7.4, two possible spanning trees are shown in Figure 7.9. Given a connected graph G
with real-valued edge weights $c_e$, a minimum spanning tree (MST) is a spanning tree of G
whose sum of edge weights is minimized.

Figure 7.8 Tree.

Figure 7.9 Spanning trees.
Figure 7.10 Minimum spanning tree.

7.1.10 MST computation

7.1.10.1 Prim's algorithm

Select an arbitrary node as the initial tree (T). Augment T in an iterative fashion by adding
the outgoing edge $(u, v)$ (i.e. $u \in T$ and $v \in G - T$) with minimum cost (i.e. weight).
The algorithm stops after $|V| - 1$ iterations. Computational complexity = $O(|V|^2)$. An
illustration of the algorithm is given in Figure 7.11.
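
A compact array-based version of Prim's algorithm, matching the $O(|V|^2)$ complexity quoted
above, could look as follows. This is a sketch under simple assumptions: nodes are numbered
0 to n - 1, the graph is given as a symmetric weight matrix with `math.inf` for missing links,
and the four-node example graph is hypothetical.

```python
import math


def prim_mst(weight):
    """Return the MST edges (u, v, w) of a connected graph given as a weight matrix."""
    n = len(weight)
    in_tree = [False] * n
    best = [math.inf] * n      # cheapest known edge from the tree to each node
    parent = [None] * n
    in_tree[0] = True          # arbitrary initial node
    for v in range(1, n):
        best[v], parent[v] = weight[0][v], 0
    tree = []
    for _ in range(n - 1):
        # Pick the outgoing edge of minimum weight.
        v = min((x for x in range(n) if not in_tree[x]), key=lambda x: best[x])
        if best[v] == math.inf:
            raise ValueError("graph is not connected")
        in_tree[v] = True
        tree.append((parent[v], v, best[v]))
        # Update the cheapest edges towards the nodes still outside the tree.
        for x in range(n):
            if not in_tree[x] and weight[v][x] < best[x]:
                best[x], parent[x] = weight[v][x], v
    return tree


INF = math.inf
W = [
    [INF, 1, 4, INF],
    [1, INF, 2, 5],
    [4, 2, INF, 3],
    [INF, 5, 3, INF],
]
mst = prim_mst(W)
print(mst, "total weight:", sum(w for _, _, w in mst))
```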



7.1.10.2 Kruskal's algorithm

Select the edge $e \in E$ of minimum weight and set $E' = \{e\}$. Continue to add the edge
$e \in E - E'$ of minimum weight that, when added to $E'$, does not form a cycle. Computational
complexity = $O(|E| \times \log|E|)$. An illustration of the algorithm is given in Figure 7.12.
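
Kruskal's algorithm is commonly implemented with a union-find (disjoint-set) structure to
detect cycles; the sorting step dominates and gives the $O(|E| \log |E|)$ complexity. The sketch
below uses that standard technique, which the text does not prescribe; the function name and the
small example graph are hypothetical.

```python
def kruskal_mst(n, edges):
    """MST of a graph with nodes 0..n-1; edges is a list of (weight, u, v)."""
    parent = list(range(n))

    def find(x):
        # Find the representative of x's fragment, compressing the path.
        while parent[x] != x:
            parent[x] = parent[parent[x]]
            x = parent[x]
        return x

    tree = []
    for w, u, v in sorted(edges):          # edges in order of increasing weight
        ru, rv = find(u), find(v)
        if ru != rv:                       # adding (u, v) does not form a cycle
            parent[ru] = rv
            tree.append((u, v, w))
    return tree


EDGES = [(1, 0, 1), (4, 0, 2), (2, 1, 2), (5, 1, 3), (3, 2, 3)]
mst = kruskal_mst(4, EDGES)
print(mst, "total weight:", sum(w for _, _, w in mst))
```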



7. 1 . 10.3 Distributed algorithms 

For these algorithms each node does not need complete knowledge of the topology. The MST 
is created in a distributed manner. The algorithm starts with one or more fragments consisting 
of single nodes. Each fragment selects its minimum weight outgoing edge and, using control 
messaging fragments, coordinates to merge with a neighboring fragment over its minimum 
weight outgoing edge. The algorithm can produce an MST in 0(\ V \x\ V |) time provided that 
the edge weights are unique. If these weights are not unique, the algorithm still works by 
using the nodes IDs to break ties between edges with equal weight.The algorithm requires 
0(| V|jclog|V|) + \E\) message overhead. An illustration of the distributed algorithm is 
given in Figure 7.13. 
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Figure 7.11 MST solution via Prim's algorithm. 
7.1 .1 1 Shortest path spanning tree 

The shortest path spanning tree (SPST), T, is a spanning tree rooted at a particular node such 
that the | V \ — 1 minimum weight paths from that node to each of the other network nodes 
are contained in T. An example of the shortest path spanning tree is shown in Figure 7.14. 
Note that the SPST is not the same as the MST. 

SPST trees are used for unicast (one to one) and multicast (one to several) routing.

Figure 7.12 MST solution via Kruskal's algorithm.



7.1.11.1 Shortest path algorithms 

Let us assume nonnegative edge weights. Given a weighted graph (G, W) and a node
s (source), a shortest path tree rooted at s is a tree T such that, for any other node $v \in G$,
the path between s and v in T is a shortest path between the nodes. Examples of the
algorithms that compute these shortest path trees are the Dijkstra and Bellman-Ford algorithms,
as well as algorithms that find the shortest path between all pairs of nodes, e.g. Floyd-Warshall.




Figure 7.13 Example of the distributed algorithm.

Figure 7.14 Examples of minimum spanning tree and shortest path spanning tree.

7.1.11.2 Dijkstra algorithm

For the source node s, the algorithm is described with the following steps:

V' = {s}; U = V - {s}; E' = ∅;
For v ∈ U do
    D_v = w(s, v);
    P_v = s;
EndFor
While U ≠ ∅ do
    Find v ∈ U such that D_v is minimal;
    V' = V' ∪ {v}; U = U - {v};
    E' = E' ∪ (P_v, v);
    For x ∈ U do
        If D_v + w(v, x) < D_x then
            D_x = D_v + w(v, x);
            P_x = v;
        EndIf
    EndFor
EndWhile

An example of the Dijkstra algorithm is given in Figure 7.15. It is assumed that V1 is s,
$D_v$ is the distance from node s to node v and $P_v$ is the predecessor of v on that path. If there
is no edge connecting two nodes x and y, then $w(x, y) = \infty$.

The algorithm terminates when all the nodes have been processed and their shortest
distance to node 1 has been computed. Note that the tree computed is not a minimum
weight spanning tree. An MST for the given graph is given in Figure 7.16.
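
The steps above translate fairly directly into Python. The sketch below is an illustration only: it replaces the linear search for the minimal $D_v$ with a priority queue, and the graph (a hypothetical six-node example, not the one in Figure 7.15), the function name and the data format are assumptions made here.

```python
import heapq
import math

def dijkstra(graph, s):
    """graph: dict node -> {neighbor: nonnegative weight}. Returns (dist, pred)."""
    dist = {v: math.inf for v in graph}   # D_v
    pred = {v: None for v in graph}       # P_v
    dist[s] = 0
    done = set()                          # V', the nodes already processed
    heap = [(0, s)]
    while heap:
        d, v = heapq.heappop(heap)
        if v in done:
            continue                      # stale queue entry
        done.add(v)
        for x, w in graph[v].items():
            if d + w < dist[x]:           # relaxation step D_v + w(v, x) < D_x
                dist[x] = d + w
                pred[x] = v
                heapq.heappush(heap, (dist[x], x))
    return dist, pred

# Hypothetical example graph with symmetric weights.
dj_graph = {
    1: {2: 7, 3: 9, 6: 14},
    2: {1: 7, 3: 10, 4: 15},
    3: {1: 9, 2: 10, 4: 11, 6: 2},
    4: {2: 15, 3: 11, 5: 6},
    5: {4: 6, 6: 9},
    6: {1: 14, 3: 2, 5: 9},
}
dj_dist, dj_pred = dijkstra(dj_graph, 1)
```

The edges (pred[v], v) collected for all $v \neq s$ form the shortest path spanning tree rooted at s.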

7.1.11.3 Bellman-Ford algorithm

Find the shortest walk from a source node s to an arbitrary destination node v subject to
the constraints that the walk consists of at most h hops and goes through node v only once.
The algorithm is described by the following steps (with w(v, v) = 0):

D_v^{-1} = ∞ ∀ v ∈ V;
D_s^0 = 0 and D_v^0 = ∞ ∀ v ≠ s, v ∈ V;
h = 0;
Until (D_v^h = D_v^{h-1} ∀ v ∈ V) or (h = |V|) do
    For v ∈ V do
        D_v^{h+1} = min{D_u^h + w(u, v)}, u ∈ V;
    EndFor
    h = h + 1;
EndUntil

An illustration of the Bellman-Ford algorithm is given in Figure 7.17(a).
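
A minimal Python sketch of the hop-by-hop iteration is shown below. It keeps a separate copy of the distance table for each iteration so that, after h passes, the table holds the shortest walks of at most h hops. The edge-list format, the function name and the example graph are assumptions made for illustration.

```python
import math

def bellman_ford(nodes, edges, s):
    """edges: list of directed (u, v, w) tuples. Returns dict node -> distance from s."""
    dist = {v: math.inf for v in nodes}
    dist[s] = 0
    for _ in range(len(nodes)):           # at most |V| iterations
        new = dict(dist)                  # D^{h+1}, built only from D^h
        for u, v, w in edges:
            if dist[u] + w < new[v]:
                new[v] = dist[u] + w
        if new == dist:                   # no distance changed: stop early
            break
        dist = new
    return dist

# Hypothetical example.
bf_nodes = ["s", "a", "b", "c"]
bf_edges = [("s", "a", 4), ("s", "b", 1), ("b", "a", 2), ("a", "c", 1), ("b", "c", 5)]
bf_dist = bellman_ford(bf_nodes, bf_edges, "s")
```

Unlike the Dijkstra sketch, this procedure does not need a priority queue and only ever combines a node's estimate with those of its neighbors, which is what makes a distributed version possible.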

7.1.11.4 Floyd-Warshall algorithm

The Floyd-Warshall algorithm finds the shortest path between all ordered pairs of nodes
(s, v), $s, v \in V$. Each iteration yields the path with the shortest weight between all pairs of
nodes under the constraint that only nodes {1, 2, ..., n}, $n \le |V|$, can be used as intermediary
nodes on the computed paths. The algorithm is defined by the following steps:

D = W; (W is the matrix representation of the edge weights)
For u = 1 to |V| do
    For s = 1 to |V| do
        For v = 1 to |V| do
            D_{s,v} = min{D_{s,v}, D_{s,u} + D_{u,v}}
        EndFor
    EndFor
EndFor

Figure 7.15 Example of Dijkstra algorithm.

Figure 7.16 An MST for the basic graph in Figure 7.14.

The algorithm completes in $O(|V|^3)$ time. An example of the Floyd-Warshall algorithm
is given in Figure 7.17(b) with D = W (W is the matrix representation of the edge
weights).
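
The triple loop can be written in Python as in the following sketch. The weight matrix is a hypothetical four-node example (it is not the graph of Figure 7.17), with `math.inf` standing for a missing edge and zeros on the diagonal.

```python
import math

def floyd_warshall(w):
    """w: square matrix of edge weights. Returns the matrix of shortest path weights."""
    n = len(w)
    d = [row[:] for row in w]             # D = W, copied so that w is not modified
    for u in range(n):                    # u: intermediate node now being allowed
        for s in range(n):
            for v in range(n):
                if d[s][u] + d[u][v] < d[s][v]:
                    d[s][v] = d[s][u] + d[u][v]
    return d

INF = math.inf
fw_w = [
    [0, 3, INF, 7],
    [8, 0, 2, INF],
    [5, INF, 0, 1],
    [2, INF, INF, 0],
]
fw_d = floyd_warshall(fw_w)
```

After the outer loop has processed node u, every entry of the matrix is the weight of the shortest path that uses only nodes up to u as intermediaries.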

7.1.11.5 Distributed asynchronous shortest path algorithms

In this case each node computes the path with the shortest weight to every network node. 
There is no centralized computation. The control messaging is also required for distributed 
computation, as for the distributed MST algorithm. Asynchronous means here that there is 
no requirement for inter-node synchronization for the computation performed at each node 
or for the exchange of messages between nodes. 

7.1.11.6 Distributed Dijkstra algorithm

There is no need to change the algorithm. Each node periodically floods a control message
throughout the network containing link state information. The transmission overhead is
$O(|V| \times |E|)$. The entire topology knowledge must be maintained at each node. Flooding of
the link state information allows for timely dissemination of the topology as perceived by
each node. Each node typically has sufficiently accurate information to be able to compute
the shortest paths.

7.1.11.7 Distributed Bellman-Ford algorithm 

Figure 7.17 (a) An illustration of the Bellman-Ford algorithm. (b) An example of the
Floyd-Warshall algorithm.

Assume G contains only cycles of nonnegative weight. If $(u, v) \in E$ then so is (v, u).
The update equation is

$$D_{s,v} = \min_{u \in N(s)} \{ w(s, u) + D_{u,v} \}, \quad \forall v \in V - \{s\}$$

where N(s) is the set of neighbors of s, i.e. $\forall u \in N(s)$, $(s, u) \in E$. Each node only needs to know
the weights of the edges that are incident to it, the identity of all the network nodes and
estimates (received from its neighbors) of the distances to all network nodes. The algorithm
includes the following steps:

• each node s transmits to its neighbors its current distance vector $D_{s,v}$;

• likewise, each neighbor node $u \in N(s)$ transmits to s its distance vector $D_{u,v}$;

• node s updates $D_{s,v}$, $\forall v \in V - \{s\}$, in accordance with the update equation above;

• if any update changes a distance value then s sends the current version of $D_{s,v}$ to its
neighbors;

• node s updates $D_{s,v}$ every time it receives distance vector information from any of its
neighbors;

• a periodic timer prompts node s to recompute $D_{s,v}$ or to transmit a copy of $D_{s,v}$ to each
of its neighbors.

An example of the distributed Bellman-Ford algorithm is given in Figure 7.18.
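
The steps listed above can be simulated in a few lines of Python. The sketch below is a simplification under stated assumptions: exchanges happen in synchronous rounds (the real algorithm is asynchronous), every node has at least one neighbor, and the function name, data format and three-node topology are hypothetical.

```python
import math

def distributed_bellman_ford(weights, max_rounds=100):
    """weights: dict node -> {neighbor: edge weight}, assumed symmetric.

    Returns dict node -> {destination: distance estimate}.
    """
    nodes = list(weights)
    # Initially each node only knows a zero distance to itself.
    vectors = {s: {v: (0 if v == s else math.inf) for v in nodes} for s in nodes}
    for _ in range(max_rounds):
        changed = False
        # Distance vectors as last transmitted by each node.
        received = {s: dict(vec) for s, vec in vectors.items()}
        for s in nodes:
            for v in nodes:
                if v == s:
                    continue
                # Update equation: minimum over the neighbors u of s.
                best = min(weights[s][u] + received[u][v] for u in weights[s])
                if best != vectors[s][v]:
                    vectors[s][v] = best
                    changed = True
        if not changed:       # no node would send a further update
            break
    return vectors

dbf_weights = {
    "a": {"b": 1, "c": 5},
    "b": {"a": 1, "c": 2},
    "c": {"a": 5, "b": 2},
}
dbf_vectors = distributed_bellman_ford(dbf_weights)
```

Note that each node only reads its own incident edge weights and the vectors received from its neighbors, as required by the description above.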

7.1.11.8 Distance vector protocols

With this protocol each node maintains a routing table with entries {Destination, Next Hop,
Distance (cost)}.

Nodes exchange routing table information with neighbors (a) whenever the table changes;
and (b) periodically. Upon reception of a routing table from a neighbor, a node updates its 
routing table if it finds a 'better' route. Entries in the routing table are deleted if they are too 
old, i.e. they are not 'refreshed' within a certain time interval by the reception of a routing 
table. 
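
The table maintenance rules above can be sketched as follows. The representation of an entry as a (next hop, distance, time of last refresh) tuple, the function names and the numeric values are assumptions made for illustration. The sketch also accepts any update coming from the current next hop, even a worse one, which is the behavior used in the link failure example that follows.

```python
def merge_update(table, neighbor, link_cost, advertised, now):
    """Merge a neighbor's advertised distances into a routing table.

    table: dict destination -> (next_hop, distance, last_refresh)
    advertised: dict destination -> distance as seen by the neighbor
    """
    for dest, dist in advertised.items():
        candidate = link_cost + dist
        current = table.get(dest)
        # Accept a new or better route, or any news from the current next hop.
        if current is None or candidate < current[1] or current[0] == neighbor:
            table[dest] = (neighbor, candidate, now)
    return table

def expire(table, now, max_age):
    """Delete entries that have not been refreshed within max_age time units."""
    for dest in [d for d, (_, _, t) in table.items() if now - t > max_age]:
        del table[dest]
    return table

dv_table = {}
merge_update(dv_table, "B", 1, {"C": 2, "D": 5}, now=0)
merge_update(dv_table, "E", 2, {"D": 1}, now=10)   # better route to D via E
expire(dv_table, now=40, max_age=30)               # the entry for C is too old
```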

7.1.11.9 Link failure 

A simple rerouting case is shown in Figure 7.19. 

• F detects that the link to G has failed;

• F sets a distance of $\infty$ to G and sends an update to A;

• A sets a distance of $\infty$ to G since it uses F to reach G;

• A receives a periodic update from C with a 2-hop path to G (via D);

• A sets its distance to G to 3 and sends an update to F;

• F decides it can reach G in four hops via A.

A routing loop case is shown in Figure 7.20.

• The link from A to E fails;

• A advertises a distance of $\infty$ to E;

• B and C had advertised a distance of 2 to E (prior to the link failure);

• upon reception of A's routing update, B decides it can reach E in three hops, and advertises
this to A;

• A decides it can reach E in four hops, and advertises this to C;

• C decides that it can reach E in five hops . . .

Figure 7.18 Distributed Bellman-Ford algorithm example.

Figure 7.19 Simple rerouting case.

Figure 7.20 Routing loop case.

This behavior is called count-to-infinity. This problem is further elaborated in Figure 7.21.
In the figure, routing updates with distance to A are shown. When the link from A to B fails,
B can no longer reach A directly, but C advertises a distance of 2 to A and thus B now 
believes it can reach A via C and advertises it. This continues until the distance to A reaches 
infinity. 
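
The counting behavior can be reproduced with a toy simulation of routers B and C on a line A, B, C after the link between A and B has failed. The cap of 16 used for 'infinity' and the strictly alternating update order are assumptions made here to keep the sketch short.

```python
INFINITY = 16  # assumed finite value standing in for 'unreachable'

def count_to_infinity(infinity=INFINITY):
    """Return the successive (distance at B, distance at C) estimates to node A."""
    dist_b = infinity      # B has just lost its direct link to A
    dist_c = 2             # C still holds the stale 2-hop route via B
    history = [(dist_b, dist_c)]
    while dist_b < infinity or dist_c < infinity:
        # B believes C's advertisement and routes to A via C.
        dist_b = min(infinity, dist_c + 1)
        # C routes via B, so it follows B's newly advertised distance.
        dist_c = min(infinity, dist_b + 1)
        history.append((dist_b, dist_c))
    return history

cti_history = count_to_infinity()
```

Each exchange raises the estimates by two hops, so the loop only ends once both routers reach the cap; the smaller the value chosen for infinity, the faster the convergence.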



Figure 7.21 Count-to-infinity problem.

Figure 7.22 Split horizon algorithm.

Figure 7.23 Example where split horizon fails.

7.1.11.10 Split horizon algorithm

Split horizon is used to avoid (though not always) the count-to-infinity problem. If A in
Figure 7.22 routes to C via B, then A tells B that its distance to C is $\infty$. As a result, B will
not route to C via A if the link B to C fails. This works for two-node loops, but not for loops
with more than two nodes. An example where the split horizon fails is shown in Figure 7.23.

When the link C to D breaks, C marks D as unreachable and reports that to A and B. Suppose
A learns it first. A now thinks the best path to D is through B. A reports D unreachable to B and
a route of cost 3 to C. C thinks D is reachable through A at cost 4 and reports that to B. B
reports a cost of 5 to A, which reports the new cost to C, etc.
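
The rule can be expressed as a filter applied when a node builds the advertisement for one particular neighbor. The sketch below follows the variant described above, in which the route is still advertised but with an infinite distance (often called split horizon with poisoned reverse). The table format and the function name are assumptions.

```python
import math

def advertisement_for(table, neighbor):
    """Build the distance vector that a node sends to one neighbor.

    table: dict destination -> (next_hop, distance)
    Routes whose next hop is that neighbor are reported as unreachable.
    """
    return {
        dest: (math.inf if next_hop == neighbor else dist)
        for dest, (next_hop, dist) in table.items()
    }

# Node A routes to C via B, and reaches D directly (hypothetical table).
table_a = {"C": ("B", 2), "D": ("D", 1)}
adv_to_b = advertisement_for(table_a, "B")
adv_to_d = advertisement_for(table_a, "D")
```

Because B never hears a finite distance to C from A, it cannot build the two-node loop; the three-node case described above is not prevented by this filter.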

7.1.11.11 Routing information protocol 

Routing information protocol (RIP) was originally distributed with BSD Unix and is widely
used on the Internet as an interior gateway protocol. RIP updates are exchanged in ordinary IP
datagrams and RIP sets infinity to 16 hops (cost $\in [0, 15]$). RIP updates neighbors every
30 s, or when routing tables change.



7.2 GRAPH THEORY 

The previous section summarized the basic relations and definitions in graph theory. A 
number of references that cover the graph theory in depth are available. The books [1-10] 
all discuss the various aspects of graph theory in general, not merely from a communication 
network point of view. Harary [7] has become a standard in this field. Buckley and Harary 
[3] provide a lot of material related to the hop distance and graph invariants defined in terms 
of distance properties, much of which may have applications in communications networks.
Harary et al. [8] provide a comprehensive discussion of connectivity in directed networks.
Capobianco and Molluzzo [4] present a number of illuminating examples of relations holding
among common graph invariants, such as node and edge connectivity, including
counterexamples which indicate the extent to which these results are best-possible.

Graph-theoretic algorithms are given good coverage in Tarjan [11] and in Even [12, 13],
but [14] is probably the best introduction to this subject. Additional references are [11-28].
Colbourn's book [18] is the only general text available at this time which covers network 
reliability. McHugh [22] includes a chapter on the implementation of graph algorithms for 
concurrent and parallel computation. Tenenbaum et al. [27, 28] discuss the implementation 
of some of these algorithms in the Pascal and C languages. See also References [19, 20] for 
more discussion of using data structures and graph algorithms in these computer languages. 
Christofides [17] discusses some important graph algorithms in much more detail, including 
the traveling salesman and Hamiltonian tour problems. A number of these works also treat 
some topics involved with flows in networks, a subject which we have not discussed so far 
but one which does have important applications to network connectivity, as discussed in 
Chapter 13 of this book (see also the article by Esfahanian and Hakimi [21]).

References [29-40, 125, 126] discuss the efficiency and complexity of computer algorithms
in general, not merely the graph-theoretic algorithms. Here the two books by Aho
et al. [29, 30] and Knuth's three-volume series [36-38] are the best general references to
the theory and practice of computer algorithms. Garey and Johnson [32] provide the best 
overall guide to NP-completeness and a compendium of many of those problems that were 
known to be NP-complete as of 1979. A problem is NP-hard if an algorithm for solving 
it can be translated into one for solving any other NP (nondeterministic polynomial time) 
problem. NP-hard therefore means 'at least as hard as any NP problem', although it might, 
in fact, be harder. There are now more than 1000 known NP-complete problems, many of 
them in graph theory, and dozens more are discovered every year, so this catalog has rapidly 
become out of date. As a result there is now an ongoing column on NP-complete problems 
by Johnson [35] which appears several times a year in the Journal of Algorithms. A number of
these columns have discussed NP completeness for problems in communication networks 
and reliability. Harel [33] gives a very good and exceptionally readable overall account of
the current state of the art in algorithmics and a good account of the problems that arise in 
designing and verifying algorithms for parallel processing. The book by Sedgewick [39] 
also comes in two other editions which give more details of the implementations of these 
algorithms in either Pascal or in C. 

Problems concerning the statistical dependence of the network component failures are 
treated in References [41-48]. One should note that the assumption of independent failures
can lead to either an overly pessimistic or an overly optimistic estimate of the true network 
reliability. The paper by Egeland and Huseby [41] gives some results as to how one might 
determine which of these is the case. 

Most of the probabilistic measures of network connectivity lead to computability prob- 
lems that are NP-hard, so there has been considerable effort in searching for restricted classes 
of networks for which there are reliability algorithms with a smaller order of complexity 
[49-59]. For example the papers by Boesch [52] and Pullen [58] consider only constant 
probability of edge failures. This may reduce the problem to one in graph enumeration, 
but this problem still has non-polynomial complexity. Similarly, the article by Bienstock 
[50] considers only planar networks, and he proves the existence of an algorithm whose 
complexity grows exponentially in the square root of p, rather than p itself. This is still
very far from having polynomial growth, and to obtain that complexity even more drastic
restrictions are necessary, as shown in the articles by Agrawal and Satyanarayana [49] and
by Politof and Satyanarayana [57]. Even very regular grid networks in the xy-plane yield 
NP-hard problems, as shown by the Clark and Colbourn article [54]. 

Owing to the computational intractability of the network reliability calculations for gen- 
eral probabilistic graphs, there are many papers devoted to the problem of obtaining bounds 
and approximations of the reliability [60-70]. For the same reason Monte Carlo simulations 
are also used in this field [71-77]. The NCF (node connectivity factor) and the LCF (link 
connectivity factor) were introduced in References [78-84] as possible alternatives to the 
usual reliability measure. They are indicators of how close the network is to being totally 
disconnected. Unfortunately, the NCF at least is difficult to compute and
does not seem to be amenable to simplifying techniques such as factorization or edge re- 
duction used by other methods. Thus it is not yet clear how useful a concept this will prove 
to be, although if these connectivity factors are available they can be used to identify the 
most vulnerable components of the network and to adapt the network so as to equalize the 
vulnerability over its components. References [85-98] are concerned with some aspects 
of graph connectivity other than the usual path-oriented one, primarily with those deriving 
from the notion of the diameter of a graph (i.e. the maximum node-to-node hop distance 
across the graph) or the average node-to-node hop distance. Of special interest here is the 
notion of leverage, as described in the papers of Bagga et al. [85], which is a general method
of quantifying changes in graph invariants due to the loss of some network components. 

References [99-111] are concerned with a number of other graph invariants
that have an obvious connection with the notions of vulnerability and survivability of com- 
munications networks. The main concepts here are those of dominance, independence and 
covering of a graph with respect to either a set of nodes or a set of edges of the underlying 
graph. These quantities have already been applied to problems involving networks used in 
scheduling and service facilities, although their applications and usefulness to communications
networks remain to be determined. Also, the calculation of some of these quantities
can be NP-hard (some in the deterministic sense, others from the probabilistic point of 
view). This is also an area of very active research. 



7.3 ROUTING WITH TOPOLOGY AGGREGATION 

The goal of QoS routing is to find a network path from a source node to a destination node 
which has sufficient resources to support the QoS requirements of a connection request. 
The execution time and the space requirement of a routing algorithm increase with the 
size of the network, which leads to the scalability problem. For very large networks, it is 
impractical to broadcast the whole topology to every node for the purpose of routing. In 
order to achieve scalable routing, large networks are structured hierarchically by grouping 
nodes into different domains [113, 114]. The internal topology of each domain is then 
aggregated to show only the cost of routing across the domain, that is, the cost of going 
from one border node (a node that connects to another domain) to another border node. 
This process is called topology aggregation. One typical way of storing the aggregated 
topology is for every node to keep detailed information about the domain that it belongs to, 
and aggregated information about the other domains.

Since the network after aggregation is represented by a simpler topology, most aggregation
algorithms suffer from distortion, that is, the cost of going through the aggregated
network deviates from the original value [115]. Nevertheless, Hao and Zegura [116] showed
that topology aggregation reduces the routing overhead by orders of magnitude and does
not always have a negative impact on routing performance. Several aggregation approaches
have been proposed. Lee [117] presented algorithms that find a minimum distortion-free
representation for an undirected network with either a single additive or a single bottleneck
parameter. Examples of additive metrics are delay and cost, while an example of a bottleneck
parameter is bandwidth. For an additive constraint, it may require $O(|B|^2)$ links to
represent a domain in the distortion-free aggregation, where $|B|$ is the number of border
nodes in the domain. Awerbuch and Shavitt [118] proposed an algorithm that aggregates
directed networks with a single additive parameter using $O(|B|)$ links. The algorithm
achieves bounded distortion with a worst-case distortion factor of $O(\sqrt{\rho} \log |B|)$, where
$\rho$ is the network asymmetry constant, defined as the maximum ratio between the QoS
parameters of a pair of opposite directed links.

In this section, we discuss networks with two QoS parameters, delay and bandwidth. 
Some related work can be found in References [119-121]. Lee [117] presented an aggregation
method that aggregates an undirected delay-bandwidth sensitive domain into a
spanning tree among border nodes. Therefore, there is a unique path between each pair 
of border nodes after aggregation and the space complexity is 0(\B\). The paper showed 
that a spanning tree can provide a distortion-free aggregation for bandwidth, but not for 
delay. Iwata et al. [120] studied the problem of topology aggregation in networks of six 
different QoS parameters. The aggregated topology follows the ATM PNNI standard [113]. 
The authors proposed minimizing the distortion by using a linear programming approach. 
Both References [117] and [118] assumed a certain precedence order among the parameters,
so that, among several paths that go between the same pair of border nodes, one path can
be selected as the 'best'. The state of a path in a delay-bandwidth sensitive network can be
represented as a delay-bandwidth pair [121]. If there are several paths going across a do- 
main, a single pair of values, which is a point on the delay-bandwidth plane, is not sufficient 
to capture the QoS parameters of all those paths [122]. 

Korkmaz and Krunz [121] were the first to use a curve on the delay-bandwidth plane to
approximate the properties of multiple physical paths between two border nodes, without 
assuming any precedence among the parameters. A curve is defined by three values: the 
minimum delay, the maximum bandwidth, and the smallest stretch factor among all paths 
between two border nodes. The stretch factor of a path measures how much the delay and the 
bandwidth of the path deviate from the best delay and the best bandwidth of all paths. The 
curve provides better approximation than a single point, but this approach has several short- 
comings. First, the paper did not provide a routing algorithm with polynomial complexity 
to find a feasible path based on the aggregated topology. Instead, it provided an algorithm to 
check if a given path is likely to be feasible. Essentially, the algorithm determined whether 
the point, defined by the delay/bandwidth requirement, is within the curve defined by the
delay, bandwidth and stretch factor of the path. Second, although the paper provided an 
aggressive heuristic to find the stretch factor of an interdomain path, there are cases where 
only one QoS metric will contribute to the value, and the information about the other metric 
is lost. In this section, we discuss a way of representing the aggregated state in delay- 
bandwidth sensitive networks using line segments. The approach solves some problems in 
Korkmaz and Krunz [121] and other traditional approaches by introducing a specific QoS
parameter representation, a specific aggregation algorithm and the corresponding routing
protocol. The algorithm outperforms others due to smaller distortion. 



7.4 NETWORK AND AGGREGATION MODELS 

A large network consists of a set of domains and links that connect the domains. It is 
modeled as a directed graph, where link state can be asymmetric in two opposite directions. 
Figures 7.24 and 7.25 are examples of a network with four domains. There are two kinds 
of nodes in a domain. A node is called a border node if it connects to a node of another 
domain. A node is an internal node if it is not a border node. A domain is modeled as a 
tuple (V, B, E), where V is the set of nodes in the domain, $B \subseteq V$ is the set of border nodes,
and E is the set of directed links among the nodes in V. The entire network is modeled as
(G, L), where $G = \{g_i \mid g_i = (V_i, B_i, E_i), 1 \le i \le \eta\}$ is the set of domains, L is a set of
links that connect border nodes of different domains, and $\eta$ is the number of domains in G.

There are several aggregation models for large networks. In this section, we use the 
topology aggregation model proposed by the private network-network interface (PNNI) 
[113, 123]. One of the representative topologies in PNNI is the star topology. Other popular 
ones are simple-node and mesh. In a simple-node topology, a domain is collapsed into one 
virtual node. This offers the greatest reduction of information as the space complexity after 
aggregation is $O(1)$, but the distortion is large. The mesh topology is a complete graph
among the border nodes. The complexity of this topology is $O(|B|^2)$ and its distortion is
much smaller. The star topology is a compromise between the above two. It has a space
complexity of $O(|B|)$ and the distortion is between those of a simple node and a mesh. Guo
and Matta [124] compare the performances of the above three aggregation methods.

Figure 7.24 Network example.

Figure 7.26 Topology aggregation. (a) Domain F. (b) Mesh of the borders. (c) Star
representation. (d) Star representation with bypasses.

Their comparison shows that the star topology outperforms the simple node and performs
slightly worse than the mesh in a uniform network. Let us consider the domain in Figure 7.26(a),
where nodes a, b, c and d are the border nodes. The mesh aggregation is shown in Figure 7.26(b),
and the star aggregation is shown in Figure 7.26(c). In a star topology, the border nodes
connect via links to a virtual nucleus. These links are called spokes. Each link is associated
with some QoS parameters. To make the representation more flexible, PNNI also allows a 
limited number of links connected directly between border nodes. These links are called 
bypasses. Figure 7.26(d) is an example of a star with bypasses. We call the links in an 
aggregated topology logical links since they are not real. 

After aggregation, a node in a domain sees all other nodes in the same domain, but only 
aggregated topologies of the other domains. For example, for the network in Figure 7.24, the 
aggregated view of the network stored at a node in domain A is shown in Figure 7.25. In such 
a view, the topology of domain A is exactly the same as the original one but the topologies of 
the other domains are now represented by border nodes, nuclei and spokes (without bypasses 
in this example). For a large network, this aggregated view is significantly smaller than the 
original topology and thus scalability is achieved. However, for the purpose of QoS routing, 
it is extremely important to develop solutions on how to represent the state information in 
this aggregated topology and how to control the information loss due to aggregation. 

7.4.1 Line segment representation 

In this section, we discuss a line-segment representation for the aggregated state information. 
Given the original topology and a (D, W) representing (delay, bandwidth) pair for each link, 
we shall first transform every domain to a mesh among the border nodes as an intermediate 
computation step, and then transform the mesh to a star with bypasses. The state information 
of a logical link in either the mesh or the final star is represented by line segments, which 
will be discussed in depth shortly. 

Figure 7.27 Multiple paths.

A mesh among the border nodes is a complete graph with logical links connecting each
pair of border nodes [see Figure 7.26(b)]. A logical link may represent multiple physical
paths. For example, in Figure 7.27, there are five paths going from border node a to border
node b. One possible path is a → 1 → 3 → b. The end-to-end QoS parameter of this path
is (4, 5). We can find the parameters of all other paths and they are (7, 9), (10, 5), (2, 3), and
(7, 7). The equivalent delay of a path is obtained as the sum of the link delays on the path, and
the equivalent bandwidth as the minimum of the link bandwidths on the path. When a
logical link is used to represent all these paths, the QoS parameter of the link should be the
'best' parameter among the paths. However, the 'best' QoS parameter may not be defined
since there does not exist an absolute order among those pairs. For example, parameter (2, 3)
is better than parameter (7, 7) in terms of delay, but not in terms of bandwidth. Fortunately,
a partial order can be developed.
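
The composition rule for a path (delays add, bandwidth is limited by the narrowest link) can be written as a small helper. The per-link values below are hypothetical and merely chosen so that the result equals the pair (4, 5) quoted above; the actual link parameters of Figure 7.27 are not reproduced here.

```python
def path_qos(links):
    """links: list of (delay, bandwidth) pairs, one per link along the path."""
    delay = sum(d for d, _ in links)        # additive metric
    bandwidth = min(w for _, w in links)    # bottleneck metric
    return (delay, bandwidth)

# Hypothetical three-link path.
example_links = [(1, 5), (2, 8), (1, 6)]
example_qos = path_qos(example_links)
```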

By definition a point (x, y) is more representative than a point (x', y') if 

• they are not the same, i.e. either $x \ne x'$ or $y \ne y'$; and

• $x \le x'$ and $y \ge y'$.

Since the QoS parameter is a pair of values that represents a point on the delay-bandwidth 
plane, we often use parameter, pair and point interchangeably. In the previous example, 
(7, 9) is more representative than (10, 5) since 7 < 10 and 9 > 5. 

Also, by definition, for a given set (S) of points on the delay-bandwidth plane, $(x, y) \in S$
is a representative of S if there does not exist any other point $(x', y') \in S$ which is more
representative than (x, y). This means that, for every other point $(x', y') \in S$, $x < x'$ or $y > y'$.
As an example let S be the set of the delay-bandwidth QoS pairs of the physical paths from
a to b in Figure 7.27 [112]:

S = {(4, 5), (7, 9), (10, 5), (2, 3), (7, 7)}

Here (2, 3) is a representative of S, since its delay is less than that of all other points in S. The
other representatives are (4, 5) and (7, 9). All QoS points in S on a delay-bandwidth plane
are plotted in Figure 7.28. The shaded area defines the region of supported services, that is,
any QoS request that falls in that region can be supported by a physical path. The dotted line
is a staircase rising from left to right. The representatives are points on the convex corners
of the steps. The corresponding paths of the representatives are called nondominated paths
or Pareto optimal paths in the literature. The size of S depends on how many physical
paths there are between the pair of border nodes, which can be exponential. The number of
representative points is $|E|$ in the worst case for a domain (V, B, E). That is because there
are at most $|E|$ possible bandwidth values for the paths in the domain and thus the staircase
can have at most $|E|$ convex corners.

Figure 7.28 Representatives from example. (Reproduced by permission of IEEE [112].)
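
The two definitions can be checked on the example set S with a few lines of Python. The function names are chosen here for illustration; the quadratic scan is the simplest possible filter and is not meant as an efficient algorithm.

```python
def more_representative(p, q):
    """True if point p = (delay, bandwidth) is more representative than q."""
    return p != q and p[0] <= q[0] and p[1] >= q[1]

def representatives(points):
    """Return the points of the set that no other point is more representative than."""
    unique = set(points)
    return sorted(
        p for p in unique
        if not any(more_representative(q, p) for q in unique)
    )

S = [(4, 5), (7, 9), (10, 5), (2, 3), (7, 7)]
reps = representatives(S)
```

For this set the filter discards (10, 5) and (7, 7), both of which are dominated by (7, 9).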

For the purpose of scalability, it is desirable to reduce the memory space for storing the 
QoS parameter of a logical link to $O(1)$. Hence, we shall neither store all QoS points in S
nor store all representative points. The traditional way to solve this problem is to keep only 
one QoS point per logical link. However, no matter which point we pick, much information 
is lost. 

A possible solution for this problem is to use a line segment that approximates the staircase,
e.g. $L_1$ or $L_2$ (Figure 7.28). Since every line segment can be defined unambiguously
by two endpoints, the space needed for a logical link is $O(1)$. The line segment representation
strikes a tradeoff between the accuracy in approximating the staircase and the overhead
incurred.

After a line segment is chosen, all connection requests that fall under the line segment
are accepted. However, it should be pointed out that not all requests under the line segment
can actually be supported. For example, in Figure 7.28, if $L_1$ is selected to approximate the
staircase, then the unshaded areas below $L_1$ represent connection requests that are accepted
but not supported by any physical path. When a request is accepted but not supported, the
routing process will detect it, and the request will eventually be rejected if a feasible path
cannot be found (see the next section). On the other hand, when we reject the connection
requests that are above the line segment, we may reject supported QoS requests. For example, if
a connection request is in the shaded region above $L_1$ in Figure 7.28, it is rejected although
it can be served. Therefore, the choice of line segment depends on the strictness of the
desired quality of service. For instance, in Figure 7.28, both $L_1$ and $L_2$ are possible
line segments. Using $L_2$ would probably reject more supported connection requests than
$L_1$, while using $L_1$ would accept more unsupported requests than $L_2$. The least-squares
method is used to find a line segment, which takes linear time with respect to the number of
representative points. This line segment minimizes the least-squares error, i.e. the sum
of the squares of the distances from the points to the line.
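
A possible realization of the fit is sketched below. It is an illustration under assumptions that the text does not fix: the error is measured vertically (in bandwidth), and the segment is cut at the smallest and largest delay among the representative points. Other conventions would give somewhat different endpoints.

```python
def fit_segment(points):
    """Least-squares line through (delay, bandwidth) points, returned as
    (lower endpoint, upper endpoint). Needs at least two distinct delays."""
    n = len(points)
    mean_d = sum(d for d, _ in points) / n
    mean_w = sum(w for _, w in points) / n
    sxx = sum((d - mean_d) ** 2 for d, _ in points)
    sxy = sum((d - mean_d) * (w - mean_w) for d, w in points)
    slope = sxy / sxx
    intercept = mean_w - slope * mean_d
    # Assumed convention: clip the line at the extreme delays of the input.
    d_lo = min(d for d, _ in points)
    d_hi = max(d for d, _ in points)
    return ((d_lo, slope * d_lo + intercept), (d_hi, slope * d_hi + intercept))

# Representative points of the example set S.
seg_lower, seg_upper = fit_segment([(2, 3), (4, 5), (7, 9)])
```

The computation is a single pass over the points for each sum, which is consistent with the linear running time mentioned above.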

It was mentioned earlier that any line segment can be defined by two endpoints. Owing
to the nature of the staircase, the line always has a nonnegative slope. The endpoint with
smaller bandwidth is called the lower endpoint, while the other one is called the upper
endpoint. So a line segment is denoted as [lower endpoint, upper endpoint]. For example,
$L_2$ in Figure 7.28 is [(2, 1.5), (9, 9)]. We further denote the lower endpoint and the upper
endpoint of a line segment l as l.lp and l.up, respectively. The delay of a point p is p.d
and the bandwidth of a point p is p.w. Therefore, $L_2.lp.d$ is 2 and $L_2.up.w$ is 9. Two
operations for line segment parameters are defined, the joint '+' and the
disjoint '-' operation, defined as



$$[(a, b), (c, d)] + [(a', b'), (c', d')] = [(a + a', \min(b, b')), (c + c', \min(d, d'))] \tag{7.1}$$

$$[(a, b), (c, d)] - [(a', b'), (c', d')] = [(a - a', \min(b, b')), (c - c', \min(d, d'))] \tag{7.2}$$

In other words, summing two segments results in a segment whose delay is the sum of 
the delays of the individual segments and whose bandwidth equals the minimum of the two 
bandwidths. If $l_1$, $l_2$ and $l_3$ are three line segments and they satisfy the following 
conditions: 

$$l_1.lp.d \ge l_2.lp.d \text{ and } l_1.up.d \ge l_2.up.d$$
$$l_1.lp.w \le l_2.lp.w \text{ and } l_1.up.w \le l_2.up.w \tag{7.3}$$
$$l_1 - l_2 = l_3$$

then 

$$l_1 = l_2 + l_3 \tag{7.4}$$

As an example, let $l_1 = [(10, 4), (13, 7)]$ and $l_2 = [(6, 5), (11, 7)]$. One can see that 
for this example $l_1.lp.d \ge l_2.lp.d$, $l_1.up.d \ge l_2.up.d$, $l_1.lp.w \le l_2.lp.w$ and 
$l_1.up.w \le l_2.up.w$, and we have 

$$l_1 - l_2 = [(10, 4), (13, 7)] - [(6, 5), (11, 7)] = [(10 - 6, \min(4, 5)), (13 - 11, \min(7, 7))] = [(4, 4), (2, 7)] = l_3 \tag{7.5}$$

$$l_2 + l_3 = [(6, 5), (11, 7)] + [(4, 4), (2, 7)] = [(6 + 4, \min(5, 4)), (11 + 2, \min(7, 7))] = [(10, 4), (13, 7)] = l_1 \tag{7.6}$$
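
The joint and disjoint operations of Equations (7.1) and (7.2) translate directly into code. The sketch below is illustrative only; the names `joint`, `disjoint` and the tuple layout `((delay, bandwidth), (delay, bandwidth))` are assumptions made for this example.

```python
from typing import Tuple

Point = Tuple[float, float]      # (delay, bandwidth)
Segment = Tuple[Point, Point]    # (lower endpoint, upper endpoint)


def joint(s: Segment, t: Segment) -> Segment:
    """Equation (7.1): delays add, bandwidths take the minimum."""
    (a, b), (c, d) = s
    (a2, b2), (c2, d2) = t
    return ((a + a2, min(b, b2)), (c + c2, min(d, d2)))


def disjoint(s: Segment, t: Segment) -> Segment:
    """Equation (7.2): delays subtract, bandwidths take the minimum."""
    (a, b), (c, d) = s
    (a2, b2), (c2, d2) = t
    return ((a - a2, min(b, b2)), (c - c2, min(d, d2)))


# Worked example of Equations (7.5) and (7.6)
l1 = ((10, 4), (13, 7))
l2 = ((6, 5), (11, 7))
l3 = disjoint(l1, l2)          # ((4, 4), (2, 7))
recovered = joint(l2, l3)      # ((10, 4), (13, 7)), i.e. l1 again
```

The round trip recovers $l_1$ only because the conditions in Equation (7.3) hold for this pair; for arbitrary segments the minimum in the bandwidth component can lose information.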

7.4.2 QoS-aware topology aggregation 

The topology aggregation algorithm consists of two phases: (1) find a line segment for 
each logical link in the mesh (complete graph) of the border nodes; and (2) construct a star 
topology with bypasses from the mesh. 

7.4.3 Mesh formation 

In this phase, representatives of the paths between each pair of border nodes are found. 
The representatives for each pair of border nodes can be obtained by running Dijkstra's 
algorithm for every link bandwidth value. The links in the domain are first sorted by 
descending bandwidth. Let the largest bandwidth be $w_{\max}$. The algorithm starts by finding 
the smallest delay $d^{w}_{\max}$ in a graph that consists only of links of bandwidth 
$w_{\max}$. The pair $(d^{w}_{\max}, w_{\max})$ is a representative that has the largest 
bandwidth. The algorithm then inserts the links of the next largest bandwidth value ($w'$) and 
finds the smallest delay $d'$. If $d' < d^{w}_{\max}$, $(d', w')$ is another representative. 
The process ends after the smallest delays for all bandwidth values are identified. The 
detailed algorithm for finding the representatives can be found in References [125] and [126]. 
After the representatives are found, linear regression is used to find the line segment for 
each pair of border nodes. 
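
The search for representatives can be sketched as follows. This is not the algorithm of References [125] and [126], only a simple illustration under stated assumptions: links are undirected tuples `(u, v, delay, bandwidth)`, the graph is rebuilt for each bandwidth value instead of being extended incrementally, and a new point is kept whenever its delay improves on the smallest delay found so far.

```python
import heapq
from typing import List, Tuple

Link = Tuple[str, str, float, float]  # (u, v, delay, bandwidth), assumed undirected


def least_delay(links: List[Link], src: str, dst: str) -> float:
    """Dijkstra's algorithm on delay; returns inf if dst is unreachable."""
    adj = {}
    for u, v, delay, _ in links:
        adj.setdefault(u, []).append((v, delay))
        adj.setdefault(v, []).append((u, delay))
    dist = {src: 0.0}
    heap = [(0.0, src)]
    while heap:
        d, u = heapq.heappop(heap)
        if u == dst:
            return d
        if d > dist.get(u, float("inf")):
            continue  # stale heap entry
        for v, delay in adj.get(u, []):
            nd = d + delay
            if nd < dist.get(v, float("inf")):
                dist[v] = nd
                heapq.heappush(heap, (nd, v))
    return float("inf")


def representatives(links: List[Link], src: str, dst: str) -> List[Tuple[float, float]]:
    """Return (delay, bandwidth) representatives, largest bandwidth first."""
    reps = []
    best = float("inf")
    for w in sorted({link[3] for link in links}, reverse=True):
        usable = [link for link in links if link[3] >= w]
        d = least_delay(usable, src, dst)
        if d < best:  # a smaller delay became available at bandwidth w
            reps.append((d, w))
            best = d
    return reps
```

For example, with a direct link A-B of delay 10 and bandwidth 5 and a two-hop detour of total delay 5 and bandwidth 3, the representatives are (10, 5) and (5, 3).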
7.4.4 Star formation 

The next step is to aggregate the mesh into a star topology with bypasses. According to the 
PNNI recommendation, a default parameter value can be set for the spokes. There can be one 
or more spokes with the default parameter value, but at most $3|B| - 1$ links can have other 
values. As it is possible that all links in our aggregation have different values, we shall put 
at most $3|B|$ links in the domain topology after aggregation. 

Let $i$ and $j$ be two border nodes and $n$ be the nucleus in the star representation. If there 
is no bypass between $i$ and $j$, the only path from $i$ to $j$ is $i \to n \to j$ in the star. 
The goal in this phase is to find the QoS parameters of links $i \to n$ and $n \to j$, such that 
the line segment of $i \to n \to j$ in the star is the same as the line segment of $i \to j$ in 
the mesh. Basically, a single link in the mesh has to be 'split' into two links $i \to n$ and 
$n \to j$ in the star. There are three steps in this phase: (1) find the spokes from the border 
nodes to the nucleus; (2) find the spokes from the nucleus to the border nodes; and (3) find the 
bypasses between border nodes. 



7.4.4.1 Spokes incoming to the nucleus 

In order to distinguish the line segments in the mesh from those in the star, superscript 'm' 
is used for mesh and 's' for star. For instance, let us consider a line segment from $i$ to $j$. 
If it is in the mesh, it is denoted $l^m_{ij}$; if it is in the star, it is denoted $l^s_{ij}$. 

In order to find spokes, we have to 'break' the line segments in the mesh. From the 
definition of the joint operation, we have a general idea what the 'broken' line segments 
look like. The endpoint delays of the spokes $l^s_{in}$ and $l^s_{nj}$ should be smaller than 
those of $l^m_{ij}$, while the endpoint bandwidths of the spokes should not be smaller than 
those of $l^m_{ij}$. The algorithm for finding spokes from a border node to the nucleus is based 
on these observations. 

Recall that the lower endpoint and the upper endpoint of a line segment $l$ are denoted 
$l.lp$ and $l.up$, respectively. Given a point $p$, its delay and bandwidth are denoted as $p.d$ 
and $p.w$, respectively. The algorithm that finds spokes incoming to the nucleus is defined as 

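$$l^s_{in} = [(\mathrm{min\_ld}, \mathrm{max\_lw}), (\mathrm{min\_ud}, \mathrm{max\_uw})] \tag{7.7}$$

where 

$$\begin{aligned}
\mathrm{min\_ld} &= \min_{j \in B, j \ne i} \{ l^m_{ij}.lp.d \} \\
\mathrm{min\_ud} &= \min_{j \in B, j \ne i} \{ l^m_{ij}.up.d \} \\
\mathrm{max\_lw} &= \max_{j \in B, j \ne i} \{ l^m_{ij}.lp.w \} \\
\mathrm{max\_uw} &= \max_{j \in B, j \ne i} \{ l^m_{ij}.up.w \}
\end{aligned} \tag{7.8}$$

An example of the program calculating Equation (7.7) can be found in Lui et al. [112]. 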
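
As a rough illustration of Equations (7.7) and (7.8), and not the program from [112], the sketch below computes one incoming spoke. The dictionary `mesh` keyed by `(i, j)` and the function name `incoming_spoke` are assumptions made for this example.

```python
from typing import Dict, Iterable, Tuple

Point = Tuple[float, float]      # (delay, bandwidth)
Segment = Tuple[Point, Point]    # (lower endpoint, upper endpoint)


def incoming_spoke(mesh: Dict[Tuple[str, str], Segment],
                   i: str, borders: Iterable[str]) -> Segment:
    """Spoke from border node i to the nucleus, Equations (7.7)-(7.8)."""
    segs = [mesh[(i, j)] for j in borders if j != i]
    min_ld = min(s[0][0] for s in segs)  # smallest lower-endpoint delay
    max_lw = max(s[0][1] for s in segs)  # largest lower-endpoint bandwidth
    min_ud = min(s[1][0] for s in segs)  # smallest upper-endpoint delay
    max_uw = max(s[1][1] for s in segs)  # largest upper-endpoint bandwidth
    return ((min_ld, max_lw), (min_ud, max_uw))


mesh = {
    ("a", "b"): ((10, 4), (13, 7)),
    ("a", "c"): ((8, 3), (15, 6)),
}
spoke_a = incoming_spoke(mesh, "a", ["a", "b", "c"])  # ((8, 4), (13, 7))
```

Taking the minimum delays and maximum bandwidths guarantees that the spoke is never worse than any mesh link leaving $i$, which is what the disjoint operation in the next step relies on.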



7.4.4.2 Spokes outgoing from the nucleus 

Up to this point, we know the mesh, and we also know the set of spokes from the border nodes 
to the nucleus. More specifically, we know $l^m_{ij}$ as well as $l^s_{in}$, and we want to find 
$l^s_{nj}$ such that the result of joining $l^s_{in}$ and $l^s_{nj}$ is $l^m_{ij}$. Since 
$l^m_{ij}.lp.d \ge l^s_{in}.lp.d$, $l^m_{ij}.up.d \ge l^s_{in}.up.d$, 
$l^m_{ij}.lp.w \le l^s_{in}.lp.w$ and $l^m_{ij}.up.w \le l^s_{in}.up.w$, we can obtain 
$l^s_{nj}$ by evaluating $l^m_{ij} - l^s_{in}$, according to Equation (7.3). However, for the 
same $j$, $l^m_{ij} - l^s_{in}$ may be different for different $i$. Since we can have at most one 
$l^s_{nj}$, we solve this problem by assigning $l^s_{nj}$ the average of 
$l^m_{ij} - l^s_{in}$ over all $i \in B$, $i \ne j$. 
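
The averaging step can be sketched as below. The text does not say how segments are averaged; this example assumes a component-wise mean of the endpoint delays and bandwidths. The names `outgoing_spoke`, `mesh` and `spokes_in` are hypothetical.

```python
from typing import Dict, Iterable, Tuple

Point = Tuple[float, float]      # (delay, bandwidth)
Segment = Tuple[Point, Point]    # (lower endpoint, upper endpoint)


def disjoint(s: Segment, t: Segment) -> Segment:
    """Equation (7.2): delays subtract, bandwidths take the minimum."""
    (a, b), (c, d) = s
    (a2, b2), (c2, d2) = t
    return ((a - a2, min(b, b2)), (c - c2, min(d, d2)))


def outgoing_spoke(mesh: Dict[Tuple[str, str], Segment],
                   spokes_in: Dict[str, Segment],
                   j: str, borders: Iterable[str]) -> Segment:
    """Spoke from the nucleus to j: mean of (mesh link - incoming spoke) over i != j.

    Assumption: the average is taken component by component.
    """
    diffs = [disjoint(mesh[(i, j)], spokes_in[i]) for i in borders if i != j]
    n = len(diffs)
    lp = (sum(s[0][0] for s in diffs) / n, sum(s[0][1] for s in diffs) / n)
    up = (sum(s[1][0] for s in diffs) / n, sum(s[1][1] for s in diffs) / n)
    return (lp, up)


mesh = {("a", "c"): ((10, 4), (13, 7)), ("b", "c"): ((6, 3), (9, 7))}
spokes_in = {"a": ((4, 5), (6, 8)), "b": ((2, 6), (5, 9))}
spoke_c = outgoing_spoke(mesh, spokes_in, "c", ["a", "b", "c"])
```

Here the two differences are [(6, 4), (7, 7)] and [(4, 3), (4, 7)], so the averaged spoke is [(5, 3.5), (5.5, 7)]. Neither original mesh link is reproduced exactly, which is the distortion that bypasses are meant to correct.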

7.4.4.3 Finding bypasses 

Owing to the aggregation, $l^s_{ij} = l^s_{in} + l^s_{nj}$ may no longer be the same as 
$l^m_{ij}$ in the mesh. Some may deviate only a little, while others may be quite different. In 
order to make the aggregation more precise, bypasses are introduced, which are direct links 
between border nodes. When $l^s_{in} + l^s_{nj}$ deviates a lot from $l^m_{ij}$, a bypass between 
border nodes $i$ and $j$ is used, and the QoS parameters of the paths from $i$ to $j$ are defined 
by the bypass instead of $l^s_{in} + l^s_{nj}$. Since there are $2|B|$ spokes ($|B|$ outgoing 
from and $|B|$ incoming to the nucleus), $|B|$ bypasses can be used in the network according to 
the recommendation of the PNNI standard. Only bypasses between those $|B|$ pairs of border nodes 
that have the largest deviations are used. 



7.4.5 Line-segment routing algorithm 

In this section the line-segment routing algorithm (LSRA) is presented. It is a QoS-based 
source routing algorithm, which integrates the line segment representation with Dijkstra's 
algorithm (DA) and the centralized bandwidth-delay routing algorithm (CBDRA) [125, 
127]. DA is the fastest known algorithm for finding the least-delay paths. CBDRA first 
prunes all links that do not satisfy the bandwidth requirement and then applies DA to find 
the least-delay path. LSRA extends the idea to capitalize on the additional information 
provided by the line segment representation. 
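
The prune-then-Dijkstra idea behind CBDRA can be sketched in a few lines. This is a simplified illustration rather than the algorithm of [125, 127]; links are assumed to be undirected tuples `(u, v, delay, bandwidth)` and the function name `cbdra` is chosen for this example.

```python
import heapq
from typing import List, Optional, Tuple

Link = Tuple[str, str, float, float]  # (u, v, delay, bandwidth), assumed undirected


def cbdra(links: List[Link], src: str, dst: str,
          req_w: float) -> Optional[Tuple[List[str], float]]:
    """Prune links below the bandwidth requirement, then find the least-delay path."""
    adj = {}
    for u, v, delay, bw in links:
        if bw >= req_w:  # pruning step
            adj.setdefault(u, []).append((v, delay))
            adj.setdefault(v, []).append((u, delay))
    dist = {src: 0.0}
    prev = {}
    heap = [(0.0, src)]
    while heap:
        d, u = heapq.heappop(heap)
        if d > dist.get(u, float("inf")):
            continue  # stale heap entry
        if u == dst:
            break
        for v, delay in adj.get(u, []):
            nd = d + delay
            if nd < dist.get(v, float("inf")):
                dist[v] = nd
                prev[v] = u
                heapq.heappush(heap, (nd, v))
    if dst not in dist:
        return None  # no path satisfies the bandwidth requirement
    path = [dst]
    while path[-1] != src:
        path.append(prev[path[-1]])
    path.reverse()
    return path, dist[dst]
```

A caller would compare the returned delay with the delay requirement req(d) to decide whether the path is feasible.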

Being a centralized routing protocol, LSRA requires that each node keeps the topology 
of its own domain and the star-with-bypasses aggregation of the other domains. As 
broadcasting takes a lot of time and bandwidth, it is desirable to keep the amount of broadcast 
information small. This is why the aggregation is a star with bypasses of $O(|B|)$ 
space instead of a mesh of $O(|B|^2)$ space. Routing is performed at two levels: interdomain 
routing and intradomain routing. An interdomain routing path specifies the border nodes 
of the transit domains, and the intradomain routing finds the subpath within each transit 
domain. Accordingly, LSRA has two routing phases. 

7.4.5.1 Interdomain routing 

After obtaining the star-with-bypasses aggregation from the external domains, each node 
can see all nodes in its own domain and all border nodes of the other domains. There are 
five steps in the LSRA interdomain routing: 

(1) transform stars with bypasses to meshes - since nuclei of stars are virtual, the actual 
routing paths should not include any nucleus; 

(2) prune logical links - this step prunes the logical links that do not satisfy the bandwidth 
requirement; 

(3) determine the delays of logical links - the delay value, supported by a line segment, 
is a function of the bandwidth requirement; this step determines the delay values of 
all logical links under the bandwidth requirement; 

(4) prune physical links - this step prunes the physical links that do not satisfy the 
bandwidth requirement; 

(5) apply DA on the network - this step uses DA to find a shortest-delay path to the 
destination domain. 
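
Steps (2) and (3) above can be illustrated for a single logical link. The text only states that the delay supported by a line segment is a function of the bandwidth requirement; the sketch below assumes one plausible reading, namely that the delay is read off the segment by linear interpolation, that a requirement below the lower endpoint's bandwidth gets the lower endpoint's delay, and that a requirement above the upper endpoint's bandwidth causes the link to be pruned. The function name is hypothetical.

```python
from typing import Optional, Tuple

Point = Tuple[float, float]      # (delay, bandwidth)
Segment = Tuple[Point, Point]    # (lower endpoint, upper endpoint)


def logical_link_delay(seg: Segment, req_w: float) -> Optional[float]:
    """Delay of a logical link under bandwidth requirement req_w.

    Returns None when the link is pruned (assumed interpretation, see lead-in).
    """
    (d_lo, w_lo), (d_hi, w_hi) = seg
    if req_w > w_hi:
        return None            # step (2): bandwidth cannot be satisfied
    if req_w <= w_lo:
        return d_lo            # smallest delay offered by the segment
    # step (3): interpolate along the segment (w_hi > w_lo is guaranteed here)
    return d_lo + (req_w - w_lo) * (d_hi - d_lo) / (w_hi - w_lo)


L2 = ((2.0, 1.5), (9.0, 9.0))      # segment L2 from Figure 7.28
mid = logical_link_delay(L2, 5.25)  # halfway in bandwidth, so halfway in delay
```

The resulting delays become the link weights on which DA is run in step (5).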

7.4.5.2 Intradomain routing 

After the interdomain routing, an interdomain path is determined. The source node knows 
how to traverse the nodes in its own domain to a border node, and how to traverse the border 
nodes of other domains to get to the destination domain. Because the source node does 
not have the detailed topology of any external domain, it does not know how to fill in the 
intradomain path segments across the external domains. Therefore, LSRA does the intradomain 
routing in a distributed fashion. A routing message is sent from the source to travel 
along the interdomain path. When a border node $t$ of domain $g$ receives the message and 
the next hop on the interdomain path is another border node $t'$ of $g$, $t$ locally computes the 
intradomain path going from itself to $t'$ using CBDRA, since $t$ has complete knowledge 
of its own domain $g$. Node $t$ inserts the intradomain path into the interdomain path that 
is carried by the message. Then the message is sent to $t'$ along the intradomain path. The 
message also keeps track of the accumulated delay of the path that has been traversed so far, 
including the intradomain path segments. This accumulated delay is calculated from the actual 
link delays as the message travels. If the accumulated delay exceeds req(d), the message 
is forwarded back to the previous node to find an alternate path. This is called crankback 
[113]. If the message successfully reaches the destination node, a feasible path is found. 

One should be aware that the expected path delay may be different from the actual 
path delay due to the network dynamics or the inaccuracy introduced by aggregation. If 
no intradomain path can be found without violating req(d), the message is sent back to the 
source to reject the request. 

7.4.5.3 Best point algorithm 

In this case the best delay and the best bandwidth are used to represent a mesh link. For 
example, if the delay-bandwidth parameters between a border pair are (2, 3), (4, 4) and (5, 6), 
then (2, 6) is used to represent the QoS of the mesh link. This approach is optimistic, even 
aggressive, because the best delay and the best bandwidth may come from different paths. It 
suffers from large crankback ratios (the crankback ratio is defined in Section 7.4.6.3). A best 
point (BP) mesh can be aggregated to a BP star by taking the maximum bandwidth among the mesh 
links as the spoke bandwidth and by taking half of the average delay among the mesh links as 
the spoke delay. 

7.4.5.4 Worst point 

The worst delay and the worst bandwidth are used to represent a mesh link. That is, if the 
parameters are (2, 3), (4, 4) and (5, 6), then (5, 3) is used. As a result, this algorithm has 
low success ratios. The minimum bandwidth and half of the average delay are 
used for aggregating a worst point (WP) mesh to a WP star. 
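
The two point representations are simple enough to state as code. The sketch below reproduces the numerical example from the text; the function names are chosen for this illustration.

```python
from typing import List, Tuple

Param = Tuple[float, float]  # (delay, bandwidth) of one path between a border pair


def best_point(params: List[Param]) -> Param:
    """BP: smallest delay and largest bandwidth, possibly from different paths."""
    return (min(d for d, _ in params), max(w for _, w in params))


def worst_point(params: List[Param]) -> Param:
    """WP: largest delay and smallest bandwidth."""
    return (max(d for d, _ in params), min(w for _, w in params))


params = [(2, 3), (4, 4), (5, 6)]
bp = best_point(params)    # (2, 6)
wp = worst_point(params)   # (5, 3)
```

Note that (2, 6) is not offered by any single path in the example, which is why BP tends to accept requests that later crank back, while (5, 3) is met by every path, which is why WP rejects many feasible requests.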

7.4.5.5 Modified Korkmaz-Krunz 

This is a modified version of the algorithm presented in Korkmaz and Krunz [121], the MKK 
algorithm. Let us consider a set of delay-bandwidth parameters, which corresponds to a set 
of paths between two border nodes. The stretch factor of a parameter $(d, w)$ is defined as 
$d/d_{best} + w_{best}/w$, where $d_{best}$ and $w_{best}$ are the best delay and the best 
bandwidth among all parameters, respectively. For the example used previously for BP and WP, 
$d_{best} = 2$ and $w_{best} = 6$. 

The stretch factor of (2, 3) is 2/2 + 6/3 = 3, and the stretch factors of (4, 4) and (5, 6) 
are both 3.5. The smallest stretch factor is denoted as $s\_factor$, which is 3 in the above 
case. The mesh link between the two border nodes is represented by a tuple 
$(d_{best}, w_{best}, s\_factor)$. The mesh link is likely to support a connection request 
[req(d), req(w)] if $d_{best} \le$ req(d), $w_{best} \ge$ req(w) and 
$s\_factor \le$ req(d)$/d_{best} + w_{best}/$req(w) [121]. 
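
The stretch-factor representation and the admission test can be checked numerically with a short sketch. The function names `mkk_tuple` and `likely_supported` are hypothetical, and the non-strict comparisons follow the condition as written above.

```python
from typing import List, Tuple

Param = Tuple[float, float]  # (delay, bandwidth)


def mkk_tuple(params: List[Param]) -> Tuple[float, float, float]:
    """Return (d_best, w_best, s_factor) for the paths between a border pair."""
    d_best = min(d for d, _ in params)
    w_best = max(w for _, w in params)
    s_factor = min(d / d_best + w_best / w for d, w in params)
    return d_best, w_best, s_factor


def likely_supported(link: Tuple[float, float, float],
                     req_d: float, req_w: float) -> bool:
    """Admission test for a request [req(d), req(w)] on a mesh link."""
    d_best, w_best, s_factor = link
    return (d_best <= req_d and w_best >= req_w
            and s_factor <= req_d / d_best + w_best / req_w)


link = mkk_tuple([(2, 3), (4, 4), (5, 6)])   # (2, 6, 3.0)
ok = likely_supported(link, 4, 4)            # True: 4/2 + 6/4 = 3.5 >= 3
too_optimistic = likely_supported(link, 2, 6)  # False: 2/2 + 6/6 = 2 < 3
```

The second request is exactly the best point (2, 6); the stretch factor rules it out, whereas the BP representation would have accepted it.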

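The transformation from a mesh to a star, in the original KK algorithm, is defined as 

$$s_{in}.d = \frac{1}{b - 1} \sum_{j=1, j \ne i}^{b} m_{ij}.d, \qquad s_{nj}.d = \frac{1}{b - 1} \sum_{i=1, i \ne j}^{b} m_{ij}.d \tag{7.9}$$

where $m_{ij}.d$ is the delay of a mesh link from border node $i$ to border node $j$, $b$ is the 
number of border nodes, $s_{in}.d$ is the delay of a spoke link from $i$ to the nucleus, and 
$s_{nj}.d$ is the delay of a spoke link from the nucleus to $j$. 

The problem with this averaging approach is that the delay after transformation deviates a 
lot from the original delay, because $s_{in}.d + s_{nj}.d$ may be twice the delay of the 
mesh link ($m_{ij}.d$). Therefore, the following modification is used (MKK): 

$$s_{in}.d = \frac{1}{2(b - 1)} \sum_{j=1, j \ne i}^{b} m_{ij}.d, \qquad s_{nj}.d = \frac{1}{2(b - 1)} \sum_{i=1, i \ne j}^{b} m_{ij}.d \tag{7.10}$$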

7.4.6 Performance measure 

In this kind of analysis the most often used performance measures are: delay deviation 
(DD), success ratio (SR) and crankback ratio (CBR). 

7.4.6.1 Delay deviation 

Owing to the distortion of aggregation, the delay between a source node and a destination 
domain, obtained using the method described in the previous sections, is not accurate. Delay 
deviation measures the difference between the real delay and the estimated delay obtained 
by the aggregation. It is defined as 

$$DD = |d_{esp} - d_{asp}| \tag{7.11}$$

where $d_{esp}$ is the estimated delay of the shortest path and $d_{asp}$ is the actual delay of 
the shortest path. 



7.4.6.2 Success ratio 

A feasible request may be rejected due to the imperfect approximation of the delay and 
bandwidth information during aggregation. The success ratio is used to measure quantitatively 
how well an algorithm finds feasible paths and it is defined as 

$$SR = \frac{\text{total number of accepted feasible requests}}{\text{total number of feasible requests}} \tag{7.12}$$

The numerator represents all connection requests that are accepted by both interdomain 
routing and intradomain routing. The denominator is the total number of feasible requests (not 
the total number of requests). Therefore, in simulation results, the success ratio measures 
the relative performance with respect to the optimal performance (accepting all feasible 
requests). 



7.4.6.3 Crankback ratio 

When an algorithm finds an interdomain path from the source node to the destination domain, 
it may overestimate the bandwidth or underestimate the delay and lead to crankbacks during 
intradomain routing. Crankback ratio measures how often that happens and is defined as 

$$CBR = \frac{\text{total number of requests cranked back}}{\text{total number of requests accepted by interdomain routing}} \tag{7.13}$$

A good algorithm should have small crankback ratios, high success ratios and small delay 
deviations for different bandwidth and delay requirements. 



7.4.7 Performance example 

The simulation environment consists of the interdomain topology based on the power-law 
model [128, 129], and the intradomain topology based on the Waxman model [130]. The 
degree of a domain is defined as the total number of interdomain links adjacent to the border 
nodes of the domain. The simulation testbed and parameter configuration are set as follows: 
10% of the domains have a degree of one, and the degrees of the other domains follow the 
power law, i.e. the frequency $f_d$ of a domain degree is proportional to the degree $d$ 
($\ge 2$) raised to the power of a constant $O = -2.2$. In other words, $f_d \propto d^{O}$. 
After each domain is assigned a degree according to the power law, a spanning tree is formed 
among the domains to ensure a connected graph. Additional links are inserted to fulfill the 
remaining degrees of every domain, with the neighbors selected according to probabilities 
proportional to their respective unfulfilled degrees. The Waxman topology for the internal 
nodes of each domain is formed as follows: the nodes are randomly placed in a one-by-one 
square, and the probability of creating a link between node $u$ and node $v$ is 
$p(u, v) \propto \exp[-d(u, v)/(\beta L)]$. 

In this expression $d(u, v)$ is the distance between $u$ and $v$, $\beta = 0.6$, and $L$ is the 
maximum distance between any two nodes. The average node degree is 4. The other simulation 
parameters are as follows: the number of domains in each topology is 200, the number of 
nodes in each domain is randomly selected between 10 and 40, the delay of a physical link is 
a random number between 2 and 10 units, and the bandwidth of a physical link is a random 
number between 1 and 5 units. For delay, a unit can be replaced by a certain number 
of milliseconds; for bandwidth, a unit can be replaced by a certain number of kilobytes 
(per second). Each data point is the average of 1000 randomly generated requests. More 
specifically, given a bandwidth requirement and/or a delay requirement, five topologies are 
randomly generated. On each topology, 200 requests are generated with the source and 
the destination randomly selected from the topology. LS, BP, WP and KK are run over 
these requests, respectively, and the average results give a set of data points in the figures. 
Simulations with the same set of parameters are presented, for example, in Lui et al. [112]. 
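
The intradomain part of this setup can be sketched as a small Waxman generator. The text gives the link probability only up to a proportionality constant, so the sketch introduces a hypothetical `scale` factor (which in practice would be tuned, for instance to reach the average node degree of 4); the function name and the seeding are also assumptions of this example.

```python
import math
import random
from typing import List, Tuple


def waxman_links(n: int, beta: float = 0.6, scale: float = 1.0,
                 seed: int = 0) -> Tuple[List[Tuple[float, float]], List[Tuple[int, int]]]:
    """Place n nodes in the unit square and link u, v with probability
    scale * exp(-d(u, v) / (beta * L)), where L is the largest pairwise distance.

    `scale` (at most 1) is a hypothetical proportionality constant.
    """
    if n < 2:
        raise ValueError("need at least two nodes")
    rng = random.Random(seed)
    pos = [(rng.random(), rng.random()) for _ in range(n)]
    pairs = [(u, v) for u in range(n) for v in range(u + 1, n)]
    max_dist = max(math.dist(pos[u], pos[v]) for u, v in pairs)
    links = []
    for u, v in pairs:
        p = scale * math.exp(-math.dist(pos[u], pos[v]) / (beta * max_dist))
        if rng.random() < p:
            links.append((u, v))
    return pos, links
```

Link delays and bandwidths would then be drawn uniformly from the ranges quoted above (2 to 10 units and 1 to 5 units, respectively).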

The simulation results, using the described simulation environment, are shown in 
Figure 7.29. The horizontal axis is the bandwidth requirement. The results show that the 
modified algorithm (MKK) has better success ratios than the original algorithm. For CBR, 
the MKK algorithm is worse than the original KK algorithm, because the spoke delays in the 
modified KK are smaller than the spoke delays in the original KK. A more optimistic estimate 
leaves more room for overestimation, which may lead to more crankbacks. However, as 
the crankback ratios of the modified KK are less than 5% for all bandwidth values (Figure 
7.31), it is believed that the performance improvement in success ratios outweighs the 
degradation of crankback ratios. As a result, the MKK algorithm is used in the rest of the 
section. 

Figure 7.30 shows the delay deviations of LS, BP, WP and KK. The horizontal axis 
presents the bandwidth requirement. For fairness, we count the paths of a request only 
when all four algorithms accept that request. Figure 7.30 shows that LS has smaller delay 
deviations than the other algorithms. That is because a line segment approximates the 
parameter staircase better than the other approaches, and thus gives more accurate information 
about the delay. The deviations are small when the bandwidth requirements are large ($\ge 4$), 
because the feasible paths are mostly short (within the same domain or between neighboring 
domains). 

Figure 7.29 Success ratios of two aggregation schemes of the Korkmaz-Krunz method. 
(Reproduced by permission of IEEE [112].) 

Figure 7.30 Delay deviation of different aggregation schemes. (Reproduced by permission 
of IEEE [112].) 

Figure 7.31 Success ratio vs bandwidth and delay. (Reproduced by permission of IEEE 
[112].) 

The success ratios are shown in Figure 7.31. It may be surprising that the success ratios 
are close to 1 when the bandwidth requirement is five units (maximum link capacity). Recall 
that our success ratio is defined relative to the number of feasible requests, not to the number 
of all requests. When the bandwidth request is large, there are few feasible requests. These 
feasible requests are mostly between nodes that are close to each other (in the same domain 
or neighboring domains), which makes the delay deviation small, as shown in Figure 7.30. 
That means it is less likely to reject a feasible request due to distortion. Therefore, the 
success ratio is high. 

Figure 7.32 Crankback ratio vs bandwidth and delay. (Reproduced by permission of IEEE 
[112].) [Two panels: crankback ratio against bandwidth (unit) and against delay (unit), for 
LS, BP, WP and KK.] 

Figure 7.33 Success ratio-crankback ratio. (Reproduced by permission of IEEE [112].) 

Figure 7.32 shows the crankback ratios of BP, WP, KK and LS. It is clear that BP has very 
high crankback ratios compared with the other methods. In contrast, as an aggregated WP 
topology tends to overestimate the real delay and underestimate the real bandwidth of the 
paths, WP has very small crankback ratios. Figure 7.33 compares success ratio-crankback 
ratio. BP does not perform well in this comparison due to its high crankback ratios. In 
contrast, LS has higher success ratios than WP and KK, and has reasonable crankback 
ratios. It outperforms all other schemes for different bandwidth and delay requirements by 
making a better tradeoff between success ratio and crankback ratio. 

Effective Capacity 



8.1 EFFECTIVE TRAFFIC SOURCE PARAMETERS 

For efficient resource allocation for connections requiring QoS, reliable and accurate 
characterizations of the offered traffic and available effective capacity in the network are 
needed [1-5, 7-21]. The ATM Forum [1] has adopted an open-loop flow control mechanism for 
ATM networks known as usage parameter control (UPC). A UPC algorithm accepts a cell 
stream as input and determines whether or not each cell conforms to the parameters of the 
algorithm. Depending on the network policy, nonconforming cells are either dropped prior 
to entering the network or tagged as low priority. The UPC function acts as a traffic policer 
at the network access point. In contrast, a traffic shaper smooths out the bitrate variations 
in a stream by delaying rather than dropping or tagging cells. A user can avoid possible 
cell loss at the network access point due to UPC by shaping its cell stream to conform to 
the negotiated UPC. All those actions will result in an effective traffic source. The UPC 
mechanisms proposed by the ATM Forum are variants of the leaky bucket algorithm. The 
leaky bucket is a simple means of shaping or policing a traffic stream and has desirable 
attributes in terms of network control [2-4]. 

A leaky bucket is parameterized by a leak rate $\mu$ and a bucket size $B$. It is convenient 
to describe the operation of the leaky bucket in terms of a fictitious ·/D/1/B queue, i.e. a 
single-server queue with deterministic (D) service at constant rate $\mu$ and capacity $B$ [2]. 
In this model, each cell arrival to the leaky bucket generates a corresponding customer arrival 
to the fictitious queue. At the time of a cell arrival, if the number of customers in the 
fictitious queue is less than $B$, the cell is said to be conforming, otherwise the cell is 
nonconforming. Nonconforming cells are either dropped or tagged as low priority cells. 
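
The fictitious-queue view of the policer can be simulated with a short sketch. It assumes that arrival times are sorted, that a nonconforming cell does not enter the fictitious queue (it is blocked by the finite capacity $B$), and that a customer whose service ends exactly at an arrival instant has already left. The function name `police` is hypothetical.

```python
from collections import deque
from typing import Iterable, List


def police(arrivals: Iterable[float], mu: float, bucket_size: int) -> List[bool]:
    """Return one conformance verdict per cell for a (mu, B) leaky bucket policer."""
    service = 1.0 / mu            # deterministic service time of the fictitious queue
    departures = deque()          # departure times of customers still in the queue
    last_departure = 0.0
    verdicts = []
    for t in arrivals:
        while departures and departures[0] <= t:
            departures.popleft()  # customers that have left by time t
        if len(departures) < bucket_size:
            # conforming: the cell adds a customer to the fictitious queue
            last_departure = max(t, last_departure) + service
            departures.append(last_departure)
            verdicts.append(True)
        else:
            verdicts.append(False)  # nonconforming: dropped or tagged
    return verdicts


verdicts = police([0.0, 0.1, 0.2, 1.5, 3.0], mu=1.0, bucket_size=2)
```

With $\mu = 1$ and $B = 2$, the third cell of this burst finds two customers in the fictitious queue and is declared nonconforming; the remaining cells conform.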

A leaky bucket shaper parameterized by $(\mu, B)$ operates in a manner similar to the leaky 
bucket policer, except that nonconforming cells are delayed, rather than dropped or tagged. 
The operation of the leaky bucket shaper can be understood in terms of a ·/D/1 queue. 
Upon arrival of a cell, if the number of customers in the fictitious queue is $B$ or more, the 
cell is placed in an infinite capacity first-in, first-out (FIFO) buffer. Whenever a customer 
leaves the fictitious queue, a cell (if any) is removed and transmitted from the FIFO buffer. 

The dual leaky bucket consists of two leaky buckets: a peak rate leaky bucket with 
parameters $(\lambda_p, B_p)$ and a sustainable rate leaky bucket with parameters 
$(\lambda_s, B_s)$, where $\lambda_p > \lambda_s$. A cell is conforming with respect to the dual 
leaky bucket if and only if it is conforming with respect to both the peak and sustainable rate 
buckets. The sustainable rate $\lambda_s$ represents the maximum long-run average rate for 
conforming cell streams. The peak rate $\lambda_p$ specifies a minimum intercell spacing of 
$1/\lambda_p$ with burst tolerance $B_p$. In this section, we shall assume that $B_p = 1$, so 
that the intercell spacing of conforming streams is constrained to be at least $1/\lambda_p$. 
Values of $B_p > 1$ allow for cell delay variation (CDV) between the customer premises and the 
network access point. 

The set $(\lambda_p, \lambda_s, B_s)$ constitutes a UPC descriptor that the user negotiates with 
the network. If the connection is accepted, the user applies a dual leaky bucket shaper to shape 
its cell stream according to the negotiated UPC. The dual leaky bucket limits the peak rate to 
$\lambda_p$, the long-term average rate to $\lambda_s$, and the maximum burst length 
(transmitted at peak rate) to $B_c = B_s \lambda_p/(\lambda_p - \lambda_s)$. 

Deterministic traffic characterization is based on the queueing model of the leaky bucket 
[5]. Suppose that the cell stream is of finite length and that the peak rate $\lambda_p$ is 
known. The cell stream is offered as input to a queue served at constant service rate $\mu$. Let 
$B_{\max}(\mu)$ be the maximum number of cells observed in the system over the duration of the 
cell stream. The stream can then be characterized by the peak rate and the values 
$B_{\max}(\mu)$ for $0 < \mu < \lambda_p$ as 

$$C_D = [\lambda_p, B_{\max}(\mu), 0 < \mu < \lambda_p] \tag{8.1}$$

Equation (8.1) can be interpreted as follows. For each $\mu$, $0 < \mu < \lambda_p$, the cell 
stream passes through a leaky bucket parameterized by $[\mu, B_{\max}(\mu)]$ without being 
shaped, i.e. the cell stream conforms to the leaky bucket parameters $[\mu, B_{\max}(\mu)]$. The 
notation $B(C_D, \mu) = B_{\max}(\mu)$ is introduced to show that the choice of bucket size, for 
a given leak rate $\mu$, depends on $C_D$. The characterization is tight in the following sense. 
If the cell stream conforms to a given set of leaky bucket parameters $(\mu, B')$, then 
necessarily $B' \ge B(C_D, \mu)$. For a given leak rate $\mu^*$, the minimum bucket size for 
conformance is $B^* = B(C_D, \mu^*)$. 
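
For a recorded cell trace, $B_{\max}(\mu)$ can be measured by replaying the trace into a queue served at constant rate $\mu$. The sketch below assumes sorted arrival times, counts the arriving cell as part of the system, and treats a customer whose service ends exactly at an arrival instant as already gone; the function name `max_backlog` is hypothetical.

```python
from collections import deque
from typing import Iterable


def max_backlog(arrivals: Iterable[float], mu: float) -> int:
    """Largest number of cells in a rate-mu queue over the trace, i.e. B_max(mu)."""
    service = 1.0 / mu
    departures = deque()      # departure times of cells still in the system
    last_departure = 0.0
    b_max = 0
    for t in arrivals:
        while departures and departures[0] <= t:
            departures.popleft()
        last_departure = max(t, last_departure) + service
        departures.append(last_departure)   # the arriving cell joins the system
        b_max = max(b_max, len(departures))
    return b_max


trace = [0.0, 0.1, 0.2, 1.5, 3.0]
curve = {mu: max_backlog(trace, mu) for mu in (1.0, 100.0)}
```

Evaluating `max_backlog` over a grid of leak rates gives a sampled version of the characterization in Equation (8.1). For this trace a bucket of size 3 is needed at $\mu = 1$, while a bucket of size 1 suffices at $\mu = 100$.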

Ideal statistical traffic characterization is based on the assumption that the peak rate 
$\lambda_p$ and the mean rate $\lambda_m$ of the cell stream are known. Suppose that the cell 
stream is offered to a leaky bucket traffic shaper with leak rate $\mu$, where 
$\lambda_m < \mu < \lambda_p$, and bucket size $B$. 

The shaping probability is defined as the probability that an arbitrary arriving cell has to 
wait in the data buffer of the leaky bucket. In the ·/D/1 queueing model of the leaky bucket, 
this event corresponds to a customer arrival to the fictitious queue when the number in the 
system is $B$ or more. If $B(\mu, \varepsilon)$ is defined as the minimum bucket size necessary 
to ensure that the shaping probability does not exceed $\varepsilon$, then 

$$C_S(\varepsilon) = [\lambda_p, \lambda_m, (\mu, B(\mu, \varepsilon)), \lambda_m < \mu < \lambda_p] \tag{8.2}$$

represents statistical traffic characterization. 

Approximate statistical traffic characterization is based on the assumption that the cell 
stream is offered to a queue with constant service rate $\mu$, where 
$\lambda_m < \mu < \lambda_p$. If $W$ is the steady-state waiting time in the queue for an 
arbitrary arriving cell, then the complementary waiting time distribution can be approximated 
as (see Appendix C; please go to www.wiley.com/go/glisic): 

$$P(W > t) \approx a(\mu) e^{-b(\mu) t}, \quad t \ge 0 \tag{8.3}$$

Two asymptotic approximations are of interest: 

$$e^{bt} P(W > t) \to a, \quad t \to \infty$$
$$t^{-1} \log P(W > t) \to -b, \quad t \to \infty \tag{8.4}$$

Also the following approximation is of interest, as suggested in Chang [6]: $a \approx bE(W)$. 

In many cases, the approximation in Equation (8.3) with $a = 1$ tends to be conservative, 
i.e. the bound $P(W > t) \le e^{-bt}$ holds under certain conditions [6, 7]. The one-parameter 
approximation 

$$P(W > t) \approx e^{-bt}, \quad t \ge 0 \tag{8.5}$$

is the basis for the theory of effective bandwidths that will be discussed below. 

More extensive discussions of this class of approximations can be found in References 
[7, 8]. In this section Equation (8.3) is used as a reasonable approximation for an ATM cell 
stream modeled as a general Markovian source. Over the relatively short observation 
windows of interest (on the order of seconds), these approximations are fairly robust for a 
large class of real traffic sources, in particular, MPEG video sequences. Therefore, parameters 
$[a(\mu), b(\mu)]$ provide a statistical characterization of the cell stream when offered to a 
queue with a constant service rate $\mu$. A more complete characterization of the cell stream 
records the values $[a(\mu), b(\mu)]$ for all $\mu$ in the range $(\lambda_m, \lambda_p)$ as 
follows: 

$$C_S = \{\lambda_p, \lambda_m, [a(\mu), b(\mu)] : \lambda_m < \mu < \lambda_p\} \tag{8.6}$$

Values of $[a(\mu), b(\mu)]$ are chosen such that Equation (8.3) holds approximately for values 
of $t$ in the range of interest. Assuming that Equation (8.3) holds at equality, we have (see 
Chapter 6) 

$$a(\mu)/b(\mu) = E[W] = \tau_r(\mu) E[S_a] + E[Q_a]/\mu \tag{8.7}$$

In Equation (8.7) $S_a$ denotes the number of customers in service, $Q_a$ the number of 
customers in queue seen by an arbitrary customer arrival, and $\tau_r(\mu)$ the average 
remaining service time for the customer in service as seen by an arriving customer, conditional 
on there being a customer in service. Bearing in mind that 

$$E[S_a] = P(W > 0) = a(\mu) \tag{8.8a}$$

and defining $q(\mu) = E[Q_a]$, Equation (8.7) gives 

$$b(\mu) = \frac{\mu a(\mu)}{q(\mu) + \mu a(\mu) \tau_r(\mu)} \tag{8.8b}$$
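
Equation (8.8b) is easy to check numerically against Equation (8.7). In the sketch below the input values are invented purely for illustration and the function name `decay_rate` is hypothetical.

```python
import math


def decay_rate(mu: float, a: float, q: float, tau_r: float) -> float:
    """b(mu) from Equation (8.8b), given a(mu), q(mu) = E[Q_a] and tau_r(mu)."""
    return mu * a / (q + mu * a * tau_r)


# Illustrative (made-up) measurements for one service rate
mu, a, q, tau_r = 10.0, 0.5, 2.0, 0.05
b = decay_rate(mu, a, q, tau_r)

# Consistency with Equation (8.7): a/b = tau_r * E[S_a] + E[Q_a]/mu, with E[S_a] = a
mean_wait = tau_r * a + q / mu
consistent = math.isclose(a / b, mean_wait)
```

The check simply confirms that (8.8b) is (8.7) solved for $b(\mu)$ after substituting (8.8a).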



The estimation of the above parameters is discussed later in this section. Once these 
parameters are obtained they are used in Equation (8.6) for statistical characterization of a 
cell stream. The next step is to map the characterization to a suitable UPC descriptor. 

8.1.1 Effective traffic source 

To avoid UPC violation at the network access point, typically a shaper is applied at the 
source to force the cell stream to conform to the negotiated UPC, resulting in an effective 
traffic source. If the user can tolerate a larger shaping delay, a UPC descriptor with smaller 
network cost can be chosen for the connection. So, the problem of selecting the UPC values 
for a cell stream can be formulated as: 

$$\min c(\lambda_p, \lambda_s, B_s) \text{ and } P(\text{shaping}) < \varepsilon \text{ and } \lambda_m < \lambda_s < \lambda_p \tag{8.9}$$

where parameters $(\lambda_m, \lambda_p)$ are assumed to be known a priori or obtained through 
measurements. The cost function $c(\cdot)$ is an increasing function of each argument and 
represents the cost to the network of accepting a call characterized by the UPC values 
$(\lambda_p, \lambda_s, B_s)$. The shaping probability $P(\text{shaping})$, which depends on the 
cell stream characteristics, represents the probability that an arbitrary arriving cell will be 
delayed by the shaper. 

In the case of an idealized traffic characterization $C_S(\varepsilon)$, defined by Equation 
(8.2), the values of $B(\mu, \varepsilon)$ for $\lambda_m < \mu < \lambda_p$ are recorded, where 
$B(\mu, \varepsilon)$ is the minimum bucket size that ensures a shaping probability less than 
$\varepsilon$ for a leaky bucket with rate $\mu$. 

Alternatively, from the approximate statistical characterization, we can obtain an estimate 
$B(C_S, \mu, \varepsilon)$ for this minimum bucket size. Substituting 
$B(C_S, \mu, \varepsilon)$ in Equation (8.9), we have 

$$\min c[\lambda_p, \lambda_s, B(C_S, \lambda_s, \varepsilon)] \text{ and } \lambda_m < \lambda_s < \lambda_p \tag{8.9a}$$



8.1.2 Shaping probability 

In a x/D/1 model of the $(\mu, B)$ leaky bucket shaper, an arriving cell is delayed (shaped) if
and only if the number of customers seen in the fictitious queue is $B$ or greater. Otherwise,
the cell is transmitted immediately without delay. Suppose that the user wants to choose the
UPC descriptor $(\lambda_p, \lambda_s, B_s)$ such that the probability that an arbitrary cell will
be shaped is less than $\varepsilon$. For a fixed leak rate $\mu$, we would like to determine the
minimum bucket size $B(\mu, \varepsilon)$ required to ensure a shaping probability less than
$\varepsilon$. One should remember that the shaping probability is an upper bound for the
violation probability, i.e., the probability that an arriving cell is nonconforming, in the
corresponding UPC policer. Let $n_a$ be a random variable representing the number in the
fictitious queue observed by an arbitrary cell arrival. Suppose that the number of customers in
the fictitious queue upon cell arrival is given by $n_a = N$, where $N \ge 1$ is an integer. This
means that the new customer sees $N - 1$ customers in the queue and one customer currently in
service.

Therefore, the waiting time in the queue $W$ for this customer lies in the range
$[(N - 1)/\mu, N/\mu]$. Also we can say, if $W$ for this customer lies in this range, then
$n_a = N$ at the arrival epoch. In other words, $n_a = N \Leftrightarrow (N - 1)/\mu < W \le N/\mu$,
so $n_a \ge N \Leftrightarrow (N - 1)/\mu < W$.

Based on this we can write $P(n_a \ge B) = P[W > (B - 1)/\mu]$. So by using Equation
(8.3), the shaping probability can be expressed as

$$P(\text{shaping}) = P(n_a \ge B) \approx a(\mu)\, e^{-b(\mu)(B - 1)/\mu}$$

From this relation, for $P(\text{shaping}) = \varepsilon$ we get

$$B(C_S, \mu, \varepsilon) = \frac{\mu}{b(\mu)} \log\frac{a(\mu)}{\varepsilon} + 1 \tag{8.10}$$
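Equation (8.10) can be evaluated directly once the queue parameters are known. The following is a minimal sketch, assuming the exponential tail approximation $P(W > x) \approx a e^{-bx}$ of Equation (8.3); the function names and the numerical values in the usage note are illustrative assumptions, not taken from the source.

```python
import math


def shaping_probability(a: float, b: float, mu: float, bucket: float) -> float:
    """Approximate P(shaping) = a * exp(-b * (B - 1) / mu), capped at 1."""
    return min(1.0, a * math.exp(-b * (bucket - 1) / mu))


def min_bucket_size(a: float, b: float, mu: float, eps: float) -> int:
    """Smallest integer bucket size meeting the target eps, from Equation (8.10)."""
    if a <= 0 or b <= 0 or mu <= 0 or not (0 < eps < 1):
        raise ValueError("a, b, mu must be positive and 0 < eps < 1")
    if a <= eps:
        # Even a bucket of size 1 already satisfies the constraint.
        return 1
    return math.ceil((mu / b) * math.log(a / eps) + 1)
```

For instance, with the hypothetical values a = 0.8, b = 0.05 and mu = 28, calling `min_bucket_size(0.8, 0.05, 28.0, 1e-5)` returns the smallest integer B for which `shaping_probability` does not exceed the target.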

8.1.3 Shaping delay 

We can also specify the shaping constraint in terms of delay. Consider a cell stream passing
through a leaky bucket traffic shaper with parameters $(\mu, B)$. Recall that the state counter
$n$ may be viewed as the number of customers in a fictitious x/D/1 queue. If $W$ represents
the waiting time in queue experienced by a given customer arriving in the fictitious queue
served at rate $\mu$, the actual delay experienced by the cell corresponding to the given
fictitious customer can be expressed as

$$D = \max[0, W - (B - 1)/\mu] \tag{8.11}$$

Using Equation (8.3) and $W$ from Equation (8.11) we have

$$P(D > D_{\max}) \approx a(\mu)\, e^{-b(\mu) D_{\max}}\, e^{-b(\mu)(B - 1)/\mu}$$

$$E[D] \approx \frac{a(\mu)}{b(\mu)}\, e^{-b(\mu)(B - 1)/\mu} \tag{8.12}$$

Setting the expected delay equal to the target mean delay $D$ gives $B$ as

$$B(C_S, \mu, D) = \frac{\mu}{b(\mu)} \log\frac{a(\mu)}{b(\mu) D} + 1 \tag{8.13}$$

which will be referred to as the mean delay constraint curve.
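The mean delay constraint curve can be sketched in a few lines. This is an illustrative implementation of Equations (8.12) and (8.13) under the same exponential tail assumption; the function names are hypothetical, and the clamp to a bucket size of 1 when the unshaped mean delay already meets the target is an added convenience, not part of the source.

```python
import math


def mean_shaping_delay(a: float, b: float, mu: float, bucket: float) -> float:
    """E[D] from Equation (8.12) for a (mu, B) leaky bucket shaper."""
    return (a / b) * math.exp(-b * (bucket - 1) / mu)


def bucket_for_mean_delay(a: float, b: float, mu: float, target_delay: float) -> float:
    """Bucket size from Equation (8.13) that gives E[D] equal to target_delay."""
    if a <= 0 or b <= 0 or mu <= 0 or target_delay <= 0:
        raise ValueError("all arguments must be positive")
    if a / b <= target_delay:
        # With B = 1 the mean delay a/b is already within the target.
        return 1.0
    return (mu / b) * math.log(a / (b * target_delay)) + 1.0
```

Substituting the result of `bucket_for_mean_delay` back into `mean_shaping_delay` recovers the target delay, which is a quick consistency check on the two formulas.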

The cost function $c(\lambda_p, \lambda_s, B_s)$ represents the amount of resources that should
be allocated to a call characterized by the UPC parameters $(\lambda_p, \lambda_s, B_s)$. For a
source with deterministic periodic on-off sample paths and a random offset, the lengths of the
on and off periods are given by

$$T_{on} = B_s/(\lambda_p - \lambda_s) \quad \text{and} \quad T_{off} = B_s/\lambda_s \tag{8.14}$$

Consider an on-off source model in which the source alternately transmits at rate $r$ during
an on-period and is silent during an off-period. The mean on-period and off-period lengths
are given by $\beta^{-1}$ and $\alpha^{-1}$, respectively. For mathematical tractability, an
Erlang-$k$ distribution for the on and off periods is assumed.

The effective bandwidth of a source is defined as the minimum capacity required to serve
the traffic source so as to achieve a specified steady-state cell loss probability
$\varepsilon_{mux}$, where the multiplexer or switch buffer size is denoted by $B_{mux}$. Using
the result from Kobayashi and Ren [9], the effective bandwidth for this source model can be
obtained as

$$c(\text{eff}) = \frac{r\alpha}{\alpha + \beta} + \frac{r^2 \alpha \beta K}{(\alpha + \beta)^3} \tag{8.15}$$

where $K = -\log \varepsilon_{mux}/(2 k B_{mux})$. If we set the mean on and off periods as
$\beta^{-1} = T_{on}$ and $\alpha^{-1} = T_{off}$, then by using Equation (8.14) in Equation
(8.15), we have

$$c(\text{eff}) = \lambda_s + \lambda_s (\lambda_p - \lambda_s) K B_s/\lambda_p \tag{8.16}$$

Now, the following cost function for a UPC controlled source can be defined:

$$c(\lambda_p, \lambda_s, B_s) = \min[\lambda_p, c(\text{eff})] = \min[\lambda_p, \lambda_s + \lambda_s (\lambda_p - \lambda_s) K B_s/\lambda_p] \tag{8.17}$$

The value for parameter $K$ should be chosen such that $c(\lambda_p, \lambda_s, B_s)$ approximates
the bandwidth allocation of the network CAC policy. The limiting behavior of the cost function
with respect to $K$ is

$$c(\lambda_p, \lambda_s, B_s) \to \lambda_s \ \text{ as } \ K \to 0 \quad \text{and} \quad c(\lambda_p, \lambda_s, B_s) \to \lambda_p \ \text{ as } \ K \to \infty$$
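The cost function of Equation (8.17) and its limiting behavior can be checked numerically. The sketch below is illustrative only: the function names are hypothetical, and `cac_constant` simply evaluates the expression for $K$ given with Equation (8.15).

```python
import math


def cac_constant(eps_mux: float, k: int, b_mux: float) -> float:
    """K = -log(eps_mux) / (2 * k * B_mux), as defined with Equation (8.15)."""
    return -math.log(eps_mux) / (2 * k * b_mux)


def upc_cost(lam_p: float, lam_s: float, bucket: float, K: float) -> float:
    """Cost of a UPC descriptor (lam_p, lam_s, B_s), Equation (8.17)."""
    c_eff = lam_s + lam_s * (lam_p - lam_s) * K * bucket / lam_p  # Equation (8.16)
    return min(lam_p, c_eff)
```

With K = 0 the cost equals the sustainable rate, and for very large K it saturates at the peak rate, matching the two limits stated above.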

Real-time UPC estimation is based on observations of the cell stream over a time period T . 
The procedure consists of the following steps: (1) estimate the mean and peak cell rates; (2) 
choose a set of candidate sustainable rates; (3) obtain the characterization of the stream; (4) 
for each candidate sustainable rate, compute the minimum bucket size required to meet a 
user constraint on shaping delay; and (5) choose the candidate rate that minimizes the cost 
function. 

The peak and mean rates can be estimated from observations of the cell stream over a
window of length $T$. The interval is subdivided into smaller subintervals of equal length
$T_s = T/M$. For example, in experiments with MPEG video sequences described in Mark
and Ramamurthy [5], $T_s$ was set equal to the frame period of 33 ms. If $n$ is the total number
of cell arrivals during $T$ and $n_i$ the number of cell arrivals in the $i$th subinterval, then
the mean rate can be estimated as $\lambda_m = n/T$. For some applications, the peak rate may be
known a priori. Otherwise, the peak rate can be estimated by
$\lambda_p = \max\{n_i/T_s : 1 \le i \le M\}$.
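The two rate estimates amount to a sum and a maximum over the per-subinterval cell counts. A minimal sketch follows; the function name and the sample counts in the usage note are illustrative assumptions.

```python
def estimate_rates(counts: list, subinterval: float) -> tuple:
    """Return (mean_rate, peak_rate) from cell counts n_i per subinterval of length T_s."""
    if not counts or subinterval <= 0:
        raise ValueError("need at least one count and a positive subinterval")
    total_time = len(counts) * subinterval      # T = M * T_s
    mean_rate = sum(counts) / total_time        # lambda_m = n / T
    peak_rate = max(counts) / subinterval       # lambda_p = max_i n_i / T_s
    return mean_rate, peak_rate
```

For example, `estimate_rates([660, 990, 330, 660], 33.0)` treats each count as the number of cells in one 33 ms frame period and returns rates in cells/ms.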

Candidate rates $\mu_1, \ldots, \mu_N$ are chosen between the mean and peak rates. The simplest
approach is to choose $N$ and space the candidate rates uniformly between the mean
and peak rates. However, with a more judicious method of assignment, the set of candidate
rates can be kept small, while still including rates close to the near-optimal sustainable
rates. Suppose that a prior estimate of the operating sustainable rate, $\lambda_s$, is available.
In the absence of prior knowledge, one could assign $\lambda_s = (\lambda_m + \lambda_p)/2$. The
candidate rate $\mu_N$ is assigned equal to $\lambda_s$. The remaining rates are grouped into a set
of $N_c$ coarsely spaced rates and $N_f$ finely spaced rates, with $N = N_c + N_f + 1$ and with
$N_f$ even. The coarse rates $\mu_1, \ldots, \mu_{N_c}$ are assigned to be spaced uniformly over the
interval $(\lambda_m, \lambda_p)$ as $\mu_i = \lambda_m + i\Delta_c$, $i = 1, \ldots, N_c$, where
$\Delta_c$ is defined as $\Delta_c = (\lambda_p - \lambda_m)/(N_c + 1)$. The remaining $N_f$ fine
rates are assigned in the neighborhood of $\lambda_s$ as follows:
$\mu_{j+N_c} = \lambda_s + j\Delta_f$, $j = 1, \ldots, N_f/2$, and
$\mu_{j+N_c} = \lambda_s - (j - N_f/2)\Delta_f$, $j = N_f/2 + 1, \ldots, N_f$, where
$\Delta_f = \Delta_c/(N_c + 1)$. Figure 8.1 shows an example of the choice of candidate rates with
$N = 8$, $N_c = 3$ and $N_f = 4$.
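The coarse and fine assignment can be sketched as follows. This is a hedged illustration: the function name is hypothetical, and the fine spacing is taken as $\Delta_f = \Delta_c/(N_c + 1)$, which is an assumption where the printed formula is hard to read and should be checked against [5].

```python
def candidate_rates(lam_m, lam_p, n_coarse, n_fine, lam_s=None):
    """Return the N = n_coarse + n_fine + 1 candidate leak rates mu_1..mu_N."""
    if n_fine % 2 != 0:
        raise ValueError("n_fine must be even")
    if lam_s is None:
        lam_s = (lam_m + lam_p) / 2           # default when no prior estimate exists
    d_c = (lam_p - lam_m) / (n_coarse + 1)    # coarse spacing
    d_f = d_c / (n_coarse + 1)                # fine spacing (assumed divisor)
    half = n_fine // 2
    coarse = [lam_m + i * d_c for i in range(1, n_coarse + 1)]
    fine_above = [lam_s + j * d_f for j in range(1, half + 1)]
    fine_below = [lam_s - j * d_f for j in range(1, half + 1)]
    return coarse + fine_above + fine_below + [lam_s]   # mu_N = lam_s
```

Note that with the default midpoint estimate and an odd number of coarse rates, one coarse rate coincides with $\lambda_s$; a caller may wish to deduplicate in that case.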

Measurement of traffic parameters is based on the bank of queues. The $i$th fictitious
queue is implemented using a counter $n_i$ that is decremented at the constant rate $\mu_i$ and
incremented by 1 whenever a cell arrives [5]. For each rate $\mu_i$, estimates of the queue
parameters $a(\mu_i)$ and $b(\mu_i)$ are obtained. Each queue is sampled at the arrival epoch of
an 'arbitrary' cell. This sampling is carried out by skipping a random number of cells
between the $(j-1)$st and $j$th sampled arrivals. Since all of the fictitious queues are sampled
at the same arrival epochs, only one random number needs to be generated for all queues at
a sampling epoch. Over an interval of length $T$, a number of samples, say $M$, are taken from
the queues. At the $j$th sampling epoch, the following quantities are recorded for each queue:
$S_j$, the number of customers in service (so $S_j \in \{0, 1\}$); $Q_j$, the number of customers
in queue; and $T_j$, the remaining service time of the customer in service (if one is in service).
After that the following sample means are computed:

$$\hat{a} = \sum_{j=1}^{M} S_j/M; \qquad \hat{q} = \sum_{j=1}^{M} Q_j/M; \qquad \hat{\tau}_r = \sum_{j=1}^{M} T_j/(\hat{a} M)$$

Figure 8.1 Example of candidate rate assignment.

Parameter $b$ is then computed by using Equation (8.8) as

$$\hat{b} = \frac{\mu \hat{a}}{\hat{q} + \mu \hat{a} \hat{\tau}_r} \approx \frac{\mu \hat{a}}{\hat{q} + \hat{a}/2}$$

where $\hat{\tau}_r$ is approximated by $1/(2\mu)$. The approximate statistical characterization is
given by

$$C_S = \{\lambda_p, \lambda_m, [\hat{a}(\mu_i), \hat{b}(\mu_i)] : i = 1, \ldots, N\}$$
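The sample means and the resulting estimate of $b$ for one fictitious queue can be sketched as below. The function name and the tuple layout of the samples are assumptions made for illustration; the formulas follow the sample means above and Equation (8.8b).

```python
def estimate_queue_parameters(samples, mu):
    """Estimate (a, q, tau_r, b) for one fictitious queue with leak rate mu.

    samples: list of (S_j, Q_j, T_j) tuples recorded at the sampling epochs.
    """
    m = len(samples)
    if m == 0:
        raise ValueError("no samples")
    a_hat = sum(s for s, _, _ in samples) / m
    q_hat = sum(q for _, q, _ in samples) / m
    if a_hat == 0:
        raise ValueError("queue never busy at a sampling epoch; b is undefined")
    tau_r = sum(t for _, _, t in samples) / (a_hat * m)   # mean residual service time
    b_hat = mu * a_hat / (q_hat + mu * a_hat * tau_r)     # Equation (8.8b)
    return a_hat, q_hat, tau_r, b_hat
```

Running this once per candidate rate $\mu_i$, on samples taken at the same arrival epochs, yields the list of pairs that makes up the characterization.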

Mapping to a UPC Descriptor is focused on the case where the user specifies a maximum 
mean delay D. The operating rate index is chosen to minimize the cost function defined in 
Equation (8.17): 

$$i^* = \arg\min_{1 \le i \le N} c[\hat{\lambda}_p, \mu_i, B_s(C_S, \mu_i, D)]$$

where $\hat{\lambda}_p$ is the estimated peak rate and $B_s(C_S, \mu_i, D)$ is defined in Equation
(8.13) and represents the bucket size required to achieve a mean delay of $D$ at a leak rate
$\mu_i$. Then the operating UPC descriptor is assigned as follows:

$$\lambda_p = \hat{\lambda}_p, \qquad \lambda_s = \mu_{i^*}, \qquad B_s = B_s(C_S, \mu_{i^*}, D)$$

If the estimated mean and peak rates are close in value, the stream can be treated as a CBR
stream. In this case, the UPC descriptor can simply be assigned according to the peak rate.

Some applications, such as video delivery, interactive multimedia sessions, etc., cannot
be adequately characterized by a static set of parameters that are expected to hold for
the entire session. The ability to renegotiate parameters during a session may be a viable
way of efficiently supporting variable bit rate (VBR) traffic [5, 10] with real-time QoS
constraints.

8.1.4 Performance example 

In the MPEG standard for video compression, video is transmitted as a sequence of frames.
For transmission over an ATM network, each MPEG frame must be segmented into ATM
cells. In Mark and Ramamurthy [5], two modes of cell transmission over a frame interval are
considered. In nonsmooth mode, the cells are transmitted at a constant rate $\lambda_p$ starting
at the beginning of the frame interval until the frame is fully transmitted. Then the source is
silent until the beginning of the next frame interval. This results in an on-off type stream with
correlated on and off periods. In smooth mode, the cells for the $i$th frame are transmitted
at a constant rate $f(i)/\tau$ over the frame interval, where $f(i)$ is the total number of cells in
the $i$th frame and $\tau$ is the interframe period. Thus, buffering is done to smooth out the
transmission of cells of each frame over the corresponding frame interval.

Figure 8.2 shows the characterization $C_S$ obtained empirically for an MPEG video stream
[5]. For this stream, the empirical mean and peak rates were determined to be 20 and
40.5 cells/ms, respectively. The two streams (nonsmooth and smooth mode) were offered to 19
queues running in parallel at the rates $\mu = 21, 22, \ldots, 39$ (cells/ms) and the corresponding
values for $[a(\mu), b(\mu)]$ were obtained according to the procedure described above. In the
nonsmooth mode, $a(\mu) \approx 1$ for all values of $\mu$, while for smooth mode, $a(\mu)$
monotonically decreases as $\mu$ increases from $\lambda_m$ to $\lambda_p$. On the other hand, the
$b(\mu)$ curves for the nonsmooth and smooth modes are close for $\mu$ near the extremes
$\lambda_m$ and $\lambda_p$. For values of $\mu$ closer to the center of the interval
$(\lambda_m, \lambda_p)$, the curve for smooth mode lies above the curve for nonsmooth mode.
Thus, the characterization $C_S$ captures the decrease in traffic burstiness due to intraframe
smoothing.

Figures 8.3 and 8.4 demonstrate that, for example, a shaping probability of 0.00001 at a
cell rate of 28 cells/ms can be achieved with $B = 1100$ in the case of smooth mode, while
nonsmooth mode would require $B = 2000$ for the same shaping probability. More results
on this issue can be found in Mark and Ramamurthy [5].

Figure 8.2 (a) a and (b) b parameters for MPEG.

Figure 8.3 Shaping probability curves for nonsmooth mode.

Figure 8.4 Shaping probability curves for smooth mode.

8.2 EFFECTIVE LINK LAYER CAPACITY 

In the previous section we discussed the problem of effective characterization of the traffic 
source in terms of the parameters which are used to negotiate with the network a certain 
level of QoS. Similarly, on the network side, to enable the efficient support of quality of 
service (QoS) in 4G wireless networks, it is essential to model a wireless channel in terms of 
connection-level QoS metrics such as data rate, delay and delay-violation probability. The 
traditional wireless channel models, i.e. physical-layer channel models, do not explicitly 
characterize a wireless channel in terms of these QoS metrics. In this section, we discuss 
a link-layer channel model referred to as effective capacity (EC) [11]. In this approach, a 
wireless link is modeled by two EC functions, the probability of nonempty buffer and the 
QoS exponent of a connection. Then, a simple and efficient algorithm to estimate these EC 
functions is discussed. The advantage of the EC link-layer modeling and estimation is ease
of translation into QoS guarantees, such as delay bounds, and hence efficiency in admission
control and resource reservation.

Conventional channel models, discussed in Chapter 3, directly characterize the fluctuations
in the amplitude of a radio signal. These models will be referred to as physical-layer
channel models, to distinguish them from the link-layer channel model discussed in this
section. Physical-layer channel models provide a quick estimate of the physical-layer
performance of wireless communications systems (e.g. symbol error rate vs SNR). However,
physical-layer channel models cannot be easily translated into complex link-layer QoS
guarantees for a connection, such as bounds on delay. The reason is that these complex QoS
requirements need an analysis of the queueing behavior of the connection, which cannot be
extracted from physical-layer models. Thus, it is hard to use physical-layer models in QoS
support mechanisms, such as admission control and resource reservation.

For these reasons, it was proposed to move the channel model up the protocol stack 
from the physical layer to the link layer. The resulting model is referred to as EC link model 
because it captures a generalized link-level capacity notion of the fading channel. Figure 8.5 
illustrates the difference between the conventional physical-layer and link-layer model. For 
simplicity, the 'physical-layer channel' will be called the 'physical channel' and 'link-layer 
channel' will be referred to as the 'link'. 



8.2.1 Link-layer channel model 

4G wireless systems will need to handle increasingly diverse multimedia traffic, which is
expected to be primarily packet-switched. The key difference between circuit switching
and packet switching, from a link-layer design viewpoint, is that packet switching requires 
queueing analysis of the link. Thus, it becomes important to characterize the effect of the data 
traffic pattern, as well as the channel behavior, on the performance of the communication 
system. 

QoS guarantees have been heavily researched in the wired networks [e.g. ATM and
Internet protocol (IP) networks]. These guarantees rely on the queueing model shown in
Figure 8.6 and studied in Chapter 6. This figure shows that the source traffic and the network
service are matched using a first-in first-out buffer (queue). Thus, the queue prevents loss of
packets that could occur when the source rate is more than the service rate, at the expense
of increasing the delay. Queueing analysis, which is needed to design appropriate admission
control and resource reservation algorithms, requires source traffic characterization
and service characterization. The most widely used approach for traffic characterization is
to require that the amount of data (i.e. bits as a function of time $t$) produced by a source
conform to an upper bound, called the traffic envelope, $\Gamma(t)$. The service characterization
for guaranteed service is a guarantee of a minimum service (i.e. bits communicated
as a function of time) level, specified by a service curve (SC) $\Psi(t)$ [12]. Functions
$\Gamma(t)$ and $\Psi(t)$ are specified in terms of certain traffic and service parameters,
respectively. Examples include the UPC parameters, discussed in the previous section, used in
ATM for traffic characterization, and the traffic specification (T-SPEC) and the service
specification (R-SPEC) fields used with the resource reservation protocol [12] in IP networks.

Figure 8.5 Packet-based wireless communication system.

Figure 8.6 Queueing system model.

A traffic envelope $\Gamma(t)$ characterizes the source behavior in the following manner: over
any window of size $t$, the amount of actual source traffic $A(t)$ does not exceed $\Gamma(t)$ (see
Figure 8.7). For example, the UPC parameters, discussed in Section 8.1, specify $\Gamma(t)$ by

$$\Gamma(t) = \min\{\lambda_p^{(s)} t,\ \lambda_s^{(s)} t + \sigma^{(s)}\} \tag{8.18}$$

where $\lambda_p^{(s)}$ is the peak data rate, $\lambda_s^{(s)}$ the sustainable rate, and
$\sigma^{(s)} = B_s$ the leaky-bucket size [12]. As shown in Figure 8.7, the curve $\Gamma(t)$
consists of two segments: the first segment has a slope equal to the peak source data rate
$\lambda_p^{(s)}$, while the second has a slope equal to the sustainable rate $\lambda_s^{(s)}$, with
$\lambda_s^{(s)} < \lambda_p^{(s)}$; $\sigma^{(s)}$ is the y-axis intercept of the second segment.
$\Gamma(t)$ has the property that $A(t) \le \Gamma(t)$ for any time $t$. Just as $\Gamma(t)$ upper
bounds the source traffic, a network SC $\Psi(t)$ lower bounds the actual service $S(t)$ that a
source will receive. $\Psi(t)$ has the property that $\Psi(t) \le S(t)$ for any time $t$. Both
$\Gamma(t)$ and $\Psi(t)$ are negotiated during the admission control and resource reservation
phase. An example of a network SC is the R-SPEC curve used for guaranteed service in IP networks

$$\Psi(t) = \max[\lambda_s^{(c)}(t - \sigma^{(c)}), 0] = [\lambda_s^{(c)}(t - \sigma^{(c)})]^+ \tag{8.19}$$

where $\lambda_s^{(c)}$ is the constant service rate, and $\sigma^{(c)}$ the delay (due to
propagation delay, link sharing, and so on). This curve is illustrated in Figure 8.7. $\Psi(t)$
consists of two segments; the horizontal segment indicates that no packet is being serviced due
to propagation delay, etc., for a time interval equal to the delay $\sigma^{(c)}$, while the second
segment has a slope equal to the service rate $\lambda_s^{(c)}$. In the figure, the horizontal
difference between $A(t)$ and $S(t)$, denoted $D(\tau)$, is the delay experienced by a packet
arriving at time $\tau$, and the vertical difference between the two curves, denoted by
$Q(\tau)$, is the queue length built up at time $\tau$, due to packets that have not been served yet.

Figure 8.7 Traffic and service characterization. (Reproduced by permission of IEEE [11].)
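The two piecewise-linear curves of Equations (8.18) and (8.19) are straightforward to evaluate. The sketch below is illustrative; the function and parameter names are assumptions, and the numbers in the usage note are arbitrary.

```python
def traffic_envelope(t: float, peak: float, sustainable: float, bucket: float) -> float:
    """Gamma(t) = min(peak * t, sustainable * t + bucket), Equation (8.18)."""
    return min(peak * t, sustainable * t + bucket)


def service_curve(t: float, rate: float, delay: float) -> float:
    """Psi(t) = max(rate * (t - delay), 0), Equation (8.19)."""
    return max(rate * (t - delay), 0.0)
```

With a peak rate of 10, a sustainable rate of 4 and a bucket of 30, the envelope switches from its first to its second segment at t = 30/(10 - 4) = 5; a service curve with rate 5 and delay 2 stays at zero until t = 2.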

As discussed in Chapter 4, providing QoS guarantees over wireless channels requires
accurate models of their time-varying capacity, and effective utilization of these models
for QoS support. The simplicity of the SCs discussed earlier motivates us to define the
time-varying capacity of a wireless channel as in Equation (8.19). Specifically, we hope to
lower bound the channel service using two parameters: the channel sustainable rate
$\lambda_s^{(c)}$ and the maximum fade duration $\sigma^{(c)}$.

Parameters $\lambda_s^{(c)}$ and $\sigma^{(c)}$ are meant to be in a statistical sense. The maximum
fade duration $\sigma^{(c)}$ is a parameter that relates the delay constraint to the channel
service. It determines the probability $\sup_t \Pr\{S(t) < \Psi(t)\}$. We will see later that
$\sigma^{(c)}$ is specified by the source with $\sigma^{(c)} = D_{\max}$, where $D_{\max}$ is the
delay bound required by the source.

However, physical-layer wireless channel models do not explicitly characterize the channel
in terms of such link-layer QoS metrics as data rate, delay and delay-violation probability.
For this reason, we are forced to look for alternative channel models.

A problem that surfaces is that a wireless channel has a capacity that varies randomly with
time. Thus, an attempt to provide a strict lower bound [i.e. the deterministic SC $\Psi(t)$, used
in IP networks] will most likely result in extremely conservative guarantees. For example,
in a Rayleigh or Ricean fading channel, the only lower bound that can be deterministically
guaranteed is a capacity of zero. The capacity here is meant to be delay-limited capacity,
which is the maximum rate achievable with a prescribed delay bound (see Hanly and Tse
[13] for details). This conservative guarantee is clearly useless in practice. Therefore, the
concept of deterministic SC $\Psi(t)$ is extended to a statistical version, specified as the pair
$\{\Psi(t), \varepsilon\}$. The statistical SC $\{\Psi(t), \varepsilon\}$ specifies that the service
provided by the channel, denoted $S(t)$, will always satisfy the property that
$\sup_t \Pr\{S(t) < \Psi(t)\} \le \varepsilon$. In other words, $\varepsilon$ is the probability that
the wireless channel will not be able to support the pledged SC $\Psi(t)$, referred to as the
outage probability. For most practical values of $\varepsilon$, a nonzero SC $\Psi(t)$ can be
guaranteed.

8.2.2 Effective capacity model of wireless channels 

From the previous discussion we can see that for the QoS control we need to calculate an
SC $\Psi(t)$ such that, for a given $\varepsilon > 0$, the following probability bound on the
channel service $S(t)$ is satisfied:

$$\sup_t \Pr\{S(t) < \Psi(t)\} \le \varepsilon \quad \text{and} \quad \Psi(t) = [\lambda_s^{(c)}(t - \sigma^{(c)})]^+ \tag{8.20}$$

The statistical SC specification requires that we relate its parameters
$\{\lambda_s^{(c)}, \sigma^{(c)}, \varepsilon\}$ to the fading wireless channel, which at first sight
seems to be a hard problem. At this point, the idea that the SC $\Psi(t)$ is a dual of the traffic
envelope $\Gamma(t)$ is used [11]. A number of papers exist on the so-called theory of effective
bandwidth [14], which models the statistical behavior of traffic. In particular, the theory shows
that the relation $\sup_t \Pr\{Q(t) \ge B\} \le \varepsilon$ is satisfied for large $B$, by choosing
two parameters (which are functions of the channel rate $r$) that depend on the actual data
traffic, namely, the probability of nonempty buffer, and the effective bandwidth of the source.
Thus, a source model defined by these two functions fully characterizes the source from a QoS
viewpoint. The duality between Equation (8.20) and
$\sup_t \Pr\{Q(t) \ge B\} \le \varepsilon$ indicates that it may be possible to adapt the theory of
effective bandwidth to SC characterization. This adaptation will point to a new channel model,
which will be referred to as the effective capacity (EC) link model. Thus, the EC link model can
be thought of as the dual of the effective bandwidth source model, which is commonly used
in networking.

8.2.2.1 Effective bandwidth

The stochastic behavior of source traffic can be modeled asymptotically by its effective
bandwidth. Consider an arrival process $\{A(t), t \ge 0\}$, where $A(t)$ represents the amount of
source data (in bits) over the time interval $[0, t)$. Assume that the asymptotic log-moment
generating function of $A(t)$, defined as

$$\Lambda(u) = \lim_{t \to \infty} \frac{1}{t} \log E\left[e^{uA(t)}\right] \tag{8.21}$$

exists for all $u \ge 0$. Then, the effective bandwidth function of $A(t)$ is defined as [11, 14]

$$\alpha(u) = \frac{\Lambda(u)}{u}, \quad \forall u > 0. \tag{8.22}$$

This becomes more evident if we assume the constant traffic $A(t) = A$ so that Equation
(8.21) gives

$$\Lambda(u) = \lim_{t \to \infty} \frac{1}{t} \log E\left[e^{uA(t)}\right] = \lim_{t \to \infty} \frac{1}{t} \log E\left[e^{uA}\right] = \lim_{t \to \infty} \frac{uA}{t} = u \lim_{t \to \infty} \frac{A}{t} = u\alpha(u) \tag{8.21a}$$

Consider now a queue of infinite buffer size served by a channel of constant service rate
$r$, such as an AWGN channel. Owing to the possible mismatch between $A(t)$ and $S(t)$, the
queue length $Q(t)$ could be nonzero. As already discussed in Section 8.1 [analogous to
Equation (8.5)] or by using the theory of large deviations [14], it can be shown that the
probability of $Q(t)$ exceeding a threshold $B$ satisfies

$$\sup_t \Pr\{Q(t) \ge B\} \sim e^{-\theta_B(r) B} \quad \text{as } B \to \infty \tag{8.23}$$

where $f(x) \sim g(x)$ means that $\lim_{x \to \infty} f(x)/g(x) = 1$. For smaller values of $B$,
the following approximation, analogous to Equation (8.3), is more accurate [15]:

$$\sup_t \Pr\{Q(t) \ge B\} \approx \gamma(r)\, e^{-\theta_B(r) B} \tag{8.24}$$

where both $\gamma(r)$ and $\theta_B(r)$ are functions of channel capacity $r$. According to the
theory, $\gamma(r) = \Pr\{Q(t) > 0\}$ is the probability that the buffer is nonempty for a randomly
chosen time $t$, while the QoS exponent $\theta_B$ is the solution of $\alpha(\theta_B) = r$. Thus,
the pair of functions $\{\gamma(r), \theta_B(r)\}$ models the source. Note that $\theta_B(r)$ is
simply the inverse function corresponding to the effective bandwidth function $\alpha(u)$.

If the quantity of interest is the delay $D(t)$ experienced by a source packet arriving at
time $t$, then with the same reasoning the probability of $D(t)$ exceeding a delay bound
$D_{\max}$ satisfies

$$\sup_t \Pr\{D(t) \ge D_{\max}\} \approx \gamma(r)\, e^{-\theta(r) D_{\max}} \tag{8.25}$$

where $\theta(r) = \theta_B(r) \times r$ [16]. Thus, the key point is that for a source modeled by
the pair $\{\gamma(r), \theta(r)\}$, which has a communication delay bound of $D_{\max}$, and can
tolerate a delay-bound violation probability of at most $\varepsilon$, the effective bandwidth
concept shows that the constant channel capacity should be at least $r$, where $r$ is the solution
to $\varepsilon = \gamma(r)\, e^{-\theta(r) D_{\max}}$. In terms of the traffic envelope
$\Gamma(t)$ (Figure 8.7), the slope $\lambda_s^{(s)} = r$ and $\sigma^{(s)} = r D_{\max}$.

In Section 8.1 a simple and efficient algorithm to estimate the source model functions
$\gamma(r)$ and $\theta(r)$ was discussed. In the following section, we use the duality between
traffic modeling $\{\gamma(r), \theta(r)\}$ and channel modeling to present an EC link model,
specified by a pair of functions $\{\gamma^{(c)}(\mu), \theta^{(c)}(\mu)\}$. The intention is to use
$\{\gamma^{(c)}(\mu), \theta^{(c)}(\mu)\}$ as the channel duals of the source functions
$\{\gamma(r), \theta(r)\}$. Just as the constant channel rate $r$ is used in source traffic
modeling, we use the constant source traffic rate $\mu$ in modeling the channel. Furthermore, we
adapt the source estimation algorithm from Section 8.1 to estimate the link model parameters
$\{\gamma^{(c)}(\mu), \theta^{(c)}(\mu)\}$.

8.2.2.2 Effective capacity link model 

Let $r(t)$ be the instantaneous channel capacity at time $t$. Define
$\tilde{S}(t) = \int_0^t r(\tau)\, d\tau$, which is the service provided by the channel. Note that
the channel service $\tilde{S}(t)$ is different from the actual service $S(t)$ received by the
source; $\tilde{S}(t)$ only depends on the instantaneous channel capacity and thus is independent
of the arrival $A(t)$. Paralleling the development of Equations (8.21) and (8.22) we assume that

$$\Lambda^{(c)}(-u) = \lim_{t \to \infty} \frac{1}{t} \log E\left[e^{-u\tilde{S}(t)}\right] \tag{8.26}$$

exists for all $u \ge 0$. This assumption is valid, for example, for a stationary Markov-fading
process $r(t)$. Then, the EC function of $r(t)$ is defined as

$$\alpha^{(c)}(u) = \frac{-\Lambda^{(c)}(-u)}{u}, \quad \forall u > 0 \tag{8.27}$$

Consider a queue of infinite buffer size supplied by a data source of constant data rate
$\mu$ (see Figure 8.6). The theory of effective bandwidth can be easily adapted to this case.
The difference is that, whereas in the previous case the source rate was variable while the
channel capacity was constant, now the source rate is constant while the channel capacity is
variable. Similar to Equation (8.25), it can be shown that the probability of $D(t)$ exceeding
a delay bound $D_{\max}$ satisfies

$$\sup_t \Pr\{D(t) \ge D_{\max}\} \approx \gamma^{(c)}(\mu)\, e^{-\theta^{(c)}(\mu) D_{\max}} \tag{8.28}$$

where $\{\gamma^{(c)}(\mu), \theta^{(c)}(\mu)\}$ are functions of source rate $\mu$. This
approximation is accurate for large $D_{\max}$, but we will see later in the simulation results
that it is also accurate for smaller values of $D_{\max}$.

For a given source rate $\mu$, $\gamma^{(c)}(\mu) = \Pr\{Q(t) > 0\}$ is again the probability that
the buffer is nonempty at a randomly chosen time $t$, while the QoS exponent
$\theta^{(c)}(\mu)$ is defined as $\theta^{(c)}(\mu) = \mu\, \alpha^{-1}(\mu)$, where
$\alpha^{-1}(\cdot)$ is the inverse function of $\alpha^{(c)}(u)$. Thus, the pair of functions
$\{\gamma^{(c)}(\mu), \theta^{(c)}(\mu)\}$ models the link.

So, if a link that is modeled by the pair $\{\gamma^{(c)}(\mu), \theta^{(c)}(\mu)\}$ is used, a
source that requires a communication delay bound of $D_{\max}$, and can tolerate a delay-bound
violation probability of at most $\varepsilon$, needs to limit its data rate to a maximum of
$\mu$, where $\mu$ is the solution to
$\varepsilon = \gamma^{(c)}(\mu)\, e^{-\theta^{(c)}(\mu) D_{\max}}$. In terms of the SC $\Psi(t)$
shown in Figure 8.7, the channel sustainable rate $\lambda_s^{(c)} = \mu$ and
$\sigma^{(c)} = D_{\max}$.

If the channel-fading process $r(t)$ is stationary and ergodic, then a simple algorithm
to estimate the functions $\{\gamma^{(c)}(\mu), \theta^{(c)}(\mu)\}$ is similar to the one
described in Section 8.1. Paralleling Equation (8.7) we have

$$\frac{\gamma^{(c)}(\mu)}{\theta^{(c)}(\mu)} = E[D(t)] = \tau_s(\mu) + \frac{E[Q(t)]}{\mu} \tag{8.29}$$

$$\gamma^{(c)}(\mu) = \Pr\{D(t) > 0\} \tag{8.30}$$

where $\tau_s(\mu)$ is the average remaining service time of a packet being served. Note that
$\tau_s(\mu)$ is zero for a fluid model (assuming infinitesimal packet size). Now, the delay
$D(t)$ is the sum of the delay incurred due to the packet already in service, and the delay in
waiting for the queue $Q(t)$ to clear, which results in Equation (8.29), using Little's theorem.
Substituting $D_{\max} = 0$ in Equation (8.28) results in Equation (8.30). As in Section 8.1,
solving Equation (8.29) for $\theta^{(c)}(\mu)$ gives, similarly to Equation (8.8b),

$$\theta^{(c)}(\mu) = \frac{\gamma^{(c)}(\mu) \times \mu}{\mu \times \tau_s(\mu) + E[Q(t)]} \tag{8.31}$$

According to Equations (8.30) and (8.31), as in Section 8.1, the functions $\gamma$ and $\theta$
can be estimated by estimating $\Pr\{D(t) > 0\}$, $\tau_s(\mu)$, and $E[Q(t)]$. The latter can be
estimated by taking a number of samples, say $N$, over an interval of length $T$, and recording
the following quantities at the $n$th sampling epoch: $S_n$, the indicator of whether a packet is
in service ($S_n \in \{0, 1\}$); $Q_n$, the number of bits in the queue (excluding the packet in
service); and $T_n$, the remaining service time of the packet in service (if there is one in
service). Based on the same measurements, as in Section 8.1,

$$\hat{\gamma} = \sum_{n=1}^{N} S_n/N, \qquad \hat{q} = \sum_{n=1}^{N} Q_n/N, \qquad \hat{\tau}_s = \sum_{n=1}^{N} T_n/N$$

are computed and then, from Equation (8.31), we have

$$\hat{\theta} = \frac{\hat{\gamma} \times \mu}{\mu \times \hat{\tau}_s + \hat{q}} \tag{8.32}$$

These parameters are used to predict the QoS by approximating Equation (8.28) with

$$\sup_t \Pr\{D(t) \ge D_{\max}\} \approx \hat{\gamma}\, e^{-\hat{\theta} D_{\max}} \tag{8.33}$$

If the ultimate objective of EC link modeling is to compute an appropriate SC $\Psi(t)$, then,
given the delay bound $D_{\max}$ and the target delay-bound violation probability $\varepsilon$ of
a connection, we can find $\Psi(t) = \{\sigma^{(c)}, \lambda_s^{(c)}\}$ by setting
$\sigma^{(c)} = D_{\max}$, solving Equation (8.33) for $\mu$ and setting $\lambda_s^{(c)} = \mu$.
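The estimator of Equations (8.32) and (8.33) can be sketched as follows. The function names and the tuple layout of the samples are illustrative assumptions; the source rate is taken in bits per second and the queue length in bits, so that the delay comes out in seconds.

```python
import math


def estimate_ec_link(samples, mu):
    """Estimate (gamma, theta) of the EC link model for source rate mu.

    samples: list of (S_n, Q_n, T_n) tuples taken at the sampling epochs.
    """
    n = len(samples)
    if n == 0:
        raise ValueError("no samples")
    gamma = sum(s for s, _, _ in samples) / n
    q = sum(qn for _, qn, _ in samples) / n
    tau_s = sum(t for _, _, t in samples) / n
    denom = mu * tau_s + q
    if denom == 0:
        raise ValueError("queue always empty; theta is undefined")
    theta = gamma * mu / denom                     # Equation (8.32)
    return gamma, theta


def delay_violation_probability(gamma, theta, d_max):
    """Predicted sup_t Pr{D(t) >= D_max} from Equation (8.33)."""
    return gamma * math.exp(-theta * d_max)
```

To obtain a service curve for a given pair of delay bound and violation probability, one would repeat the estimation over a grid of source rates and keep the largest rate whose predicted violation probability stays below the target; that search is not shown here.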



8.2.3 Physical layer vs link-layer channel model 

In Jakes' model of a Rayleigh flat-fading channel, the Doppler spectrum $S(f)$ is given as

$$S(f) = \frac{1.5}{\pi f_m \sqrt{1 - (F/f_m)^2}} \tag{8.34}$$

where $f_m$ is the maximum Doppler frequency, $f_c$ is the carrier frequency, and $F = f - f_c$.
Below we show how to calculate the EC for this channel [11].

Denote a sequence of $N$ measurements of the channel gain, spaced at a time interval
$\delta$ apart, by $\mathbf{x} = [x_0, x_1, \ldots, x_{N-1}]$, where $\{x_n, n \in [0, N-1]\}$ are
the complex-valued Gaussian distributed channel gains ($|x_n|$ are, therefore, Rayleigh
distributed). For simplicity, the constant noise variance will be included in the definition of
$x_n$. The measurement $x_n$ is a realization of a random variable sequence denoted by $X_n$,
which can be written as the vector $\mathbf{X} = [X_0, X_1, \ldots, X_{N-1}]$. The pdf of a random
vector $\mathbf{X}$ for the Rayleigh-fading channel is

$$f_{\mathbf{X}}(\mathbf{x}) = \frac{1}{\pi^N \det(\mathbf{R})}\, e^{-\mathbf{x}\mathbf{R}^{-1}\mathbf{x}^H} \tag{8.35}$$

where $\mathbf{R}$ is the covariance matrix of the random vector $\mathbf{X}$, $\det(\mathbf{R})$
the determinant of matrix $\mathbf{R}$, and $\mathbf{x}^H$ the conjugate transpose (Hermitian) of
$\mathbf{x}$. To calculate the EC, we start with



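$$
\begin{aligned}
E\left[e^{-u\tilde{S}(t)}\right] &= E\left[\exp\left(-u \int_0^t r(\tau)\, d\tau\right)\right] \\
&\overset{(a)}{\approx} \int \exp\left(-u \sum_{n=0}^{N-1} \delta \times r(\tau_n)\right) f_{\mathbf{X}}(\mathbf{x})\, d\mathbf{x} \\
&\overset{(b)}{=} \int \exp\left(-u \sum_{n=0}^{N-1} \delta \log(1 + |x_n|^2)\right) f_{\mathbf{X}}(\mathbf{x})\, d\mathbf{x} \\
&\overset{(c)}{=} \int \exp\left(-u \sum_{n=0}^{N-1} \delta \log(1 + |x_n|^2)\right) \frac{1}{\pi^N \det(\mathbf{R})}\, e^{-\mathbf{x}\mathbf{R}^{-1}\mathbf{x}^H}\, d\mathbf{x}
\end{aligned}
\tag{8.36}
$$

where (a) approximates the integral by a sum, (b) is the Shannon result for channel capacity
[i.e. $r(\tau_n) = \log(1 + |x_n|^2)$], and (c) is from Equation (8.35). This gives the EC,
Equation (8.27), as

$$\alpha^{(c)}(u) = -\frac{1}{u} \lim_{t \to \infty} \frac{1}{t} \log \int \exp\left(-u \sum_{n=0}^{N-1} \delta \log(1 + |x_n|^2)\right) \frac{1}{\pi^N \det(\mathbf{R})}\, e^{-\mathbf{x}\mathbf{R}^{-1}\mathbf{x}^H}\, d\mathbf{x} \tag{8.37}$$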



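Using the approximation (a) $\log(1 + |x_n|^2) \approx |x_n|^2$ for low SNR [11], Equation (8.37) can
be further simplified as

$$\begin{aligned}
E\left[e^{-uS(t)}\right] &\overset{(a)}{\approx} \int \exp\left(-u\delta\sum_{n=0}^{N-1}|x_n|^2\right) \frac{1}{\pi^N\det(\mathbf{R})}\, e^{-\mathbf{x}\mathbf{R}^{-1}\mathbf{x}^H}\,d\mathbf{x} \\
&\overset{(b)}{=} \int e^{-u\delta\|\mathbf{x}\|^2}\, \frac{1}{\pi^N\det(\mathbf{R})}\, e^{-\mathbf{x}\mathbf{R}^{-1}\mathbf{x}^H}\,d\mathbf{x} \\
&\overset{(c)}{=} \frac{1}{\pi^N\det(\mathbf{R})}\int e^{-\mathbf{x}(\mathbf{R}^{-1}+u\delta\mathbf{I})\mathbf{x}^H}\,d\mathbf{x} \\
&= \frac{\pi^N\det\left[(\mathbf{R}^{-1}+u\delta\mathbf{I})^{-1}\right]}{\pi^N\det(\mathbf{R})} = \frac{1}{\det(u\delta\mathbf{R}+\mathbf{I})}
\end{aligned} \qquad (8.38)$$

where step (b) follows from the definition of the norm of the vector $\mathbf{x}$, and (c) from the
relation $\|\mathbf{x}\|^2 = \mathbf{x}\mathbf{x}^H$ ($\mathbf{I}$ is the identity matrix). Reference [11] considers three cases of
interest for Equation (8.38).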



8.2.3.1 High mobility scenario

In the extreme high mobility (HM) case there is no correlation between the channel samples
and we have $\mathbf{R} = r\mathbf{I}$, where $r = E|x_n|^2$ is the average channel capacity. From Equation
(8.38), we have

$$E\left[e^{-uS(t)}\right] \approx \frac{1}{\det(u\delta\mathbf{R}+\mathbf{I})} = \frac{1}{(ur\delta+1)^N} = \frac{1}{(urt/N+1)^N} \qquad (8.39)$$

where $\delta = t/N$. As the number of samples $N \to \infty$, we have

$$\lim_{N\to\infty} E\left[e^{-uS(t)}\right] \approx \lim_{N\to\infty}(urt/N+1)^{-N} = e^{-urt} \qquad (8.40)$$



Thus, in the limiting case, the Rayleigh-fading channel reduces to an AWGN channel. Note
that this result would not apply at high SNRs because of the concavity of the log(·) function.

8.2.3.2 Stationary scenario

In this case all the samples are fully correlated, $\mathbf{R} = [R_{ij}] = [r]$. In other words all elements
of $\mathbf{R}$ are the same and Equation (8.38) now gives

$$E\left[e^{-uS(t)}\right] \approx \frac{1}{\det(u\delta\mathbf{R}+\mathbf{I})} = \frac{1}{ur\delta N+1} = \frac{1}{ur \times \frac{t}{N} \times N + 1} = \frac{1}{1+urt} \qquad (8.41)$$
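The two closed forms above can be checked numerically against the determinant in Equation (8.38). The following is a minimal sketch in plain Python, assuming real-valued covariance matrices and purely illustrative values of u, r, t and N; the helper names are hypothetical.

```python
import math

def det(matrix):
    """Determinant of a small real square matrix by Gaussian elimination."""
    a = [row[:] for row in matrix]
    n, d = len(a), 1.0
    for i in range(n):
        p = max(range(i, n), key=lambda k: abs(a[k][i]))  # partial pivoting
        if abs(a[p][i]) < 1e-15:
            return 0.0
        if p != i:
            a[i], a[p] = a[p], a[i]
            d = -d
        d *= a[i][i]
        for k in range(i + 1, n):
            f = a[k][i] / a[i][i]
            for j in range(i, n):
                a[k][j] -= f * a[i][j]
    return d

def mgf_low_snr(cov, u, delta):
    """E[exp(-u S(t))] ~ 1 / det(u*delta*R + I), as in Equation (8.38)."""
    n = len(cov)
    m = [[u * delta * cov[i][j] + (1.0 if i == j else 0.0) for j in range(n)]
         for i in range(n)]
    return 1.0 / det(m)

u, r, t, N = 0.5, 2.0, 1.0, 60          # illustrative values only
delta = t / N
# High mobility: R = r*I.  Stationary: every element of R equals r.
hm = mgf_low_snr([[r if i == j else 0.0 for j in range(N)] for i in range(N)], u, delta)
st = mgf_low_snr([[r] * N for _ in range(N)], u, delta)
```

For these values `hm` equals (1 + urt/N)^(-N), which is already close to exp(-urt), and `st` equals 1/(1 + urt).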



8.2.3.3 General case 

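Denote the eigenvalues of matrix $\mathbf{R}$ by $\{\lambda_n, n \in [0, N-1]\}$. Since $\mathbf{R}$ is symmetric, we
have $\mathbf{R} = \mathbf{U}\mathbf{D}\mathbf{U}^H$, where $\mathbf{U}$ is a unitary matrix, $\mathbf{U}^H$ is its Hermitian, and the diagonal matrix
$\mathbf{D} = \mathrm{diag}(\lambda_0, \lambda_1, \ldots, \lambda_{N-1})$. From Equation (8.38), we have

$$\begin{aligned}
E\left[e^{-uS(t)}\right] &\approx \frac{1}{\det(u\delta\mathbf{R}+\mathbf{I})} = \frac{1}{\det(u\delta\mathbf{U}\mathbf{D}\mathbf{U}^H+\mathbf{U}\mathbf{U}^H)} \\
&= \frac{1}{\det\left[\mathbf{U}\,\mathrm{diag}(u\delta\lambda_0+1,\, u\delta\lambda_1+1,\, \ldots,\, u\delta\lambda_{N-1}+1)\,\mathbf{U}^H\right]} \\
&= \frac{1}{\prod_n (u\delta\lambda_n+1)} = \exp\left(-\sum_n \log(u\delta\lambda_n+1)\right)
\end{aligned} \qquad (8.42)$$

We now use the calculated $E[e^{-uS(t)}]$ to get

$$\begin{aligned}
\Lambda^{(c)}(-u) &= \lim_{t\to\infty}\frac{1}{t}\log E\left[e^{-uS(t)}\right] \overset{(a)}{\approx} \lim_{t\to\infty}\frac{1}{t}\log\exp\left(-\sum_n \log(u\delta\lambda_n+1)\right) \\
&\overset{(b)}{=} \lim_{\Delta f\to 0} -\Delta f \sum_n \log\left(u\frac{\lambda_n}{B_w}+1\right) \overset{(c)}{=} -\int \log(uS(f)+1)\,df
\end{aligned} \qquad (8.43)$$

where (a) follows from Equation (8.42), (b) follows from the fact that the frequency interval
$\Delta f = 1/t$ and the bandwidth $B_w = 1/\delta$, and (c) from the fact that the power spectral density
$S(f) = \lambda_n/B_w$, and that the limit of a sum becomes an integral. This gives the EC, Equation
(8.27), as

$$\alpha^{(c)}(u) = \frac{\int \log(uS(f)+1)\,df}{u} \qquad (8.44)$$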



Thus, the Doppler spectrum allows us to calculate $\alpha^{(c)}(u)$. The EC function, Equation (8.44),
can be used to guarantee QoS using Equation (8.28).
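As an illustration of Equation (8.44), the sketch below integrates the Doppler spectrum of Equation (8.34) numerically. The substitution F = f_m sin(phi), the midpoint rule, and the function name are choices made here for the example, not part of the original derivation.

```python
import math

def jakes_effective_capacity(u, f_m, steps=20000):
    """Low-SNR EC of Equation (8.44) for the Doppler spectrum of Equation (8.34).

    With F = f_m*sin(phi): S = 1.5 / (pi*f_m*cos(phi)) and dF = f_m*cos(phi) dphi,
    which removes the integrable singularities at F = -f_m and F = +f_m.
    """
    h = math.pi / steps
    total = 0.0
    for k in range(steps):
        phi = -math.pi / 2 + (k + 0.5) * h       # midpoint of the k-th subinterval
        c = math.cos(phi)
        s = 1.5 / (math.pi * f_m * c)            # S(f) at this frequency
        total += math.log1p(u * s) * f_m * c * h
    return total / u
```

For very small u the result tends to the total power of the spectrum (1.5 with the normalization of Equation (8.34)), and it decreases as u grows, reflecting a stricter QoS exponent.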

One should keep in mind that the EC function, Equation (8.44), is valid only for a Rayleigh 
flat-fading channel, at low SNR. At high SNR, the EC for a Rayleigh-fading channel is 
specified by the complicated integral in Equation (8.37). To the best of our knowledge, a 
closed-form solution to Equation (8.37) does not exist. It is clear that a numerical calculation 
of EC is also very difficult, because the integral has a high dimension. Thus, it is difficult to 
extract QoS metrics from a physical-layer channel model, even for a Rayleigh flat-fading 
channel. The extraction may not even be possible for more general fading channels. In 
contrast, the EC link model that was described in this section can be easily translated into
QoS metrics for a connection, and we have shown a simple estimation algorithm to estimate
the EC model functions.

Figure 8.8 Queueing model used for simulations.



8.2.4 Performance examples 

The discrete-time system depicted in Figure 8.8 is simulated. The data source generates
packets at a constant rate $\mu$, which are first sent to the (infinite) buffer at the transmitter,
whose queue length is $Q_n$, where $n$ refers to the $n$th sample interval. The head-of-line packet
in the queue is transmitted over the fading channel at data rate $r_n$. The fading channel has
a random channel gain $x_n$ (the noise variance is absorbed into $x_n$). We use a fluid model,
that is, the size of a packet is infinitesimal. Perfect knowledge of the channel gain $x_n$ (the
SNR, really) at the transmitter side is assumed. Therefore, as described in Chapter 4, the
transmitter can use rate-adaptive transmissions and strong channel coding to transmit packets
without errors. Thus, the transmission rate $r_n$ is equal to the instantaneous (time-varying)
capacity of the fading channel, defined by the Shannon law, $r_n = B_c \log_2(1 + |x_n|^2)$, where $B_c$
is the channel bandwidth.

The average SNR is fixed in each simulation run. We define $r_g$ as the capacity of an
equivalent AWGN channel, which has the same average SNR, i.e. $r_g = B_c \log_2(1 + \mathrm{SNR}_{avg})$,
where $\mathrm{SNR}_{avg}$ is the average SNR, i.e. $E|x_n|^2$. Taking the ratio $r_n/r_g$ eliminates $B_c$ and gives

$$r_n = \frac{r_g \log_2(1 + |x_n|^2)}{\log_2(1 + \mathrm{SNR}_{avg})}$$

Simulation parameters as in Wu and Negi [11] were used. Channel samples $x_n$ are generated
by the following AR(1) (autoregressive) model: $x_n = \kappa x_{n-1} + v_n$, where the modeling
error $v_n$ is zero-mean complex Gaussian with unit variance per dimension and is statistically
independent of $x_{n-1}$. The coefficient $\kappa$ can be determined by the following procedure:
(1) compute the coherence time $T_c$ by $T_c \approx 9/(16\pi f_m)$, where the coherence time is defined as
the time over which the time autocorrelation function of the fading process is above 0.5; (2)
compute the coefficient $\kappa$ by $\kappa = 0.5^{T_s/T_c}$, where $T_s$ is the sampling interval. The other
parameters are: $f_m$ = 5-30 Hz, $r_g$ = 100 kb/s, average SNR = 0/15 dB, $T_s$ = 1 ms, bit rate
$\mu$ = 30-85 kb/s.

Figure 8.9 Estimated function $\gamma(\mu)$ vs source rate $\mu$ (horizontal axis: source rate in kb/s).
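The simulation setup described above can be sketched in a few lines of Python. This is only an illustration under stated assumptions: the scaling of the AR(1) output to the target average SNR and the function names are choices made here, since the text does not spell out how the gain is normalized.

```python
import math
import random

def ar1_coefficient(f_m, t_s):
    """kappa = 0.5**(T_s/T_c) with coherence time T_c = 9/(16*pi*f_m)."""
    t_c = 9.0 / (16.0 * math.pi * f_m)
    return 0.5 ** (t_s / t_c)

def simulate_queue(mu, f_m, snr_avg_db, r_g=100.0, t_s=1e-3, n=20000, seed=1):
    """Fluid queue fed at rate mu (kb/s) and served at the Shannon rate r_n (kb/s)."""
    rng = random.Random(seed)
    kappa = ar1_coefficient(f_m, t_s)
    snr_avg = 10.0 ** (snr_avg_db / 10.0)
    # Assumption: rescale so that E|x_n|^2 equals the target average SNR.
    # The stationary power of x_n = kappa*x_{n-1} + v_n with E|v_n|^2 = 2
    # is 2 / (1 - kappa^2).
    scale = math.sqrt(snr_avg * (1.0 - kappa ** 2) / 2.0)
    x = complex(rng.gauss(0, 1), rng.gauss(0, 1)) / math.sqrt(1.0 - kappa ** 2)
    q, q_samples = 0.0, []
    for _ in range(n):
        x = kappa * x + complex(rng.gauss(0, 1), rng.gauss(0, 1))
        gain = abs(scale * x) ** 2
        r_n = r_g * math.log2(1.0 + gain) / math.log2(1.0 + snr_avg)
        q = max(0.0, q + (mu - r_n) * t_s)       # queue length in kb
        q_samples.append(q)
    return q_samples
```

Because the same seed reproduces the same channel realization, runs at different source rates can be compared sample by sample; the queue for the higher rate is never shorter than the queue for the lower rate.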

Figures 8.9 and 8.10 show the estimated EC functions $\gamma(\mu)$ and $\theta(\mu)$. As the source
rate $\mu$ increases from 30 to 85 kb/s, $\gamma(\mu)$ increases, indicating a higher buffer occupancy,
while $\theta(\mu)$ decreases, indicating a slower decay of the delay-violation probability. Thus,
the delay-violation probability is expected to increase with increasing source rate $\mu$. From
Figure 8.10, we also observe that SNR has a substantial impact on $\gamma(\mu)$. This is because
higher SNR results in larger channel capacity, which leads to smaller probability that a
packet will be buffered, i.e. smaller $\gamma(\mu)$. In contrast, Figure 8.9 shows that $f_m$ has little
effect on $\gamma(\mu)$.

Figure 8.10 Estimated function $\theta(\mu)$ vs source rate $\mu$.

Figure 8.11 Actual delay-violation probability vs $D_{\max}$ (ms) for various Doppler rates: (a)
Rayleigh fading and (b) Ricean fading, $K = 3$.

Figure 8.11 shows the actual delay-violation probability $\sup_t \Pr\{D(t) \ge D_{\max}\}$ vs the
delay bound $D_{\max}$, for various Doppler rates. It can be seen that the actual delay-violation
probability decreases exponentially with the delay bound $D_{\max}$, for all the cases. This justifies
the use of an exponential bound, Equation (8.33), in predicting QoS, thereby justifying
the link model $\{\gamma, \theta\}$. The figure shows that the delay-violation probability decreases as the
Doppler rate increases. This is reasonable since the increase of the Doppler rate leads to the
increase of time diversity, resulting in a larger decay rate $\theta(\mu)$ of the delay-violation probability.
More details on the topic can be found in References [11-17].

9

Adaptive TCP Layer 



9.1 INTRODUCTION 

In this section we first discuss TCP performance independent of the type of network by 
considering the different possible characteristics of the connection path. We present the 
problems and the different possible solutions. This study permits us to understand the 
limitations of the actual solutions and the required modifications to let TCP cope with a 
heterogeneous Internet on an end-to-end basis. Then, in the rest of the chapter we focus on 
the specifics of TCP operation in wireless networks. 

The TCP provides a reliable connection-oriented in-order service to many of today's 
Internet applications. Given the simple best-effort service provided by IP, TCP must cope 
with the different transmission media crossed by Internet traffic. This mission of TCP is 
becoming difficult with the increasing heterogeneity of the Internet. High-speed links (optical
fibers), long and variable delay paths (satellite links), lossy links (wireless networks) and
asymmetric paths (hybrid satellite networks) are becoming widely embedded in the Internet.
Many studies have examined the performance of TCP in this new environment by
experimentation [1], analytical modeling [2] and simulation [3, 4, 12, 17, 18].

Most of these works have focused on a particular environment (satellite networks, mo- 
bile networks, etc.). They have revealed some problems in the operation of TCP. Long 
propagation delay and losses on a satellite link, handover and fading in a wireless net- 
work, bandwidth asymmetry in some media, and other phenomena have been shown to 
seriously affect the throughput of a TCP connection. A large number of solutions have been 
proposed. Some solutions suggest modifications to TCP to help it to cope with these new 
paths. Other solutions keep the protocol unchanged and hide the problem from TCP. In this 
section we consider the different characteristics of a path crossed by TCP traffic, focusing 
on bandwidth-delay product (BDP), RTT, noncongestion losses and bandwidth asymmetry. 
TCP is a reliable window-based ACK-clocked flow control protocol. It uses an additive-increase
multiplicative-decrease strategy for changing its window as a function of network
conditions. Starting from one packet, or a larger value as we will see later, the window is
increased by one packet for every nonduplicate ACK until the source estimate of network
propagation time (npt) is reached. By the propagation time of the network, sometimes called
the pipe size, we mean the maximum number of packets that can fit on the path, which is
also referred to as network capacity. This is the slow start (SS) phase, and the npt estimate
is called the SS threshold (sst). SS aims to alleviate the burstiness of TCP while quickly
filling the pipe. Once sst is reached, the source switches to a slower increase in the window
by one packet for every window's worth of ACKs. This phase, called congestion avoidance
(CA), aims to slowly probe the network for any extra bandwidth. The window increase
is interrupted when a loss is detected. Two mechanisms are available for the detection of
losses: the expiration of a retransmission timer (timeout) or the receipt of three duplicate
ACKs (fast retransmit, FRXT). The source then assumes that the network is congested and
sets its estimate of the sst to half the current window.

Tahoe, the first version of TCP to implement congestion control, at this point sets the 
window to one packet and uses SS to reach the new sst. Slow starting after every loss 
detection deteriorates the performance given the low bandwidth utilization during SS. When 
the loss is detected via timeout, SS is unavoidable since the ACK clock has stopped and SS 
is required to smoothly fill the pipe. However, in the FRXT case, ACKs still arrive at the 
source and losses can be recovered without SS. This is the objective of the new versions
of TCP (Reno, New Reno, SACK, etc.), discussed in this chapter, that call a fast recovery
(FRCV) algorithm to retransmit the losses while maintaining enough packets in the network
to preserve the ACK clock. Once losses are recovered, this algorithm ends and normal CA
is called. If FRCV fails, the ACK stream stops, a timeout occurs, and the source resorts to 
SS as with Tahoe. 
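The window dynamics described above (SS up to sst, then CA, then a restart from one packet after a loss) can be sketched per RTT as follows. This is an idealized illustration, not a protocol implementation: a loss is assumed whenever the window exceeds a fixed pipe size, and the lower bound of two packets on sst is a common convention added here.

```python
def tahoe_window_trace(rtts, pipe_size, initial_sst):
    """Per-RTT congestion window (packets) of an idealized TCP Tahoe source.

    A loss is assumed whenever the window exceeds pipe_size (a simplification).
    """
    cwnd, sst, trace = 1.0, float(initial_sst), []
    for _ in range(rtts):
        trace.append(cwnd)
        if cwnd > pipe_size:               # loss detected
            sst = max(cwnd / 2.0, 2.0)     # threshold = half the current window
            cwnd = 1.0                     # Tahoe restarts from slow start
        elif cwnd < sst:
            cwnd = min(2.0 * cwnd, sst)    # SS: +1 per ACK doubles the window each RTT
        else:
            cwnd += 1.0                    # CA: +1 per window's worth of ACKs
    return trace

trace = tahoe_window_trace(rtts=40, pipe_size=20, initial_sst=16)
```

The trace shows the familiar sawtooth: exponential growth to sst, linear growth to just above the pipe size, and a drop back to one packet. A Reno-style source would instead resume from roughly half the window after a loss detected by FRXT.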



9.1.1 A large bandwidth-delay product 

The increase in link speed, as in optical fibers, has led to paths of large BDP. The TCP window
must be able to reach large values in order to efficiently use the available bandwidth. Large
windows, up to $2^{30}$ bytes, are now possible. However, at large windows, congestion may
lead to the loss of many packets from the same connection. Efficient FRCV is then required 
to correct many losses from the same window. Also, at large BDP, network buffers have 
an important impact on performance. These buffers must be well dimensioned and scale 
with the BDP. Fast recovery uses the information carried by ACKs to estimate the number 
of packets in flight while recovering from losses. New packets are sent if this number falls 
below the network capacity estimate. The objective is to preserve the ACK clock in order to 
avoid the timeout. The difference between the different versions of TCP is in the estimation 
of the number of packets in flight during FRCV. All these versions will be discussed later in 
more detail. Reno considers every duplicate ACK a signal that a packet has left the network. 
The problem of Reno is that it leaves FRCV when an ACK for the first loss in a window 
is received. This prohibits the source from detecting the other losses with FRXT. A long 
timeout is required to detect the other losses. New Reno overcomes this problem. The idea 
is to stay in FRCV until all the losses in the same window are recovered. Partial ACKs are 
used to detect multiple losses in the same window. This avoids timeout but cannot result 
in a recovery faster than one loss per RTT. The source needs to wait for the ACK of the 
retransmission to discover the next loss. Another problem of Reno and New Reno is that
they rely on ACKs to estimate the number of packets in flight. ACKs can be lost on the
return path, which results in an underestimation of the number of packets that have left 
the network, and thus an underutilization of the bandwidth during FRCV and, in the case 
of Reno, a possible failure of FRCV. More information is needed at the source to recover 
faster than one loss per RTT and to estimate more precisely the number of packets in the 
pipe. This information is provided by selective ACK (SACK), a TCP option containing the 
three blocks of contiguous data most recently received at the destination. Many algorithms 
have been proposed to use this information during FRCV. TCP-SACK may use ACKs to 
estimate the number of packets in the pipe and SACKs to retransmit more than one loss 
per RTT. This leads to an important improvement in performance when bursts of losses 
appear in the same window, but the recovery is always sensitive to the loss of ACKs. As 
a solution, forward ACK (FACK) may be used, which relies on SACK in estimating the 
number of packets in the pipe. The number and identity of packets to transmit during FRCV
are decoupled from the ACK clock, in contrast to TCP-SACK, where only the identity is
decoupled.



9.1.2 Buffer size

SS results in bursts of packets sent at a rate exceeding the bottleneck bandwidth. When 
the receiver acknowledges every data packet, the rate of these bursts is equal to twice the 
bottleneck bandwidth. If network buffers are not well dimensioned, they will overflow early 
during SS before reaching the network capacity. This will result in an underestimation of 
the available bandwidth and a deterioration in TCP performance. Early losses during SS 
were first analyzed in Lakshman and Madhow [2]. The network is modeled with a single
bottleneck node of bandwidth $\mu$, buffer $B$, and two-way propagation delay $T$, as shown in
Figure 9.1.

Figure 9.1 One-hop TCP operation.

A long TCP-Tahoe connection is considered where the aim of SS is to reach sst quickly and
without losses; sst is equal to half the pipe size, $(B + \mu T)/2$. In the case of a
receiver that acknowledges every data packet, they found that a buffer $B > \mathrm{BDP}/3 = \mu T/3$
is required. Their analysis can be extended to an SS phase with a different threshold, mainly
to that at the beginning of the connection, where sst is set to a default value. As an example,
the threshold can be set at the beginning of the connection to the BDP in order to switch to
CA before the occurrence of losses. This will not work if the buffer is smaller than half
the BDP ($\mu T/2$). In Barakat and Altman [5] the problem of early buffer overflow
during SS for multiple routers was studied. It was shown that, due to the high rate at which
packets are sent during SS, queues can build up in routers preceding the bottleneck as well.
Buffers in these routers must also be well dimensioned, otherwise they overflow during 
SS and limit the performance even though they are faster than the bottleneck. With small 
buffers, losses during SS are not a signal of network congestion, but rather of transient 
congestion due to the bursty nature of SS traffic. Now in CA, packets are transmitted at 
approximately the bottleneck bandwidth. A loss occurs when the window reaches the pipe 
size. The source divides its window by two and starts a new cycle. To always get a throughput 
approximately equal to the bottleneck bandwidth, the window after reduction must be larger 
than the BDP. This requires a buffer B larger than the BDP. Note that we are talking about 
drop tail buffers, which start to drop incoming packets when the buffer is full. Active buffers 
such as random early detection (RED) [3] start to drop packets when the average queue 
length exceeds some threshold. When an RED buffer is crossed by a single connection, the 
threshold should be larger than the BDP to get good utilization. This conflicts with one of the
aims of RED: limiting the size of queues in network nodes in order to reduce end-to-end
delay. For multiple connections, a lower threshold is sufficient given that a small number of
connections reduce their windows upon congestion, in contrast to drop tail buffers, where
often all the connections reduce their windows simultaneously.
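The buffer sizes quoted in this section follow from two simple conditions, sketched below. The first function assumes, as in the analysis summarized above for a receiver that acknowledges every packet, that the queue during SS peaks at about half the window; the function names and the numerical value of the BDP are illustrative only.

```python
def min_buffer_slow_start(sst):
    """Smallest drop-tail buffer (packets) that avoids overflow before sst is reached.

    Assumes every packet is ACKed, so SS bursts arrive at twice the bottleneck
    rate and the queue peaks at about half the window.
    """
    return sst / 2.0

def tahoe_ss_buffer(bdp):
    """Solve B = sst/2 with sst = (B + BDP)/2, which gives B = BDP/3."""
    return bdp / 3.0

def min_buffer_congestion_avoidance(bdp):
    """Buffer for which the halved window (B + BDP)/2 still covers the BDP."""
    return bdp

bdp = 120.0   # packets, illustrative value of mu*T
```

For a BDP of 120 packets this gives 40 packets for the Tahoe threshold (B + BDP)/2, 60 packets when sst is set to the BDP, and 120 packets to keep the link full throughout CA.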

9.1.3 Round-trip time 

Long RTTs are becoming an issue with the introduction of satellite links into the Internet. A 
long RTT reduces the rate at which the window increases, which is a function of the number 
of ACKs received and does not account for the RTT. This poses many problems to the 
TCP. First, it increases the duration of SS, which is a transitory phase designed to quickly 
but smoothly fill the pipe. Given the low bandwidth utilization during SS, this deteriorates 
the performance of TCP transfers, particularly short ones (e.g. Web transfers). Second, it 
causes unfairness in the allocation of the bottleneck bandwidth. Many works have shown
the bias of TCP against connections with long RTTs [2]. Small RTT connections increase
their rates more quickly and grab most of the available bandwidth. The average throughput
of a connection has been shown to vary as the inverse of $T^{\alpha}$, where $\alpha$ is a factor between 1
and 2 [2].
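Both effects of a long RTT can be put into numbers with a short sketch. It assumes that the window doubles once per RTT during SS (one packet per ACK, every packet acknowledged); the function names and the example RTTs are illustrative.

```python
import math

def slow_start_duration(rtt, target_window, initial_window=1):
    """Approximate time (s) for SS to grow the window to target_window packets,
    assuming the window doubles once per RTT."""
    rounds = math.ceil(math.log2(target_window / initial_window))
    return max(rounds, 0) * rtt

def throughput_ratio(rtt_short, rtt_long, alpha):
    """Throughput of the short-RTT connection relative to the long-RTT one,
    under the 1/T**alpha law quoted above (alpha between 1 and 2)."""
    return (rtt_long / rtt_short) ** alpha
```

For example, reaching a 64-packet window takes about 0.3 s on a 50 ms path but about 3 s on a 500 ms satellite path, and with alpha = 2 the short-RTT connection would obtain roughly 100 times the throughput of the long-RTT one at a shared bottleneck.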

Many solutions have been proposed to reduce the time taken by SS on long delay 
paths. These solutions can be divided into three categories: (1) change the window increase 
algorithm of TCP; (2) solve the problem at the application level; or (3) solve it inside the 
network. On the TCP level the first proposition was to use a larger window than one packet 
at the beginning of SS. An initial window of maximum four packets has been proposed. 
Another proposition, called byte counting, was to account for the number of bytes covered 
by an ACK while increasing the window rather than the number of ACKs. To avoid long 
bursts in the case of large gaps in the ACK stream, a limit on the maximum window increase 
has been proposed (limited byte counting). These solutions try to solve the problem while 
preserving the ACK clock. They result in an increase in TCP burstiness and an overload on 
network buffers. Another type of solution tries to solve the problem by introducing some 
kind of packet spacing (e.g. rate-based spacing). The source transmits directly at a large 
window without overloading the network. Once the large window is reached, the ACK clock 
takes over. This lets the source avoid a considerable part of SS. The problem can be solved 
at the application level without changing the TCP. A possible solution (e.g. XFTP), consists 
of establishing many parallel TCP connections for the same transfer. This accelerates the
growth of the resultant window, but increases the aggressiveness of the transfer and hence
the losses in the network. An adaptive mechanism has been proposed for XFTP to change
the number of connections as a function of network congestion. Another solution has been
proposed to accelerate the transfer of web pages. Instead of using an independent TCP
connection to fetch every object in a page, the client establishes a persistent connection and
asks the server to send all the objects on it (hypertext transfer protocol, HTTP). Only the
first object suffers from the long SS phase; the remaining objects are transferred at a high
rate. The low throughput during SS is compensated for by the long time remaining in CA.

Figure 9.2 Spoofing: elimination of the long delay link from the feedback loop.
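The difference between ACK counting, byte counting and limited byte counting, as described earlier in this section, can be made concrete with a small sketch. The function name, the mode labels and the default limit of two segments are illustrative assumptions.

```python
def window_increase(acked_bytes, mss, mode="ack", limit_segments=2):
    """Slow-start window increment (in segments) triggered by one ACK.

    mode="ack":     one segment per ACK, regardless of how much data it covers.
    mode="bytes":   byte counting, one segment per MSS of newly acknowledged data.
    mode="limited": byte counting capped at limit_segments per ACK.
    """
    if mode == "ack":
        return 1.0
    covered = acked_bytes / mss
    if mode == "bytes":
        return covered
    if mode == "limited":
        return min(covered, float(limit_segments))
    raise ValueError("unknown mode: " + mode)
```

An ACK that covers four segments (for example after three ACKs were lost) increases the window by one segment with ACK counting, by four with byte counting, and by two with the limited variant, which bounds the size of the resulting burst.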

The problem can also be solved inside the network rather than at hosts, which is worthwhile
when a long delay link is located on the path. In order to decrease the RTT, the long
delay link is eliminated from the feedback loop by acknowledging packets at the input of
this link (A in Figure 9.2). Packets are then transmitted on the long delay link using an
optimized transport protocol (e.g. STP, described in Henderson and Katz [1]).

This transport protocol is tuned to quickly increase its transmission rate without the need 
for a long SS. Once arriving at the output (B), another TCP connection is used to transmit the 
packets to the destination. In a satellite environment, the long delay link may lead directly 
to the destination, so another TCP connection is not required. Because packets have already 
been acknowledged, any loss between the input of the link (A) and the destination must be
locally retransmitted on behalf of the source. Also, ACKs from the receiver must be discarded
silently (at B) so as not to confuse the source. This approach is called TCP spoofing. The
main gain in performance comes from not using SS on the long delay link. The window 
increases quickly, which improves performance, but spoofing still has many drawbacks. 
First, it breaks the end-to-end semantics of TCP; a packet is acknowledged before reaching 
its destination. Also, it does not work when encryption is accomplished at the IP layer, 
and it introduces a heavy overload on network routers. Further, the transfer is vulnerable to 
path changes, and symmetric paths are required to be able to discard the ACKs before they 
reach the source. Spoofing can be seen as a particular solution to some long delay links.
It is attractive when the long delay link is the last hop to the destination. This solution is
often used in networks providing high-speed access to the Internet via geostationary earth
orbit (GEO) satellite links.

9.1.4 Unfairness problem at the TCP layer

One way to solve the problem is to act at the TCP level by accelerating the window
growth for long RTT connections. An example of a TCP-level solution is the constant rate
algorithm. The window is increased in CA by an amount proportional to $(\mathrm{RTT})^2$ per round
trip, rather than by one packet. The result is a constant increase rate of the throughput
regardless of RTT, thus better fairness. The first problem in this proposition is the choice of
the increase rate. Also, accelerating window growth while preserving the ACK clock results in
large bursts for long RTT connections.
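A minimal sketch of the constant rate idea, assuming the window gain per round trip is c times RTT squared, where the constant c (in packets/s per second) is a hypothetical tuning parameter. Since throughput is window/RTT and one round trip lasts one RTT, the throughput growth per second is the window gain per round trip divided by RTT squared.

```python
def ca_increment_per_ack(cwnd, rtt, c=None):
    """Window increment (packets) applied for each ACK in congestion avoidance.

    c=None: standard TCP, +1 packet per RTT, i.e. 1/cwnd per ACK.
    c>0:    constant-rate policy, +c*rtt**2 packets per RTT.
    """
    per_rtt = 1.0 if c is None else c * rtt ** 2
    return per_rtt / cwnd

def throughput_growth(rtt, c=None):
    """Growth of throughput in packets/s per second:
    (window gain per RTT) / rtt**2."""
    per_rtt = 1.0 if c is None else c * rtt ** 2
    return per_rtt / rtt ** 2
```

With standard CA a 50 ms connection ramps up 100 times faster than a 500 ms connection, whereas with the constant-rate increment both ramp up at the same rate c. The price is a window gain of c*RTT^2 packets per round trip, which explains the large bursts on long RTT paths.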

Inside the network, fairness is improved by isolating the different connections from 
each other. Given that congestion control in TCP is based on losses, isolation means that 
a congested node must manage its buffer to distribute drops on the different connections 
in such a way that they get the same throughput. Many buffer management policies have 
been proposed. Some of these policies, such as RED (random early detection) [3], drop 
incoming packets with a certain probability when the queue length or its average exceeds 
a certain threshold. This distributes losses on the different connections proportionally to 
their throughput without requiring any per-connection state. However, dropping packets in 
proportion to the throughput does not always lead to fairness, especially if the bottleneck 
is crossed by unresponsive traffic. With a first-in first-out (FIFO) scheduler, the connection 
share of the bandwidth is proportional to its share of the buffer. Better fairness requires 
control of the buffer occupancy of each connection. Another set of policies, known as Flow 
RED, try to improve fairness by sharing the buffer space fairly between active connections. 
This ensures that each connection has at least a certain number of places in the queue, which 
isolates connections sending at small rates from aggressive ones. This improves fairness, 
but at the same time increases buffer management overhead over a general drop policy such 
as RED. 
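The drop decision of a RED-style buffer can be sketched as follows. The linear ramp between two thresholds and the exponentially weighted average follow the usual description of RED; the class name, parameter names and default values are illustrative, and refinements such as spacing drops by a packet counter are omitted.

```python
import random

class RedQueue:
    """Drop decision of a RED-style buffer (simplified sketch)."""

    def __init__(self, min_th, max_th, max_p=0.1, weight=0.002, rng=None):
        self.min_th, self.max_th = min_th, max_th
        self.max_p, self.weight = max_p, weight
        self.avg = 0.0                       # average queue length (packets)
        self.rng = rng or random.Random(0)

    def drop_probability(self):
        if self.avg < self.min_th:
            return 0.0
        if self.avg >= self.max_th:
            return 1.0
        return self.max_p * (self.avg - self.min_th) / (self.max_th - self.min_th)

    def on_arrival(self, queue_len):
        """Update the average queue length; return True if the packet is dropped."""
        self.avg += self.weight * (queue_len - self.avg)   # EWMA of the queue length
        return self.rng.random() < self.drop_probability()
```

Because every arriving packet is dropped with the same probability, a connection sending twice as many packets sees on average twice as many drops, which is the proportionality to throughput mentioned above. No per-connection state is kept, in contrast to Flow RED.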

The problem of fairness has additional dimensions. Solving the problem at the TCP 
level has the advantage of keeping routers simple, but it is not enough given the prevalence 
of non-TCP-friendly traffic. Some mechanisms in network nodes are required to protect 
conservative TCP flows from aggressive ones. Network mechanisms are also required to 
ensure fairness at a level below or above TCP, say at the user or application level. A user 
(e.g. running XFTP) may be unfairly aggressive and establish many TCP connections in 
order to increase its share of the bandwidth. The packets generated by this user must be 
considered by the network as a single flow. This requires an aggregation in flows of TCP 
connections. The level of aggregation determines the level of fairness we want. Again this 
approach requires an additional effort. 

9.1.5 Noncongestion losses

TCP considers the loss of packets as a result of network congestion and reduces its window
accordingly. This results in severe throughput deterioration when packets are lost
for reasons other than congestion. In wireless communications noncongestion losses are
mostly caused by transmission errors. A packet may be corrupted while crossing a poor-quality
radio link. The solutions proposed to this problem can be divided into two main
categories. The first consists of hiding the lossy parts of the Internet so that only congestion
losses are detected at the source. The second type of solution consists of enhancing TCP
with some mechanisms to help it to distinguish between different types of losses. When
noncongestion losses are hidden, they are recovered locally without the intervention of
the source. This can be accomplished at the link or TCP level. Two well-known mechanisms
are used as link-level solutions to improve the link quality: ARQ and FEC. These
mechanisms are discussed in Chapters 2 and 4.

TCP-level solutions try to improve link quality by retransmitting packets at the TCP 
level rather than at the link level. A TCP agent in the router at the input of the lossy link 
keeps a copy of every data packet. It discards this copy when it sees the ACK of the packet, 
and it retransmits the packet on behalf of the source when it detects a loss. This technique 
has been proposed for terrestrial wireless networks where the delay is not so important as 
to require the use of FEC. The TCP agent is placed in the base station at the entry of the 
wireless network. Two possible implementations of this agent exist. 

The first implementation, referred to as indirect TCP, consists of terminating the origi- 
nating TCP connection at the entry of the lossy link. The agent acknowledges the packets 
and takes care of handing them to the destination. A TCP connection well tuned to a lossy 
environment (e.g. TCP-SACK) can be established across the lossy network. A different 
transport protocol can also be used. This solution breaks the end-to-end semantics of the 
Internet. Also, it causes difficulties during handover since a large state must be transferred 
between base stations. The second implementation (Snoop protocol) respects the end-to- 
end semantics. The intermediate agent does not terminate the TCP connection; it just keeps 
copies of data packets and does not generate any artificial ACK. Nonduplicate ACKs sent 
by the destination are forwarded to the source. Duplicate ACKs are stopped. A packet is re- 
transmitted locally when three duplicate ACKs are received or a local timeout expires. This 
local timeout is set, of course, to a value less than that of the source. As in the link-level case,
interference may happen between the source and agent mechanisms. In fact, this solution
is nothing other than link-level recovery implemented at the TCP level. Again, because it hides
all losses, congestion losses must not occur between the Snoop agent and the destination.
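The behaviour of the Snoop-style agent described above can be sketched as a small state machine. This is a simplified illustration: packets are identified by sequence number, an ACK carries the number of the next packet expected by the receiver, the local timeout is omitted, and all names are hypothetical.

```python
class SnoopAgent:
    """Simplified Snoop-style agent at the base station (timers omitted)."""

    DUP_THRESHOLD = 3

    def __init__(self):
        self.cache = {}        # seq -> packet awaiting acknowledgement
        self.last_ack = -1     # highest cumulative ACK seen so far
        self.dup_count = 0

    def on_data(self, seq, packet):
        """Cache a copy of the data packet, then forward it to the wireless link."""
        self.cache[seq] = packet
        return ("forward_to_receiver", seq)

    def on_ack(self, ack):
        """ack = sequence number of the next packet expected by the receiver."""
        if ack > self.last_ack:                        # new ACK: clean cache, pass on
            for seq in [s for s in self.cache if s < ack]:
                del self.cache[seq]
            self.last_ack, self.dup_count = ack, 0
            return ("forward_to_source", ack)
        self.dup_count += 1                            # duplicate ACK: hide it
        if self.dup_count == self.DUP_THRESHOLD and ack in self.cache:
            return ("retransmit_locally", ack)
        return ("suppress", ack)
```

The source never sees the duplicate ACKs, so it does not invoke FRXT or reduce its window for a loss that the agent repairs locally.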

9.1.6 End-to-end solutions 

The addition of some end-to-end mechanisms to improve TCP reaction to noncongestion 
losses should further improve performance. Two approaches exist in the literature. The first 
consists of explicitly informing the source of the occurrence of a noncongestion loss via an 
explicit loss notification (ELN) signal. The source reacts by retransmitting the lost packet 
without reducing its window. An identical signal has been proposed to halt congestion 
control at the source when a disconnection appears due to handover in a cellular network. 
The difficulty with such a solution is that a packet corrupted at the link level is discarded 
before reaching TCP, and then it is difficult to get this information. The second approach is to 
improve the congestion control provided by TCP rather than recovery from noncongestion 
losses. We mention it here because it constitutes a step toward a solution to the problem of
losses on an end-to-end basis.

The proposed solutions aim to decouple congestion detection from losses. With some 
additional mechanisms in the network or at the source, the congestion is detected and the 
throughput reduced before the overflow of network buffers. Examples, which will be
discussed later in more detail, are the Vegas version of TCP [4] and the explicit congestion
notification (ECN) proposal. In Vegas, the RTT of the connection and the window size are used
to compute the number of packets in network buffers. The window is decreased when this
number exceeds a certain threshold. With ECN, an explicit signal is sent by the routers to
indicate congestion to TCP sources rather than dropping packets. If all the sources, receivers
and routers are compliant (according to Vegas or ECN), congestion losses will considerably
decrease. The remaining losses could be considered to be caused mostly by problems 
other than congestion. Given that noncongestion losses require only retransmission without 
window reduction, the disappearance of congestion losses may lead to the definition at the 
source of a new congestion control algorithm which reacts less severely to losses. This 
ideal behaviour does not exist in today's networks. In the absence of any feedback from the
network as with Vegas, the congestion detection mechanism at the source may fail; here, 
congestion losses are unavoidable. If the source bases its congestion control on explicit 
information from the network as with ECN, some noncompliant routers will not provide 
the source with the required information, dropping packets instead. A reduction of the 
window is necessary in this case. For these reasons, these solutions still consider losses as 
congestion signals and reduce their windows accordingly. 
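The Vegas idea described above can be illustrated with a minimal sketch. It assumes the commonly cited Vegas estimate, in which the backlog is the difference between the expected rate (window divided by the smallest observed RTT) and the actual rate (window divided by the current RTT), scaled by the smallest RTT. The function names, the thresholds `alpha` and `beta`, and the one-packet adjustment step are illustrative assumptions, not values taken from the text.

```python
def vegas_backlog(window, base_rtt, rtt):
    """Estimate how many of the connection's packets sit in network buffers.

    window   -- current window in packets
    base_rtt -- smallest RTT observed so far (assumed to be the queue-free RTT)
    rtt      -- most recent RTT measurement
    """
    expected_rate = window / base_rtt   # rate if no packet were queued
    actual_rate = window / rtt          # rate actually achieved
    return (expected_rate - actual_rate) * base_rtt


def vegas_adjust(window, base_rtt, rtt, alpha=1.0, beta=3.0):
    """Return the new window; alpha and beta are assumed thresholds in packets."""
    backlog = vegas_backlog(window, base_rtt, rtt)
    if backlog > beta:       # too many packets queued: back off before buffers overflow
        return window - 1
    if backlog < alpha:      # almost nothing queued: probe for more bandwidth
        return window + 1
    return window            # backlog within the target band
```

With a window of 20 packets, a base RTT of 100 ms and a measured RTT of 125 ms, the estimated backlog is about four packets, so the sketch reduces the window by one without waiting for a loss.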

9.1.7 Bandwidth asymmetry 

The previous discussion showed that TCP uses the ACK clock to predict what 
is happening inside the network. It assumes implicitly that the reverse channel has enough 
bandwidth to convey ACKs without being disturbed. This is almost true with the so-called 
symmetric networks where the forward and the reverse directions have the same bandwidth. 
However, some of today's networks (e.g. direct broadcast satellite, 4G cellular networks 
and asymmetric digital subscriber loop networks) tend to increase capacity in the forward 
direction, whereas a low-speed channel is used to carry ACKs back to the source. Even if 
ACKs are smaller in size than data packets, the reverse channel is unable to carry the high 
rate of ACKs. The result is congestion and losses on the ACK channel. This congestion 
increases the RTT of the connection and causes loss of ACKs. The increase in RTT reduces 
throughput and increases end-to-end delay. Also, it slows window growth, which further 
impairs performance when operating on a long delay path or in a lossy environment. The 
loss of ACKs disturbs one of the main functionalities of the ACK clock: smoothing the 
transmission. The window slides quickly upon receipt of an ACK covering multiple lost 
ACKs, and a burst of packets is sent, which may overwhelm the network buffers in the 
forward direction. Also, the loss of ACKs slows down the growth of the congestion window, 
which results in poor performance for long delay paths and lossy links. The proposed 
solutions to this problem can be divided into receiver-side solutions, which try to solve the 
problem by reducing the congestion on the return path, and source-side solutions, which 
try to reduce TCP burstiness. The first receiver-side solution is to compress the headers of 
TCP/IP packets on a slow channel to increase its capacity in terms of ACKs per unit of time 
(e.g. SLIP header compression). It profits from the fact that most of the information in a 
TCP/IP header does not change during the connection lifetime. The other solutions propose 
reducing the rate of ACKs to avoid congestion. The first proposition is to delay ACKs at the 
destination. An ACK is sent every d packets, and an adaptive mechanism has been proposed 
to change d as a function of the congestion on the ACK path. Another option is to keep the 
destination unchanged and filter ACKs at the input of the slow channel. When an ACK arrives, 
the buffer is scanned to see if another ACK (or a certain number of ACKs) of the same 
connection is buffered. If so, the new ACK is substituted for the old one. ACKs are filtered 
to match their rates to the rate of the reverse channel. Normally, in the absence of artificial 
filtering, ACKs are filtered sometime later when the buffer gets full. The advantage of this 
solution is that the filtering is accomplished before the increase in RTT. Solutions at the 
sender side which reduce the burstiness of TCP are also possible. Note that this problem 
is caused by the reliance of TCP on the ACK clock, and it cannot be completely solved 
without any kind of packet spacing. 
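The ACK filtering scheme described above can be sketched as a queue placed at the input of the slow reverse channel. This is only an illustration under simple assumptions: each buffered entry is a pair of a connection identifier and a cumulative ACK number, at most one ACK per connection is kept, and the class and method names are hypothetical.

```python
from collections import deque


class AckFilterQueue:
    """FIFO buffer in front of the slow reverse channel, with ACK filtering."""

    def __init__(self):
        self._queue = deque()  # entries are (connection_id, ack_number)

    def enqueue(self, conn_id, ack_no):
        # Scan the buffer for an older ACK belonging to the same connection.
        for i, (cid, old_ack) in enumerate(self._queue):
            if cid == conn_id:
                # ACKs are cumulative, so the newer ACK subsumes the older one;
                # it takes over the older ACK's position in the queue.
                self._queue[i] = (conn_id, max(old_ack, ack_no))
                return
        self._queue.append((conn_id, ack_no))

    def dequeue(self):
        """Hand the next ACK to the slow channel, or None if the buffer is empty."""
        return self._queue.popleft() if self._queue else None

    def __len__(self):
        return len(self._queue)
```

Because the substitution happens on arrival, the buffer never holds more than one ACK per connection, so the queueing delay seen by ACKs (and hence the RTT) stays bounded by the number of active connections.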

First, a limit on the size of bursts sent by TCP is a possible solution. However, with 
systematic loss of ACKs, limiting the size of bursts limits the throughput of the connection. 
Second, it is possible to reconstruct the ACK clock at the output of the slow channel. When 
an ACK arrives at that point, all the missing ACKs are generated, spaced by a time interval 
derived from the average rate at which ACKs leave the slow channel. This reconstruction 
may contain a solution to this particular problem. However, the general problem of TCP 
burstiness upon loss of ACKs will still remain. 
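The ACK clock reconstruction can be sketched as follows. The sketch assumes packet-numbered cumulative ACKs and a spacing value that has already been derived from the average rate at which ACKs leave the slow channel; the function name and its parameters are illustrative.

```python
def reconstruct_acks(last_ack, new_ack, arrival_time, spacing):
    """Regenerate the ACKs that were dropped between last_ack and new_ack.

    Returns a list of (release_time, ack_number) pairs. The first regenerated
    ACK is released at arrival_time and the following ones every `spacing`
    seconds, so the source sees a smooth ACK clock instead of one large jump.
    """
    missing = range(last_ack + 1, new_ack + 1)
    return [(arrival_time + i * spacing, ack_no) for i, ack_no in enumerate(missing)]
```

For example, if the last forwarded ACK was 10 and ACK 13 arrives at time 5.0 with a spacing of 0.5, the sketch schedules ACKs 11, 12 and 13 at times 5.0, 5.5 and 6.0.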



9.2 TCP OPERATION AND PERFORMANCE 

The TCP protocol model will only include the data transfer part of the TCP. Details of the 
TCP protocol can be found in the various Internet requests for comments (RFCs; see also 
Stevens [6]). The versions of the TCP protocol that we model and analyze in this section 
all assume the same receiver process. The TCP receiver accepts packets out of sequence 
number order, buffers them in a TCP buffer, and delivers them to its TCP user in sequence. 
Since the receiver has a finite resequencing buffer, it advertises a maximum window size 
$W_{\max}$ at connection setup time, and the transmitter ensures that there is never more than 
this amount of unacknowledged data outstanding. We assume that the user application at 
the TCP receiver can accept packets as soon as the receiver can offer them in sequence 
and, hence, the receiver buffer constraint is always just $W_{\max}$. The receiver returns an ACK 
for every good packet that it receives. An ACK packet that acknowledges the first receipt 
of an error-free in-sequence packet will be called a first ACK. The ACKs are cumulative, 
i.e. an ACK carrying the sequence number n acknowledges all data up to, and including, 
the sequence number n - 1. If there is data in the resequencing buffer, the ACKs from the 
receiver will carry the next expected packet number, which is the first among the packets 
required to complete the sequence of packets in the resequencing buffer. Thus, if a packet is 
lost (after a long sequence of good packets), then the transmitter keeps getting ACKs with 
the sequence number of the first packet lost, if some packets transmitted after the lost packet 
do succeed in reaching the receiver. These are called duplicate ACKs. 
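The receiver behaviour described above can be sketched in a few lines. The sketch numbers packets rather than bytes and ignores the finite size of the resequencing buffer; the class name and its fields are illustrative assumptions.

```python
class Receiver:
    """Receiver that buffers out-of-order packets and returns cumulative ACKs."""

    def __init__(self, next_expected=0):
        self.next_expected = next_expected  # first packet still missing
        self.buffer = set()                 # resequencing buffer (out-of-order packets)

    def receive(self, seq):
        """Accept packet `seq` and return the ACK number sent in response."""
        if seq == self.next_expected:
            self.next_expected += 1
            # Deliver any buffered packets that are now in sequence.
            while self.next_expected in self.buffer:
                self.buffer.remove(self.next_expected)
                self.next_expected += 1
        elif seq > self.next_expected:
            self.buffer.add(seq)            # hole in the sequence: keep for later
        # The ACK always carries the next expected packet number.
        return self.next_expected
```

If packet 15 is lost and packets 16, 17 and 18 arrive, each of them produces an ACK carrying 15 (duplicate ACKs). When packet 15 finally arrives, the ACK jumps to 19.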

9.2.1 The TCP transmitter 

At all times t, the transmitter maintains the following variables for each connection: 

(1) A(t), the lower window edge. All data numbered up to and including A(t) - 1 has 
been transmitted and ACKed. A(t) is nondecreasing; the receipt of an ACK with 
sequence number n > A(t) causes A(t) to jump to n. 

(2) W(t), the congestion window. The transmitter can send packets with the sequence 
numbers n, $A(t) \le n < A(t) + W(t)$, where $W(t) \le W_{\max}$ and W(t) increases or 
decreases as described below. 

(3) $W_{th}(t)$, the slow-start threshold, controls the increments in W(t) as described below. 

9.2.2 Retransmission timeout 

The transmitter measures the RTTs of some of the packets for which it has transmitted and 
received ACKs. These measurements are used to obtain a running estimate of the packet RTT 
on the connection. Each time a new packet is transmitted, the transmitter starts a timeout 
timer and resets the already running timeout timer, if any; i.e. there is a timeout only for 
the last transmitted packet. The timer is set for a retransmission timeout (RTO) value that 
is derived from the RTT estimation procedure. The TCP transmitter process measures time 
and sets timeouts only in multiples of a timer granularity. Further, there is a minimum 
timeout duration in most implementations. We will see in the analysis that coarse timers 
have a significant impact on TCP performance. For details on RTT estimation and the setting 
of RTO values, see Reference [6] or [8]. 
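As an illustration of how coarse timers inflate the timeout, the following sketch uses the standard smoothed estimator specified in RFC 6298 (smoothed RTT plus four times the RTT variation) and then rounds the result up to a multiple of the timer granularity. This is not necessarily the exact procedure of References [6] and [8]; the 500 ms granularity, the 1 s minimum and the class name are assumptions made for the example.

```python
import math


class RtoEstimator:
    """RTO estimate in the style of RFC 6298, quantized to a coarse timer."""

    def __init__(self, granularity=0.5, min_rto=1.0, alpha=1 / 8, beta=1 / 4):
        self.granularity = granularity  # timer tick in seconds (assumed)
        self.min_rto = min_rto          # minimum timeout in seconds (assumed)
        self.alpha = alpha              # gain for the smoothed RTT
        self.beta = beta                # gain for the RTT variation
        self.srtt = None
        self.rttvar = None

    def update(self, sample):
        """Feed one RTT measurement (seconds) and return the new RTO."""
        if self.srtt is None:
            self.srtt = sample
            self.rttvar = sample / 2
        else:
            # The variation is updated first, using the old smoothed RTT.
            self.rttvar = (1 - self.beta) * self.rttvar + self.beta * abs(self.srtt - sample)
            self.srtt = (1 - self.alpha) * self.srtt + self.alpha * sample
        return self.rto()

    def rto(self):
        raw = self.srtt + max(self.granularity, 4 * self.rttvar)
        ticks = math.ceil(raw / self.granularity)  # timeouts are whole ticks
        return max(self.min_rto, ticks * self.granularity)
```

With these assumed settings, a first RTT sample of 200 ms already yields an RTO of 1 s, five times the measured RTT, which is the kind of overhead that makes timeout-based recovery expensive.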

9.2.3 Window adaptation 

The basic algorithm is common to all TCP versions [11]. The normal evolution of the 
processes A(t), W(t) and $W_{th}(t)$ is triggered by first ACKs (see definition above) and 
timeouts as follows. 

(1) Slow start: if $W(t) < W_{th}(t)$, each first ACK increments W(t) by one. 

(2) Congestion avoidance: if $W(t) \ge W_{th}(t)$, each first ACK increments W(t) by 
1/W(t). 

(3) Timeout at epoch t sets $W(t^+)$ to one and $W_{th}(t^+)$ to $\lceil W(t)/2 \rceil$, and retransmission 
begins from A(t). 
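The three rules above can be sketched as a small state holder. The initial window of one packet, the cap at the maximum window and the class name are assumptions for illustration; the window is kept as a real number so that the 1/W increments accumulate.

```python
import math


class TcpWindow:
    """Congestion window W and slow-start threshold W_th of the basic algorithm."""

    def __init__(self, w_max, w_th):
        self.w = 1.0        # congestion window, in packets (assumed initial value)
        self.w_th = w_th    # slow-start threshold
        self.w_max = w_max  # maximum window advertised by the receiver

    def on_first_ack(self):
        if self.w < self.w_th:
            self.w += 1            # slow start: one packet per first ACK
        else:
            self.w += 1 / self.w   # congestion avoidance: about one packet per RTT
        self.w = min(self.w, self.w_max)

    def on_timeout(self):
        self.w_th = math.ceil(self.w / 2)  # remember half of the window at the loss
        self.w = 1.0                       # restart from one packet
```

Starting with a threshold of 4, three first ACKs take the window from 1 to 4, the fourth adds only 1/4, and a timeout at that point sets the threshold to 3 and the window back to 1.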

9.2.4 Packet loss recovery 

If a packet is lost, A(t) and W(t) will continue to be incremented until the first ACK for 
the packet just before the lost packet is received. For a particular loss instance, let their 
final values be denoted by A and M, respectively; we will call M a loss window. Then the 
transmitter will continue to send packets up to the sequence number A + M - 1. If some 
of the packets sent after the lost packet get through, they will result in duplicate ACKs, all 
carrying the sequence number A. The last packet transmitted (i.e. A + M - 1) will have 
an RTO associated with it. The TCP versions differ in the way they recover from loss. We 
provide some details here; later we will provide more detail on modeling these recovery 
procedures. 

9.2.5 TCP-OldTahoe (timeout recovery) 

The transmitter continues sending until packet number A + M - 1 and then waits for a 
coarse timeout. 

9.2.6 TCP-Tahoe (fast retransmit [9]) 

A transmitter parameter K is used, a small positive integer; typically K = 3. If the transmitter 
receives the Kth duplicate ACK at time t (before the timer expires), then the transmitter 
behaves as if a timeout has occurred and begins retransmission, with $W(t^+)$ and $W_{th}(t^+)$ as 
given by the basic algorithm. 

9.2.7 TCP-Reno (fast retransmit, fast (but conservative) recovery [6]) 

Fast-retransmit is implemented, as in TCP-Tahoe, but the subsequent recovery phase is 
different. Suppose the Kth duplicate ACK is received at the epoch $t_0$. Loss recovery then 
starts. Bearing in mind the definitions of A and M above, the transmitter sets 

$$W(t_0^+) = \lceil M/2 \rceil + K, \qquad W_{th}(t_0^+) = \lceil M/2 \rceil \qquad (9.1)$$ 

The addition of K takes care of the fact that more packets have successfully left the network. 
The Reno transmitter then retransmits only the packet with sequence number A. If only one 
packet is lost then this retransmission will produce an ACK for all of the other packets, 
whereas if more packets are lost we had better be sure that we are not really experiencing 
congestion loss. For the ith duplicate ACK received, at say $t_{ack}$, until recovery completes, 

$$W(t_{ack}^+) = W(t_{ack}) + 1 \qquad (9.2)$$ 

Further details of the Reno recovery will be explained by using an example from Kumar [13], 
where K = 3. Suppose A = 15, M = 16 and packet 15 is lost. The transmitter continues 
sending packets 16, 17, 18, ..., 30; suppose packet 20 is also lost. The receiver returns 
ACKs (all asking for packet 15) for packets 16, 17, 18, 19, 21, ..., 29 and 30. Note that the 
ACK for packet 14 would have been the first ACK asking for packet 15. When the ACK for 
packet 18 is received (i.e. the third duplicate ACK), the transmitter sets 

$$W(t_0^+) = \lceil M/2 \rceil + K = \lceil 16/2 \rceil + 3 = 11$$ 
$$W_{th}(t_0^+) = \lceil M/2 \rceil = \lceil 16/2 \rceil = 8$$ 

A is still 15; thus, packet 15 is retransmitted. Meanwhile, ACKs for packets 19, 21, ..., 30 
are also received and based on Equation (9.2) W grows to 11 + 11 = 22. Since A = 15, 
with W — 22, the transmitter is allowed to send packets 15-36; hence, retransmission of 
packet 15 is followed by transmission of packets 31-36. Receipt of packet 15 results in a 
first ACK asking for packet 20 (thus first-ACKing packets 15-19) and A jumps to 20. This 
is called a partial ACK since all of the packets transmitted in the original loss window were 
not ACKed. 

If packets 31-36 succeed in getting through, then three duplicate ACKs asking for packet 
20 are also obtained, and 20 is retransmitted. This results in a first ACK that covers all of the 
packets up to packet number 36. At this time $t_1$, the congestion window is reset as follows 
and a new transmission cycle starts: 

$$W(t_1^+) = \lceil M/2 \rceil, \qquad W_{th}(t_1^+) = \lceil M/2 \rceil \qquad (9.3)$$ 

Thus, Reno slowly recovers the lost packets and there is a chance that, owing to insufficient 
duplicate ACKs, the recovery stalls and a timeout has to be waited for. After a timeout, the 
basic timeout recovery algorithm is applied. 
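The arithmetic of the Reno example can be checked with a short sketch. It applies Equation (9.1) at the Kth duplicate ACK and Equation (9.2) for every further duplicate ACK; the function name and the variable names are illustrative.

```python
import math


def reno_recovery_window(loss_window, k, dup_acks):
    """Return (W, W_th) after `dup_acks` duplicate ACKs, or None if fewer than k arrived."""
    if dup_acks < k:
        return None                   # fast retransmit is never triggered
    half = math.ceil(loss_window / 2)
    w = half + k                      # Equation (9.1), applied at the kth duplicate ACK
    w += dup_acks - k                 # Equation (9.2), one increment per further duplicate
    return w, half


A, M, K = 15, 16, 3
# Packets whose arrival triggers a duplicate ACK asking for packet 15
# (packets 15 and 20 are lost).
acked_packets = [16, 17, 18, 19] + list(range(21, 31))
w, w_th = reno_recovery_window(M, K, len(acked_packets))
highest_sendable = A + w - 1          # the window covers packets A .. A + W - 1
```

Here `w` is 22, `w_th` is 8 and `highest_sendable` is 36, in agreement with the example: after retransmitting packet 15 the transmitter may send the new packets 31 to 36.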






9.2.8 TCP-NewReno (fast retransmit, fast recovery [9]) 

When duplicate ACKs are received, the first lost packet is resent, but, unlike Reno, upon 
receipt of a partial ACK after the first retransmission, the next lost packet (as indicated 
by the partial ACK number) is retransmitted. Thus, after waiting for the first K duplicate 
ACKs, the remaining lost packets are recovered in as many RTTs. If fewer than K duplicate 
ACKs are received, then a timeout is inevitable. Consider the following example [13] to see 
the difference with Reno. Suppose that after a loss, A = 7 and M = 8, packets 7, 8, ..., 14 
are sent, and packets 7, 8 and 11 are lost. The transmitter receives three duplicate ACKs for 
packets 9, 10 and 12 (asking for packet 7). A fast retransmit is done (the same as in Reno), 
i.e. W = 4 + 3 = 7, $W_{th}$ = 4, and packet 7 is sent. The two ACKs for packets 13 and 14 
cause W to become 9 [see Equation (9.2)]. Assuming that packet 7 now succeeds, its ACK 
(the first ACK asking for 8) would make A equal 8; the transmitter can now send packets 
15 and 16 also. NewReno would now resend packet 8, whereas Reno would wait for three 
duplicate ACKs; these cannot come since only two more packets have been sent after the 
retransmission of packet 7. Thus, in case of multiple losses, Reno has a higher probability 
of resorting to a coarse timeout. 
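The numbers in this example can be reproduced with the following sketch, which also shows why Reno stalls. The helper function and all variable names are illustrative; the window rules are those of Equations (9.1) and (9.2) with K = 3.

```python
import math


def fast_retransmit_window(loss_window, k, dup_acks):
    """(W, W_th) after the kth duplicate ACK plus one increment per later duplicate."""
    half = math.ceil(loss_window / 2)
    return half + k + max(0, dup_acks - k), half


K = 3
A, M = 7, 8
sent = range(A, A + M)                    # packets 7 .. 14
lost = {7, 8, 11}
dup_acks = sum(1 for p in sent if p not in lost)   # ACKs for 9, 10, 12, 13, 14
W, W_th = fast_retransmit_window(M, K, dup_acks)

A_after = 8                               # the retransmitted packet 7 is ACKed; ACK asks for 8
new_packets = list(range(A + M, A_after + W))      # window now covers 8 .. A_after + W - 1

# Each new packet can produce at most one duplicate ACK asking for packet 8.
reno_can_fast_retransmit = len(new_packets) >= K
```

The sketch gives W = 9 and W_th = 4, and the only new packets are 15 and 16. Two new packets can generate at most two duplicate ACKs, fewer than K = 3, so Reno must wait for a timeout, whereas NewReno retransmits packet 8 as soon as the partial ACK arrives.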

9.2.9 Spurious retransmissions 

Consider TCP-OldTahoe. The retransmission of the first lost packet may result in an ACK 
that acknowledges all of the packets until the next packet lost in the loss window. This 
would advance the lower window edge A(t); the congestion window would be increased to 
two and the next lost packet and its successor would be transmitted, even if this successor 
packet had gotten through in its first transmission. Thus, some of the good packets in the 
loss window may be retransmitted when retransmission starts; this phenomenon can be seen 
in the sample path fragments shown in Fall and Floyd [9]. 

9.2.10 Modeling of TCP operation 

The model in Figure 9.3, motivated by many experimental studies of TCP performance 
over wireless mobile links [7, 10, 13], is the one used most often. In this section the additional 
issue of mobility is not considered [7]; hence, we refer to the wireless link as simply a 'lossy' 
link. We model only one direction of flow of packets from the LAN host (or base station in 
cellular system) to the mobile terminal. In this case propagation delays can be neglected. 
The transmission time of a TCP packet from the LAN host (respectively, the lossy link) 
is assumed to be exponentially distributed with mean $\lambda^{-1}$ (respectively, $\mu^{-1}$). By taking 
$\mu = 1$, time is normalized to the mean packet transmission time on the lossy link. During a 
bulk transfer over a TCP connection, we would expect that the packets would predominantly 
be of a fixed length. However, a MAC layer (or radio resource manager) will operate in 
these systems too. Thus, the randomness in packet transmission times in the model can 
be taken to account for the variability in the time taken to transmit a head-of-the-line 
packet at the transmitter queues. The exponential assumption yields Markov processes and, 
hence, justifies the use of tools from Chapter 6. Performance analysis of TCP protocols 
under the above assumptions can be found in Kumar [13]. A sample result is shown in 
Figure 9.4. 

Figure 9.3 Model for the TCP connection. X(t) is the number of TCP packets queued at 
the intermediate system (IS) at time t. 

Figure 9.4 Throughput of versions of TCP vs packet-loss probability (horizontal axis: packet 
error probability); $\lambda = 5\mu$; K is the fast-retransmit threshold. (Reproduced by permission 
of IEEE [13].) 

9.3 TCP FOR MOBILE CELLULAR NETWORKS 

Many papers have been written proposing methods for improving TCP performance over a 
wireless link [14-24]. Most of these papers have, however, concentrated on only one problem 
associated with wireless links, a perceived high BER over the wireless link. While a high 
BER has significant implications for protocol performance, other limitations of the wireless 
environment are equally or even more important than high BER. In this section we discuss 
the effects of periodic disconnections on TCP performance and present a protocol (M-TCP) 
that successfully deals with this problem. The protocol is also capable of on-the-fly data 
compression in the event that the wireless bandwidth available is very small. Finally, the 
protocol also maintains end-to-end TCP semantics. 

If we use TCP without any modification in mobile networks, we experience a serious 
drop in the throughput of the connection. There are several reasons for such a drastic drop 
in TCP throughput. 

(1) The wireless link might suffer from a high bit error rate. While this may be true 
in some cases, the bit error rate can be reduced by one or two orders of magnitude by 
the use of appropriate FEC codes and retransmission schemes at the link layer (see 
Chapter 4). However, let us assume for the sake of this discussion that the wireless 
link is susceptible to inordinately high bit error rates. Bit errors cause packets to 
become corrupted, which results in lost TCP data segments or acknowledgements. 
When acknowledgements do not arrive at the TCP sender within a short amount 
of time (the retransmit timeout or RTO which is a multiple of half a second), the 
sender retransmits the segment, exponentially backs off its retransmit timer for the 
next retransmission, and closes its congestion window to one segment. Repeated 
errors will ensure that the congestion window at the sender remains small, resulting 
in low throughput [15]. It is important to note that FEC may be used to combat 
high BER, but it will waste valuable wireless bandwidth when correction is not 
necessary. 

(2) In a mobile environment, as a user moves between cells, there is a brief blackout 
period (or disconnection) while the mobile performs a handoff with the new access 
point. Disconnections may also be caused by physical obstacles in the environment 
that block radio signals, such as buildings. Disconnection periods can be of 
the order of several seconds, causing packet loss or delay in the transmission of 
acknowledgements of received packets. These disconnections result in lost data segments 
and lost ACKs which, in turn, result in the TCP sender timing out and closing 
its congestion window, thus greatly reducing the efficiency of the connection. Since 
these disconnections tend to be fairly lengthy, forward error correction schemes are 
ineffective. 

(3) It is likely that, in order to provide high-bandwidth wireless connections, cell sizes 
in 4G systems will have to be reduced. Small cell sizes unfortunately result in small 
cell latencies that, in turn, cause frequent disconnections as a user roams. All the 
problems that result from disconnections, as we discussed above, occur more often 
here. 

Another problem caused by small cell latencies and frequent disconnections is that 
of serial timeouts at the TCP sender. A serial timeout is a condition wherein multiple 
consecutive retransmissions of the same segment are transmitted to the mobile while it is 
disconnected. All these retransmissions are thus lost. Since the retransmission timer at the 
sender is doubled with each unsuccessful retransmission attempt (until it reaches 64 s), 
several consecutive failures can lead to inactivity lasting several minutes. Thus, even when 
the mobile is reconnected, no data is successfully transmitted for as long as 1 min. The 
serial timeouts at the TCP sender can prove to be even more harmful to overall throughput 
than losses due to bit errors or small congestion windows. 
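The cost of serial timeouts can be estimated with a short sketch. It assumes that the last segment is sent at the instant the disconnection begins, that every retransmission made during the disconnection is lost, and that the timer doubles after each failure up to the 64 s cap mentioned above. The function name and the example values are illustrative.

```python
def idle_time_after_reconnect(initial_rto, disconnect_duration, max_rto=64.0):
    """Seconds the reconnected mobile waits for the sender's next retransmission.

    Time is measured from the start of the disconnection, which is assumed to
    coincide with the transmission of the segment that gets lost.
    """
    rto = initial_rto
    next_attempt = initial_rto           # time of the first retransmission
    while next_attempt < disconnect_duration:
        # This attempt falls inside the disconnection and is lost: back off.
        rto = min(2 * rto, max_rto)
        next_attempt += rto
    return next_attempt - disconnect_duration
```

With an initial RTO of 1 s, retransmissions occur at 1, 3, 7, 15, 31, 63, 127 and 191 s. A 10 s disconnection therefore leaves the link idle for a further 5 s after reconnection, and a 130 s disconnection leaves it idle for about 1 min.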

9.3.1 Improving TCP in mobile environments 

In order to ensure that the TCP connection to a mobile is efficient, it is necessary to prevent 
the sender from shrinking its congestion window when packets are lost either due to bit 
errors or to disconnection. Furthermore, it is important to ensure that, when the mobile 
is reconnected, it begins receiving data immediately (rather than having to wait for the 
sender to timeout and retransmit). As we saw earlier, the sender may shrink its congestion 
window in response to a timeout (caused by lost packets either due to a high BER or 
disconnections) or in response to duplicate ACKs. As we already discussed in Section 9.1, 
some solutions attempt to keep the sender's congestion window open by introducing a host 
in the fixed network that 'spoofs' the sender into thinking everything is fine on the wireless 
link. Unfortunately, however, these solutions do not work under all scenarios of mobility. 
Specifically, they all perform poorly when faced with frequent or lengthy disconnections. 
Furthermore, some solutions fail to maintain end-to-end TCP semantics. 

One proposed solution for losses caused by high BER is the Berkeley Snoop Module 
[16]. The snoop module resides at an intermediate host near the mobile user (typically 
the base station). It inspects the TCP header of TCP data packets and acknowledgements 
which pass through and buffers copies of the data packets. Using the information from 
the headers, the snoop module detects lost packets (a packet is assumed to be lost when 
duplicate acknowledgements are received) and performs local retransmissions to the mobile. 
The module also implements its own retransmission timer, similar to the TCP retransmission 
timeout, and performs retransmissions when an acknowledgement is not received within this 
interval. An improved version of the snoop module adds selective retransmissions from the 
intermediate node to the mobile. Another solution to the problem caused by high BER is 
the I-TCP [14] protocol (indirect-TCP). In the I-TCP protocol a TCP connection between 
a fixed host and a Mobile Host (MH) is split in two at the Mobile Support Station (MSS) or 
base station. Data sent to the MH is received and ACKed by the MSS before being delivered 
to the MH. Note that on the connection between the MSS and MH it is not necessary to use 
TCP, rather some protocol optimized for the wireless link could be used. 

A solution that addresses the problem caused by short disconnections (where one or two 
segments only are lost) is presented in [21]. It does not, however, split the TCP connection. 
The solution is based on the following observation: during a handoff, since the MH cannot 
receive packets, unmodified TCP at the sender will assume that congestion has occurred and will 
begin congestion control (reduce window size and retransmit) after a timeout. The timeout 
period is long and even though the MH may have completed the handoff it will have to 
wait for the full timeout period before it begins receiving packets from the sender. The 
fast retransmit idea presented in Caceres and Iftode [21] forces the MH to retransmit, in 
triplicate, the last old ACK as soon as it finishes a handoff. This forces the sender to reduce 
the congestion window to a half and retransmit one segment immediately. 

9.3.2 Mobile TCP design 

The implementation of M-TCP is influenced, to some degree, by the mobile network architecture. 
In this section we assume a three-level hierarchy. At the lowest level are the mobile 
hosts that communicate with MSS nodes in each cell. Several MSSs are controlled by a 
supervisor host (SH). The SH is connected to the wired network and it handles most of the 
routing and other protocol details for the mobile users. In addition it maintains connections 
for mobile users, handles flow-control and is responsible for maintaining the negotiated 
quality of service. These SHs thus serve the function of gateways. 

The design of the transport layer is influenced by a number of constraints unique to the 
mobile environment: 

• Available bandwidth within a cell may change dynamically. This leads to difficulties in 
guaranteeing QoS parameters such as delay bounds and bandwidth guarantees. 

• Mobile hosts frequently encounter extended periods of disconnection (due to handoff or 
physical interference with the signal), resulting in significant loss of data, causing poor 
TCP and UDP throughput. 

• Mobile devices are battery powered and hence power is a scarce resource. Protocols 
designed for use on mobile platforms must therefore be tailored to be power-efficient. 

All of these factors point towards using transport protocols which are optimized specifically 
for the wireless link. It is not reasonable, however, to assume that the entire installed 
network base will (or should) upgrade to these mobile-friendly protocols as they may never 
have occasion to communicate with a mobile user. Hence, the obvious solution is to split 
transport connections in two. The existing protocols may continue to be used on the fixed 
network, and the protocols optimized for the wireless environment may be used in the mobile 
subnetworks. In the following, all connections set up by an MH, where the other endpoint 
is either another MH or is a fixed host, are split at the SH. Thus, the service-provider sets 
up a connection with the SH assuming the SH is the other end-point of the connection. The 
SH sets up another connection to the MH. This allows us to address the problems listed 
above as follows: 

• A bandwidth management module at the SH [19] assigns a fixed amount of bandwidth 
to each connection (this is recomputed periodically based on the changing needs of 
other MHs) and ensures that data is transmitted to all MHs at their assigned rate. This 
mechanism allows implementation of some limited form of QoS guarantees. 

• In this architecture, the SH performs local error recovery to combat the efficiency problems 
resulting from loss over the wireless link. 

• The SH tracks mobiles as they roam and uses this information to ensure that the number of 
duplicate packets transmitted to the mobile is kept small. Since every packet received by 
the MH has to be processed, consuming battery power, keeping the number of duplicates 
small reduces power consumption at the MH. 

The goal in developing the M-TCP protocol is to provide a general solution to the problem of 
improving TCP's efficiency for mobile computing applications. Specifically, the following 
characteristics are desired: 

(1) improve TCP performance for mobile clients; 

(2) maintain end-to-end TCP semantics; 

(3) be able to deal with the problems caused by lengthy disconnections or by 
frequent disconnections; 

(4) adapt to dynamically changing bandwidth over the already starved wireless link; 

(5) ensure that handoffs (as a mobile roams) are efficient. 

To this end we chose to use the split connection approach for implementing M-TCP 
because it fits in well with our general design philosophy articulated in the previous section 
and because it allows us to modify TCP on the mobile network to make it respond better to 
disconnections and low (or varying) wireless bandwidth. 

Figure 9.5 M-TCP connection. 

So, every TCP connection is split in two at the SH. The TCP sender on the fixed network 
uses unmodified TCP to send data to the SH while the SH uses M-TCP for 
delivering data to the MH. Figure 9.5 illustrates the way in which a TCP connection is split. 
The TCP client at the SH (called SH-TCP) receives segments transmitted by the sender 
and passes these segments to the M-TCP client for delivery to the MH. ACKs received by 
M-TCP at the SH are forwarded to SH-TCP for delivery to the TCP sender. In the remainder 
of this section we discuss the behavior of SH-TCP and M-TCP in detail. Before doing so, 
however, we briefly discuss our assumptions regarding the mobile networking environment. 

Wireless bandwidth will always be a precious resource and we therefore believe that 
it ought to be allocated to users based on their need. In this architecture, we use a bandwidth 
management module [19] that resides at the SH to perform this task. This module 
determines the amount of bandwidth to be allocated to each connection within each cell 
(a discussion of the possible implementation of this module is given in Chapter 12). Thus, 
when a mobile opens a TCP connection, it is allocated a fixed amount of bandwidth (this 
amount may change periodically depending on the needs of other mobiles and the current 
status of this mobile's connection). What implication does this have for the design of TCP? 
Clearly, since the bandwidth allocated to the mobile is fixed, there is little reason to implement 
TCP's sophisticated congestion control mechanisms between the SH and MH. A 
second assumption we make in this development is that the BER visible at the transport 
layer will be small. This assumption is based on the fact that the physical layer will keep 
the BER over wireless channels in a range close to $10^{-5}$. Thus, it is necessary to design a 
TCP protocol that handles the unremedied problems, namely disconnections and low bandwidth, rather 
than a protocol that works well only in high BER environments. 

9.3.3 The SH-TCP client 

When SH-TCP receives a segment from the TCP sender, it passes the segment on to the 
M-TCP client. However, unlike I-TCP, it does not ACK this data until the MH does. 
SH-TCP is notified of MH ACKs by the M-TCP client running at the SH. It is easy to 
see that this behavior ensures that end-to-end TCP semantics are maintained. However, 
how do we ensure that the sender does not go into congestion control when ACKs do not 
arrive (because the MH was temporarily disconnected and did not receive the data, or could 
not send the ACKs)? The approach is to choke the TCP sender when the MH is disconnected, 
and allow the sender to transmit at full speed when the MH reconnects by manipulating the 
TCP sender's window. Specifically, let W denote the currently advertised receive window at 
SH-TCP. Say the window contains $w \le W$ bytes. Assume that the MH has ACKed bytes up 
to $w' \le w$. SH-TCP sends an ACK (or ACKs) for bytes up to w' - 1 in the normal way. As 
and when the MH ACKs more data, more ACKs are generated but one last byte is always 
left unacknowledged. 
Say the MH disconnects after having ACKed bytes up to w'. The M-TCP client assumes 
that the MH has been temporarily disconnected because it stops receiving ACKs for bytes 
transmitted after w'. M-TCP sends an indication of this fact to SH-TCP, which then sends an 
ACK for the w'th byte to the sender. This ACK will also contain a TCP window size update 
that sets the sender's window size to zero. When the TCP sender receives the window update, 
it is forced into persist mode. While in this state, it will not suffer from retransmit timeouts 
and will not exponentially back off its retransmission timer, nor will it close its congestion 
window. RFC 1122 states that TCP clients go into persist mode when the window is shrunk 
to zero. However, in order to shrink the window to zero, the receiver must send a new ACK. 
To grow a window, however, an old ACK can be retransmitted. Note that as long as the 
SH-TCP sends ACKs for the persist packets sent by the TCP source, the state of the sender 
does not change no matter how long the disconnection period lasts. 

If the MH has not disconnected but is in a cell with very little available bandwidth, 
SH-TCP still sends an ACK for byte w' with a window size set to 0. SH-TCP estimates 
the RTT to the TCP sender and estimates the RTO interval. It uses this information to 
preemptively shrink the sender's window before the sender goes into exponential backoff 
(to implement this scheme, a timer at the SH is maintained that is initialized to this estimated 
RTO value). 

When an MH regains its connection, it sends a greeting packet to the SH. M-TCP is 
notified of this event and it passes on this information to SH-TCP which, in turn, sends 
an ACK to the sender and reopens its receive window (and hence the sender's transmit 
window). The window update allows the sender to leave persist mode and begin sending 
data again. Since the sender never timed out, it never performed congestion control or slow- 
start. Thus the sender can resume transmitting at full-speed! The sender will begin sending 
from byte w + 1 here, although possibly duplicating previous sends, so we do not want to 
shrink the window more often than necessary. 

A potential problem with this protocol is the following: say the sender only sends data to 
the MH occasionally. Specifically, if the sender only needed to send bytes 0-50 to the MH, 
according to the above protocol, SH-TCP will ACK bytes up to 49 but will not ACK byte 50. 
This will cause the sender to timeout and retransmit this byte repeatedly. Eventually, after 
12 retransmissions, the TCP sender will give up and quit. This is clearly an undesirable 
situation. This problem is handled by not allowing SH-TCP to shrink the sender's window 
if it believes that there will be no more new ACKs from the MH that will allow it to open 
up the sender's window again. Thus, when SH-TCP thinks that the sender will timeout (and 
if the MH is in a cell with plenty of available bandwidth), it simply ACKs the last byte. At 
this point, there will be no saved byte at SH-TCP to allow it to shrink the sender's window. 
Observe that this is not a problem because the sender did not really have any new segments 
to send to the MH. As and when the sender does transmit new data to the MH, SH-TCP 
reverts to its previously described behavior. 



9.3.4 The M-TCP protocol 

The goal in designing SH-TCP was to keep the TCP sender's congestion window open in 
the face of disconnection events at the MH. On the wireless side, on the other hand, the goal 
is to be able to recover quickly from losses due to disconnections, and to eliminate serial 
timeouts. 

In designing M-TCP, a scheme similar to SH-TCP is used except that there is more design 
freedom since the protocol at both ends of M-TCP can be modified. M-TCP at the mobile 
receiver is very similar to standard TCP except that it responds to notifications of wireless 
link connectivity. When M-TCP at the MH is notified (by the communications hardware) 
that the connection to its MSS has been lost, it freezes all M-TCP timers. This essentially 
ensures that disconnections do not cause the MH's M-TCP to invoke congestion control. 
Observe that neither acknowledgements nor data are lost during a disconnection. When the 
connection is regained, M-TCP at the MH sends a specially marked ACK to M-TCP at 
the SH which contains the sequence number of the highest byte received thus far. It also 
unfreezes M-TCP timers to allow normal operation to resume. 

At the SH we need to ensure that M-TCP monitors link connectivity and takes appro- 
priate action either when a disconnection occurs or when a reconnection occurs. How does 
M-TCP determine that the MH has been disconnected? In the design M-TCP monitors the 
flow of ACKs from the MH in response to segments transmitted on the downlink. If it does 
not receive an ACK for some time, it automatically assumes that the MH has been discon- 
nected. It then informs SH-TCP of this fact and SH-TCP reacts by putting the TCP sender 
in persist mode. M-TCP at the SH knows when the MH reconnects because M-TCP at the 
MH retransmits the last ACK, and marks it as a reconnection ACK, on being reconnected. 
M-TCP responds by informing SH-TCP who reopens the sender's transmit window. This 
behavior is based on the assumption that the bandwidth management module at the SH 
assigns bandwidth to each connection and regulates its usage so there is no need to invoke 
congestion control at the SH when ACKs are not received from the MH. M-TCP at the SH 
works as follows. 

When a retransmit timeout occurs, rather than retransmit the segment and shrink the 
congestion window, we force M-TCP into persist mode. It is assumed that the absence of an 
ACK means that the mobile receiver has temporarily disconnected, hence retransmissions 
are futile until the MH notifies the SH that it has reconnected. 

At first glance the implementation may appear to have a problem when operating in 
high BER environments. This is because a significant proportion of packet loss in such 
environments will be due to the high BER and not due to disconnections. In response to 
this observation the following arguments are used: 

• It is believed that in most mobile environments link layer solutions will ensure that the 
bit error seen at the transport layer is small. 

• Even if the mobile environment does have high BER, the M-TCP sender will come out 
of persist mode quickly and resume regular data transmissions. This is because, when in 
persist mode, M-TCP will send persist packets to the MH who will be forced to respond 
when it receives these packets. Typically, the persist packets are generated at increas- 
ing intervals starting at 5 s and doubling until the interval becomes 60 s. The interval 
could be changed to be equal to the RTT between the SH and the MH to ensure earlier 
recovery. 
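
The persist-packet schedule mentioned in the second argument can be sketched as follows. The function name and its parameters are hypothetical; only the 5 s starting interval, the doubling and the 60 s ceiling come from the text.

```python
def persist_intervals(first=5.0, cap=60.0, count=6):
    """Return the first `count` gaps (in seconds) between persist packets.

    Each gap doubles until it reaches `cap`, after which it stays constant.
    """
    gaps, gap = [], first
    for _ in range(count):
        gaps.append(gap)
        gap = min(gap * 2, cap)
    return gaps
```

With the defaults this gives gaps of 5, 10, 20, 40, 60 and 60 s. Setting both `first` and `cap` to the SH-MH round trip time models the faster-recovery variant suggested above.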

When the SH receives a specially marked reconnection ACK, it moves back out of persist 
mode and retransmits all packets greater than the ACKed byte as these must have been lost 
during the disconnection. If the SH misinterpreted a lengthy delay as a disconnection, the 
MH will eventually ACK the transmitted packets, which will also move the SH M-TCP out 
of persist but will not cause all packets to be retransmitted. 
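
A minimal sketch of this behavior at the SH is given below. The class and method names are invented for illustration, and sequence numbers are treated as plain byte counts.

```python
class MTcpSender:
    """Toy model of M-TCP at the SH (hypothetical names, byte-granular)."""

    def __init__(self):
        self.persist = False
        self.highest_sent = 0
        self.highest_acked = 0

    def send(self, upto):
        """Record that bytes up to `upto` were sent on the downlink."""
        self.highest_sent = max(self.highest_sent, upto)

    def on_rto(self):
        """Timeout: assume a disconnection and enter persist mode.

        Nothing is retransmitted and the congestion window is left alone.
        """
        self.persist = True

    def on_ack(self, acked, reconnection=False):
        """Leave persist mode; return the byte range to retransmit, or None.

        Only a specially marked reconnection ACK triggers retransmission
        of everything beyond the ACKed byte.
        """
        self.highest_acked = max(self.highest_acked, acked)
        self.persist = False
        if reconnection and self.highest_acked < self.highest_sent:
            return (self.highest_acked + 1, self.highest_sent)
        return None
```

An ordinary ACK arriving after a long delay therefore ends persist mode without the bulk retransmission that a reconnection ACK causes.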

Figure 9.6 Experimental M-TCP connection setup. 



9.3.5 Performance examples 

The experimental/simulation setup is shown in Figure 9.6. The system parameters are as 
in [20]. The wireless link has a capacity of 32 kbit/s. File transfer times for 500 K and 1 MB 
files (respectively) vs disconnection length are given in Figures 9.7 and 9.8. Latency and 
disconnection length were normally distributed random variables, and the TCP sender was 
five hops (Figure 9.7) and 15 hops (Figure 9.8) away. We can see that M-TCP offers better 
performance. 

Figure 9.7 M-TCP/TCP vs disconnection length performance - 5 hops TCP connection. 

Figure 9.8 M-TCP/TCP vs disconnection length performance - 15 hops TCP connection. 



9.4 RANDOM EARLY DETECTION GATEWAYS 
FOR CONGESTION AVOIDANCE 



In this section we discuss random early detection gateways for congestion avoidance in 
packet-switched networks [25]. RED gateways are designed to accompany a transport-layer 
congestion control protocol. The gateway detects incipient congestion by computing the 
average queue size. The gateway could notify connections of congestion either by dropping 
packets arriving at the gateway or by setting a bit in packet headers. When the average 
queue size exceeds a preset threshold, the gateway drops or marks each arriving packet with 
a certain probability, where the exact probability is a function of the average queue size. 

Prior to RED, early random drop gateways were studied as a method for providing 
congestion avoidance at the gateway. The initial works proposed gateways to monitor the 
average queue size to detect incipient congestion and to randomly drop packets when 
congestion is detected. RED gateways differ from the earlier early random drop gateways 
in several respects: (1) the average queue size is measured; (2) the gateway is not limited to 
dropping packets; and (3) the packet-marking probability is a function of the average queue 
size. 

With an even earlier version called random drop gateways, when a packet arrives at the 
gateway and the queue is full, the gateway randomly chooses a packet from the gateway 
queue to drop. In the implementation of early random drop gateways, if the queue length 
exceeds a certain drop level, then the gateway drops each packet arriving at the gateway with 
a fixed drop probability p. Even then, it was stressed that, in future implementations, the drop 
level and drop probability should be adjusted dynamically depending on network traffic. 
With drop tail gateways, each congestion period introduces global synchronization in the 
network. When the queue overflows, packets are often dropped from several connections, 
and these connections decrease their windows at the same time. This results in a loss of 
throughput at the gateway. 
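
For contrast with RED, the two simplest drop rules described above can be written as one-line predicates. This is a sketch under the stated definitions; the function names and the injectable `rng` argument are assumptions for illustration.

```python
import random

def drop_tail(queue_len, buffer_size):
    """Drop tail: an arriving packet is dropped only when the buffer is full."""
    return queue_len >= buffer_size

def early_random_drop(queue_len, drop_level, p, rng=random.random):
    """Early random drop: above the drop level, drop with fixed probability p."""
    return queue_len > drop_level and rng() < p
```

Both rules look only at the instantaneous queue length, and the drop probability does not depend on how far the queue has grown.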



9.4.1 The RED algorithm 

The RED gateway calculates the average queue size $W_q$. The average queue size is compared 
with a minimum ($W^-$) and a maximum ($W^+$) threshold. When the average queue size is 
less than the minimum threshold, no packets are marked. When the average queue size is 
greater than the maximum threshold, every arriving packet is marked. If marked packets 
are, in fact, dropped or if all source nodes are cooperative, this ensures that the average 
queue size does not significantly exceed the maximum threshold. When the average queue 
size is between the minimum and maximum thresholds, each arriving packet is marked with 
probability $p_a$, where $p_a$ is a function of the average queue size $W_q$. Each time a packet 
is marked, the probability that a packet is marked from a particular connection is roughly 
proportional to that connection's share of the bandwidth at the gateway. 

Thus, the RED gateway has two separate algorithms. The algorithm for computing 
the average queue size determines the degree of burstiness that will be allowed in the 
gateway queue. The algorithm for calculating the packet-marking probability determines 
how frequently the gateway marks packets, given the current level of congestion. The goal 
is for the gateway to mark packets at fairly evenly spaced intervals, in order to avoid biases 
and avoid global synchronization, and to mark packets sufficiently frequently to control the 
average queue size. 

The gateway's calculations of the average queue size take into account the period when the 
queue is empty (the idle period) by estimating the number m of small packets that could have 
been transmitted by the gateway during the idle period. After the idle period, the gateway 
computes the average queue size as if m packets had arrived to an empty queue during that 
period. As $W_q$ varies from $W^-$ to $W^+$, the packet-marking probability $p_b$ varies linearly from 
0 to its maximum value $P$: $p_b \leftarrow P (W_q - W^-)/(W^+ - W^-)$. The final packet-marking 
probability $p_a$ increases slowly as the count $c$ of packets since the last marked packet increases: 
$p_a \leftarrow p_b/(1 - c\,p_b)$. This ensures that the gateway does not wait too long before marking a packet. 
The gateway marks each packet that arrives at the gateway when $W_q > W^+$. 
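
The two RED computations, the running average and the marking probability, can be sketched together as below. This is a simplified illustration rather than the reference algorithm of [25]: the class name, the argument names, the default values and the way the count is reset are assumptions, and the average is updated with the exponentially weighted filter that RED uses.

```python
import random

class RedGateway:
    """Sketch of RED packet marking; all names are illustrative."""

    def __init__(self, w_min, w_max, p_max=0.02, weight=0.002, rng=random.random):
        self.w_min, self.w_max = w_min, w_max   # thresholds W^- and W^+
        self.p_max, self.weight = p_max, weight # maximum P and filter weight
        self.avg = 0.0                          # average queue size W_q
        self.count = 0                          # c: unmarked packets since last mark
        self.rng = rng

    def on_idle(self, m):
        """Age the average as if m packets had arrived at an empty queue."""
        self.avg *= (1.0 - self.weight) ** m

    def on_arrival(self, queue_len):
        """Update W_q and return True if the arriving packet is marked."""
        self.avg = (1.0 - self.weight) * self.avg + self.weight * queue_len
        if self.avg < self.w_min:               # below W^-: never mark
            self.count = 0
            return False
        if self.avg > self.w_max:               # above W^+: always mark
            self.count = 0
            return True
        p_b = self.p_max * (self.avg - self.w_min) / (self.w_max - self.w_min)
        denom = 1.0 - self.count * p_b
        p_a = 1.0 if denom <= 0 else min(1.0, p_b / denom)
        if self.rng() < p_a:
            self.count = 0
            return True
        self.count += 1
        return False
```

Because `p_a` grows with the count, the gap between marks stays bounded and roughly even, which is the behavior the text attributes to the final marking probability.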



9.4.2 Performance example 

In this section we focus on the impact of traffic burstiness on system performance. 
Bursty traffic at the gateway can result from an FTP connection with a long delay-bandwidth 
product but a small window; a window of traffic will be sent, and then there will be a delay 
until the ACK packets return and another window of data can be sent. Variable-bit-rate 
video traffic and some forms of interactive traffic are other examples of bursty traffic seen 
by the gateway. This section shows that, unlike drop tail or random drop gateways, RED 
gateways do not have a bias against bursty traffic. For the simulation, FTP connections with 
infinite data, small windows and small RTT are used to model the less bursty traffic, and 
FTP connections with smaller windows and longer RTT are used to model the more bursty 
traffic, as shown in Figure 9.9. Connections 1-4 have a maximum window of 12 packets, while 
connection 5 has a maximum window of eight packets. Because node 5 has a large RTT and 
a small window, node 5 packets often arrive at the gateway in a loose cluster. By this, we 
mean that, considering only node 5 packets, there is one long interarrival time and many 
smaller interarrival times. The same simulation scenario is used, for example, in Floyd and 
Jacobson [25]. 

Figure 9.10 shows the simulation results of the network in Figure 9.9 with drop tail, 
random drop and RED gateways, respectively. The simulations in Figure 9.10 (a) and (b) 
were run with the buffer size ranging from eight to 22 packets. The simulations in Figure 
9.10 (c) were run many times with a minimum threshold ranging from three to 14 packets 
and a buffer size ranging from 12 to 56 packets. 

For Figure 9.10 (a) and (b), the x-axis shows the buffer size, and the y-axis shows 
node 5's throughput as a percentage of the total throughput through the gateway. In these 
simulations, the concern is to examine the gateway's bias against bursty traffic. 

Figure 9.9 A simulation network with five FTP connections. 

Figure 9.10 Performance of drop tail, random drop and RED gateways. 

For the drop tail gateway, Figure 9.11(a) shows the average queue size (in packets) 
seen by arriving packets at the bottleneck gateway, and Figure 9.11(b) shows the average 
link utilization on the congested link. Because RED gateways are quite different from drop 
tail or random drop gateways, the gateways cannot be compared simply by comparing 
the maximum queue size. A fair comparison is between a drop tail gateway and an RED 
gateway that maintains the same average queue size. With drop tail or random drop gateways, 
the queue is more likely to overflow when the queue contains some packets from node 
5. In this case, with either random drop or drop tail gateways, node 5 packets have a 
disproportionate probability of being dropped; the queue contents when the queue overflows 
are not representative of the average queue contents. Figure 9.10 (c) shows the result of 
simulations with RED gateways. The x-axis shows $W^-$, and the y-axis shows node 5's 
throughput. The throughput for node 5 is close to the maximum possible throughput given 
node 5's round trip time and maximum window. The parameters for the RED gateway are as 
follows: $W_q \leftarrow (1 - \omega_q) W_q + \omega_q W$, where $\omega_q$ represents the time constant of the low pass 
averaging filter and $W$ is the current value of the queue; $\omega_q = 0.002$ and $P = 1/50$ were 
chosen. In addition, $W^+ = 2W^-$, and the buffer size (which ranges from 12 to 56 packets) 
is $4W^-$. 
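
To see how slowly an averaging filter with weight 0.002 reacts, consider the idealized case, assumed here purely for illustration, in which the average starts at zero and the instantaneous queue then stays at a constant value $W$ for $n$ consecutive arrivals. Unrolling the update gives a closed form, and from it the number of arrivals $n_{1/2}$ needed for the average to reach half of $W$:

```latex
% Assumption: average is 0 before the burst; the queue is W for every arrival after.
W_q^{(n)} = W\left[1 - (1-\omega_q)^n\right], \qquad
n_{1/2} = \frac{\ln 0.5}{\ln(1-\omega_q)} \approx \frac{0.693}{0.002} \approx 346
\quad (\omega_q = 0.002).
```

A burst therefore has to persist for several hundred packet arrivals before the average moves halfway toward the new queue length, which is why short bursts pass through the gateway unmarked.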



Figure 9.11 Drop tail gateway average queue length and link utilization. 



9.5 TCP FOR MOBILE AD HOC NETWORKS 

So far in this chapter we have discussed the problems of TCP design when dealing with 
congestion, errors in transmission, and disconnections due to mobility and fading. Transport 
connections set up in wireless ad hoc networks are further complicated by problems 
such as high bit error rates, frequent route changes, and partitions. If we run TCP over such 
connections, the throughput of the connection is observed to be extremely poor because 
TCP treats lost or delayed acknowledgments as congestion. In this section, we present an 
approach in which a thin layer is inserted between the Internet protocol (IP) and standard 
TCP to correct these problems and maintain high end-to-end TCP throughput. This sublayer 
will be referred to as the ad hoc TCP sublayer, or ATCP [26]. 

9.5.1 Effect of route recomputations 

As we will see in Chapter 13, when an old route is no longer available, the network layer at 
the sender attempts to find a new route to the destination. In dynamic source routing (DSR) 
this is done via route discovery messages, while in destination-sequenced distance-vector 
(DSDV) routing, table exchanges are triggered that eventually result in a new route being found. It 
is possible that discovering a new route may take significantly longer than the RTO at the 
sender. As a result, the TCP sender times out, retransmits a packet, and invokes congestion 
control. Thus, when a new route is discovered, the throughput will continue to be small 
for some time because TCP at the sender grows its congestion window using the slow start 
and congestion avoidance algorithm. This is again undesirable behavior because the TCP 
connection will be very inefficient. If we imagine a network in which route computations 
are done frequently (due to high node mobility), the TCP connection will never get an 
opportunity to transmit at the maximum negotiated rate (i.e. the congestion window will 
always be significantly smaller than the advertised window size from the receiver). 
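
The cost of each needless timeout can be estimated with an idealized model of window growth. The sketch below assumes textbook behavior (doubling per RTT below the slow start threshold, then one segment per RTT) with no losses or delayed ACKs; the function name and parameters are hypothetical.

```python
def rtts_to_reach(target, ssthresh, cwnd=1):
    """Count RTTs for cwnd (in segments) to grow from `cwnd` to `target`.

    Below ssthresh the window doubles each RTT (slow start); at or above
    it the window grows by one segment per RTT (congestion avoidance).
    """
    rtts = 0
    while cwnd < target:
        cwnd = min(cwnd * 2, ssthresh) if cwnd < ssthresh else cwnd + 1
        rtts += 1
    return rtts
```

For example, regaining a 32-segment window after a timeout that left `ssthresh` at 16 takes 20 round trips in this model. If routes change more often than that, the window never reaches the advertised size.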

9.5.2 Effect of network partitions 

It is likely that the ad hoc network may periodically get partitioned for several seconds at 
a time. If the sender and the receiver of a TCP connection lie in different partitions, all the 
sender's packets get dropped by the network, resulting in the sender invoking congestion 
control. If the partition lasts for a significant amount of time (say, several times longer than 
the RTO), the situation gets even worse because of a phenomenon called serial timeouts. 
A serial timeout is a condition wherein multiple consecutive retransmissions of the same 
segment are transmitted to the receiver while it is disconnected from the sender. All these 
retransmissions are, thus, lost. Since the retransmission timer at the sender is doubled 
with each unsuccessful retransmission attempt (until it reaches 64 s), several consecutive 
failures can lead to inactivity lasting 1 or 2 minutes, even when the sender and receiver get 
reconnected. 
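
The arithmetic behind serial timeouts can be sketched as follows. The model assumes that a segment is sent at time zero, exactly when the partition begins, and that the timer doubles after every failed attempt up to the 64 s ceiling quoted above; the function name and return format are illustrative.

```python
def retransmission_times(initial_rto, partition_len, cap=64.0):
    """Return (times, idle) for a sender facing a partition.

    `times` lists when each retransmission is sent (seconds after the
    original transmission); the last one is the first to fall after the
    partition heals. `idle` is the silent period between reconnection
    and that retransmission.
    """
    t, rto, times = 0.0, initial_rto, []
    while True:
        t += rto
        times.append(t)
        if t >= partition_len:
            return times, t - partition_len
        rto = min(rto * 2, cap)
```

With an initial RTO of 1 s and a 70 s partition, retransmissions go out at 1, 3, 7, 15, 31, 63 and 127 s, so the sender stays silent for 57 s after the route is restored.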

9.5.3 Effect of multipath routing 

As we will see in Chapter 13, some routing protocols, such as the temporally ordered routing 
algorithm (TORA), maintain multiple routes between source-destination pairs, the purpose 
of which is to minimize the frequency of route recomputation. Unfortunately, this sometimes 
results in a significant number of out-of-sequence packets arriving at the receiver. The effect 
of this is that the receiver generates duplicate ACKs which cause the sender (on receipt of 
three duplicate ACKs) to invoke congestion control. 
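
A small simulation shows how reordering alone produces duplicate ACKs. It assumes a receiver that acknowledges every arrival with the number of the next segment it expects; segment numbers and function names are illustrative.

```python
def cumulative_acks(arrivals):
    """Return the cumulative ACK (next expected segment) sent per arrival."""
    received, next_expected, acks = set(), 0, []
    for seq in arrivals:
        received.add(seq)
        while next_expected in received:
            next_expected += 1
        acks.append(next_expected)
    return acks

def max_duplicate_acks(acks):
    """Largest number of repeats of one ACK value beyond its first copy."""
    best = run = 0
    for prev, cur in zip(acks, acks[1:]):
        run = run + 1 if cur == prev else 0
        best = max(best, run)
    return best
```

If segments arrive in the order 0, 2, 3, 4, 1, 5, the receiver sends ACKs 1, 1, 1, 1, 5, 6. The three duplicates of ACK 1 are enough to trigger congestion control at the sender although no segment was lost.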

The congestion window in TCP imposes an acceptable data rate for a particular con- 
nection based on congestion information that is derived from timeout events as well as 
from duplicate ACKs. In an ad hoc network, since routes change during the lifetime of a 
connection, we lose the relationship between the congestion window (CWND) size and the 
tolerable data rate for the route. In other words, the CWND as computed for one route may 
be too large for a newer route, resulting in network congestion when the sender transmits 
at the full rate allowed by the old CWND. 

9.5.4 ATCP sublayer 

The ATCP sublayer utilizes network layer feedback (from intermediate hops) to put the TCP 
sender into either a persist state, congestion control state or retransmit state. So, when the 
network is partitioned, the TCP sender is put into persist mode so that it does not needlessly 
transmit and retransmit packets. On the other hand, when packets are lost due to error (as 
opposed to congestion), the TCP sender retransmits packets without invoking congestion 
control. Finally, when the network is truly congested, the TCP sender invokes congestion 
control normally. 

We do not have to modify standard TCP itself when we want to maintain compatibility 
with the standard TCP/IP suite. Therefore, to implement this solution, we can insert a 
thin layer called ATCP (ad hoc TCP) between IP and TCP that listens to the network 
state information provided by ECN [27] messages and by ICMP 'destination unreachable' 
messages and then puts TCP at the sender into the appropriate state. Thus, on receipt of 
a 'destination unreachable' message, the TCP state at the sender is frozen (the sender 
enters the persist state) until a new route is found, ensuring that the sender does not invoke 
congestion control. Furthermore, the sender does not send packets into the network during 
the period when no route exists between the source and destination. 

The ECN, which will be discussed later in more detail, is used as a mechanism by which 
the sender is notified of impending network congestion along the route followed by the TCP 
connection. On receipt of an ECN, the sender invokes congestion control without waiting 
for a timeout event (which, more often than not, may be caused by lost packets). Thus, 
the benefits of the ATCP sublayer are: 

• Standard TCP/IP is unmodified. 

• ATCP is invisible to TCP and, therefore, nodes with and without ATCP can interoperate. 
The only drawback is that nodes without ATCP will see all the performance problems 
associated with running TCP over ad hoc networks. 

• ATCP does not interfere with TCP's functioning in cases where the TCP connection is 
between a node in the wireline network and another in the wireless ad hoc network. 

Several papers in the literature deal with different aspects of the TCP problem in ad hoc 
networks. Holland and Vaidya [28], for example, investigate the impact of link breakage on 
TCP performance in such networks. They use dynamic source routing (DSR), discussed in 
Chapter 13 (see also Johnson and Maltz [29]), as the underlying routing protocol (simulated 
in NS2). DSR is an on-demand routing protocol where a sender finds a route to the destination 
by flooding route request packets. DSR's performance is optimized by allowing intermediate 
nodes to respond to route request packets using cached routes. Unfortunately, if the cached 
information maintained at an intermediate node is stale, the time it takes to find a new 
route can be very long (several seconds). Thus, TCP running on top of DSR sees very 
poor throughput. The paper proposes the use of explicit link failure notification (ELFN) to 
improve TCP performance. Here, the TCP sender is notified that a link has failed, and it 
disables its retransmission timer and enters a stand-by mode. In stand-by mode, the TCP 
sender periodically sends a packet in its congestion window to the destination. When an 
ACK is received, TCP leaves the stand-by mode, restores its retransmission timers and 
resumes transmission as normal. 

Chandran et al. [30] discuss a similar scheme for improving TCP performance in ad 
hoc networks in the presence of failures. Here, the router detecting a failed route generates 
a route failure notification (RFN) packet toward the source. The TCP source that receives 
this packet enters a snooze state which is very similar to TCP's persist state. When the 
route is re-established, a route re-establishment notification (RRN) is sent to the source 
by any router on the previous route that detected the new route. This packet removes the 
source from the snooze state. In this method, the source continues using the old congestion 
window size for the new route. This is a problem because the congestion window size is 
route-specific (since it seeks to approximate the available bandwidth). Chandran et al. [30] 
also do not consider the effects of congestion, out-of-order packets and bit errors. 

9.5.5 ATCP protocol design 

In this section we discuss the design of an ATCP that attempts to provide an integral solution 
to the problems of running TCP over multihop wireless networks. Such a protocol should 
have the following characteristics [26]: 

(1) It should improve TCP performance for connections set up in ad hoc wireless networks. 
As already discussed, TCP performance is affected by the problems of high 
BER and disconnections due to route recomputation or partition. In each of these 
cases, the TCP sender mistakenly invokes congestion control. A good protocol should 
provide only retransmissions for packets lost due to high BER, without shrinking the 
congestion window. Also, the sender should stop transmitting and resume when a 
new route has been found. As above, in the case of transient partition, the sender 
should stop transmitting until it is reconnected to the receiver. In the case of multipath 
routing, when TCP at the sender receives duplicate ACKs, it should not invoke 
congestion control because multipath routing shuffles the order in which packets are 
received. 

(2) It should maintain TCP's congestion control. If losses are caused due to network 
congestion, we want TCP to shrink its congestion window in response to losses and 
invoke slow start. 

(3) It should have appropriate CWND. When there is a change in the route (e.g. a 
reconnection after a brief partition), the congestion window should be recomputed. 

(4) It should maintain end-to-end TCP semantics in order to ensure that applications do 
not crash. 

(5) It should be compatible with standard TCP because we cannot assume that all ma- 
chines deployed in an ad hoc network will have ATCP installed. 

The above functionalities are implemented within a separate sublayer, the ATCP, placed 
between the network and TCP layer in the protocol stack (TCP/ATCP/IP stack). This layer 
monitors the TCP state and the state of the network (based on ECN and ICMP messages) 
and takes appropriate action. Figure 9.12 illustrates ATCP's four possible states: normal, 
congested, loss and disconnected. When the TCP connection is initially established, ATCP 
at the sender is in the normal state. 

In this state, ATCP does nothing and is invisible. Let us now examine ATCP's behavior 
under four circumstances. 

9.5.5.1 Lossy channel 

When the connection from the sender to the receiver is lossy, it is likely that some segments 
will not arrive at the receiver or may arrive out-of-order. Thus, the receiver may generate 
duplicate acknowledgments (ACKs) in response to out-of-sequence segments. When TCP 
receives three consecutive duplicate ACKs, it retransmits the appropriate segment and 
shrinks the congestion window. It is also possible that, due to lost ACKs, the TCP sender's 
RTO may expire, causing it to retransmit one segment and invoke congestion control. 

Figure 9.12 State transition diagram for ATCP at the sender. 

ATCP in its normal state counts the number of duplicate ACKs received for any segment. 
When it sees that three duplicate ACKs have been received, it does not forward the third 
duplicate ACK but puts TCP in persist mode. Similarly, when ATCP sees that TCP's RTO 
is about to expire, it again puts TCP in persist mode. By doing this, we ensure that the 
TCP sender does not invoke congestion control because that is the wrong thing to do under 
these circumstances. After ATCP puts TCP in persist mode, ATCP enters the loss state. 
In the loss state, ATCP transmits the unacknowledged segments from TCP's send buffer. 
It maintains its own separate timers to retransmit these segments in the event that ACKs 
are not forthcoming. Eventually, when a new ACK arrives (i.e. an ACK for a previously 
unacknowledged segment), ATCP forwards that ACK to TCP, which also removes TCP 
from persist mode. ATCP then returns to its normal state. 
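
The ACK handling just described can be sketched as a small filter sitting between IP and TCP. The interface is hypothetical, ATCP's own retransmission timers are omitted, and withholding further duplicates while in the loss state is an assumption of this sketch rather than something stated in the text.

```python
class AtcpAckFilter:
    """Toy ATCP ACK path between IP and TCP (hypothetical interface)."""

    def __init__(self):
        self.state = "normal"
        self.last_ack = None
        self.dup_count = 0

    def on_ack(self, ack):
        """Return True if the ACK is forwarded to TCP, False if withheld."""
        if self.last_ack is None or ack > self.last_ack:
            # A new ACK: forward it, which also takes TCP out of persist mode.
            self.last_ack, self.dup_count = ack, 0
            self.state = "normal"
            return True
        if ack < self.last_ack:
            return False                  # stale ACK, nothing to do
        self.dup_count += 1
        if self.state == "loss":
            return False                  # assumed: TCP stays in persist mode
        if self.dup_count >= 3:
            self.state = "loss"           # TCP goes to persist; ATCP retransmits
            return False
        return True
```

TCP thus sees at most two duplicates of any ACK and never reaches the threshold of three that would make it shrink its congestion window.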

9.5.5.2 Congested 

When the network detects congestion, the ECN flag is set in ACK and data packets. Let 
us assume that ATCP receives this message when in its normal state. ATCP moves into 
its congested state and does nothing. It ignores any duplicate ACKs that arrive and it also 
ignores imminent RTO expiration events. In other words, ATCP does not interfere with 
TCP's normal congestion behavior. After TCP transmits a new segment, ATCP returns to 
its normal state. 

9.5.5.3 Disconnected 

Node mobility in ad hoc networks causes route recomputation or even temporary network 
partition. When this happens, we assume that the network generates an ICMP destina- 
tion unreachable message in response to a packet transmission. When ATCP receives this 
message, it puts the TCP sender into persist mode and itself enters the disconnected state. 
TCP periodically generates probe packets while in persist mode. When, eventually, the 
receiver is connected to the sender, it responds to these probe packets with a duplicate ACK 
(or a data packet). This removes TCP from persist mode and moves ATCP back into normal 
state. In order to ensure that TCP does not continue using the old CWND value, ATCP sets 
TCP's CWND to one segment at the time it puts TCP in persist state. The reason for doing 
this is to force TCP to probe the correct value of CWND to use for the new route. 



9.5.5.4 Other transitions 

When ATCP is in the loss state, reception of an ECN or an ICMP source quench message will 
move ATCP into congested state and ATCP removes TCP from its persist state. Similarly, 
reception of an ICMP destination unreachable message moves ATCP from either the loss 
state or the congested state into the disconnected state and ATCP moves TCP into persist 
mode (if it was not already in that state). 
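
The transitions of Sections 9.5.5.1 to 9.5.5.4 can be collected into a lookup table. The event and action labels below are invented for this sketch; `"ecn"` stands for either an ECN or an ICMP source quench, and the CWND reset performed on entering the disconnected state is noted in a comment rather than modelled.

```python
PERSIST, UNPERSIST, NONE = "persist", "leave_persist", "none"

# (ATCP state, event) -> (next ATCP state, action applied to TCP)
TRANSITIONS = {
    ("normal", "dup_acks_or_rto"): ("loss", PERSIST),
    ("normal", "ecn"): ("congested", NONE),
    ("normal", "dest_unreachable"): ("disconnected", PERSIST),  # CWND set to 1 segment
    ("loss", "new_ack"): ("normal", UNPERSIST),
    ("loss", "ecn"): ("congested", UNPERSIST),
    ("loss", "dest_unreachable"): ("disconnected", PERSIST),
    ("congested", "tcp_sent_new_segment"): ("normal", NONE),
    ("congested", "dest_unreachable"): ("disconnected", PERSIST),
    ("disconnected", "receiver_reply"): ("normal", UNPERSIST),
}

def step(state, event):
    """Return (next_state, tcp_action); unlisted pairs leave the state unchanged."""
    return TRANSITIONS.get((state, event), (state, NONE))
```

Pairs missing from the table capture the "ignore" cases, for example duplicate ACKs or an imminent RTO while ATCP is in the congested state.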



9.5.5.5 Effect of lost messages 

Note that due to the lossy environment, it is possible that an ECN may not arrive at the 
sender or, similarly, a destination unreachable message may be lost. If an ECN message is 
lost, the TCP sender will continue transmitting packets. However, every subsequent ACK 
will contain the ECN, thus ensuring that the sender will eventually receive the ECN, causing 
it to enter the congestion control state as it is supposed to. Likewise, if there is no route 
to the destination, the sender will eventually receive a retransmission of the destination 
unreachable message, causing TCP to be put into the persist state by ATCP. Thus, in all 
cases of lost messages, ATCP performs correctly. Further details on protocol implementation 
can be found in Liu and Singh [26]. 

9.5.6 Performance examples 

An experimental testbed for the evaluation of the ATCP protocol, consisting of five Pentium 
PCs, each with two Ethernet cards, is described in Liu and Singh [26]. This 
gives a four-hop network where the traffic in each hop is isolated from the other hops. To 
model the lossy and low-bandwidth nature of the wireless links, a 32 kb/s channel is 
emulated in IP over each hop. Performance examples are shown in Figures 9.13-9.17. A 
uniformly better performance of ATCP over standard TCP is evident from all these 
examples. 

Figure 9.13 ATCP and TCP performance in the presence of bit error only. 

Figure 9.14 ATCP and TCP performance in the presence of bit error and congestion. 

Figure 9.15 ATCP and TCP performance in the presence of bit error and partition. 

Figure 9.16 ATCP and TCP performance in the presence of bit error and larger partition. 

Figure 9.17 ATCP and TCP performance in the presence of bit error and packet reordering. 

Crosslayer Optimization 



10.1 INTRODUCTION 

In many algorithms throughout the book, information from the physical layer was used to 
adjust the parameters of the higher-layer protocols in the network. In this section we 
explicitly discuss the gains that can be achieved by means of a cross-layer 
approach, where information from a certain layer is passed to the higher layers in order to 
optimize the system performance [1-49]. In Chapter 9 we discussed the operation of TCP 
and indicated the importance of being able to distinguish packet losses due to network 
congestion from those due to degradation of physical channel parameters. The reaction 
of TCP to losses due to congestion will be to change the congestion window size 
and reduce the packet transmission rate, whereas for physical channel degradation simple 
retransmission will suffice. 

Routers in the network indicate congestion by dropping packets, which in turn causes 
the source to adaptively decrease its sending rate. The ECN mechanism used to notify 
the receiver whenever congestion occurs in the network was also discussed in the previous 
chapter. This mechanism works in the following manner: included in a TCP packet's header 
is the ECN bit which is set to zero by the source. If the router detects congestion, it will 
set the ECN bit to one, and the packet is said to be marked. The marked packet eventually 
reaches the destination, which in turn informs the source about the value of the mark (i.e. 
the ECN bit value). The source adapts its transmission rate depending on the value of the 
mark. 
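
The marking loop just described can be sketched in a few lines of Python. This is only an illustration under stated assumptions, not the ECN specification: the queue threshold used by the router, the rule by which the source reacts to a mark (halve the rate, otherwise increase it by one) and all names are hypothetical choices made for the example.

```python
from dataclasses import dataclass


@dataclass
class Packet:
    seq: int
    ecn: int = 0  # the source always sends the ECN bit set to zero


def router_forward(packet: Packet, queue_len: int, threshold: int) -> Packet:
    """Mark the packet when the (assumed) queue threshold is exceeded."""
    if queue_len > threshold:
        packet.ecn = 1
    return packet


def destination_echo(packet: Packet) -> int:
    """The destination reports the value of the mark back to the source."""
    return packet.ecn


def source_update(rate: float, mark: int) -> float:
    """Assumed reaction: halve the rate on a mark, otherwise add one."""
    return max(1.0, rate / 2.0) if mark else rate + 1.0


def run(queue_lengths, threshold=10, rate=8.0):
    """Send one packet per queue-length sample and return the rate history."""
    history = []
    for seq, qlen in enumerate(queue_lengths):
        pkt = router_forward(Packet(seq), qlen, threshold)
        rate = source_update(rate, destination_echo(pkt))
        history.append(rate)
    return history
```

Calling `run([0, 0, 20, 0])` lets the third packet meet a congested queue, so the source rate rises twice, is halved once and then starts to rise again.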

Currently deployed TCP interprets all losses as being congestion-related. Whenever losses
occur over a wireless channel, the TCP source reacts as though they were due to congestion
and decreases its packet transmission rate, causing a loss in network throughput. A
solution that has been proposed to mitigate this problem is to 'smooth' the channel by
suitable coding and link-layer ARQ at a faster timescale than that of the TCP control
loop, so that the wireless link is ideally perceived as a constant channel, but with lower
capacity. In practice, however, the TCP sender may not be fully shielded from wireless
link losses. The TCP congestion control mechanism may then still react to these packet
losses, resulting in redundant retransmissions and loss of throughput. Once ECN-enabled
TCP is deployed, however, and the ECN bit can be used to mark packets to indicate
congestion, there is a means of differentiating between congestion-related loss and
wireless channel-related loss.

Thus, the channel need not be smoothed because the ECN mechanism provides a means 
of explicitly indicating congestion. In Chapter 9 it was demonstrated that, in a single user 
environment, if packets are marked based solely on congestion information, there is no 
significant degradation of TCP performance due to the time-varying nature of the wireless 
channel as compared with wired networks. Such an approach is an example of where a 
crosslayer view of physical layer information (channel conditions) is used at the network 
layer to significantly improve network-layer throughput performance. 

As a second example consider a conventional cellular system with a fixed base station and 
a number of mobile users. Data flows (packets) arrive from the core network to the cell base 
station and are destined for the mobile users, with the packets for each user being queued 
temporarily at the base station (a separate queue is maintained for each user). The objective 
of the base station is to schedule these packets to various mobiles in a timely manner. 
Owing to the asymmetric nature of the traffic we can restrict ourselves to just considering 
the forward link problem, where the direction of the data flow is from the base station to the 
mobile user. An efficient way to use the cellular spectrum is to dedicate some bandwidth for 
data transport and allocate this bandwidth in a dynamic manner among various data users. 
For instance, time could be divided into fixed-size time slots and users allocated time slots 
dynamically. The simplest scheduling algorithm is the round-robin mechanism, where users 
are periodically allocated slots irrespective of whether there is data to be sent to a particular 
mobile user. Note that this simple TDM scheme still leads to some wasted bandwidth during 
inactivity. Furthermore, it suffers from a more subtle problem: this approach is independent 
of the channel state (a channel-state-independent scheduling mechanism). 

A channel-state-dependent scheduling algorithm can provide significant gains. As an 
illustration consider a wireless system consisting of three users as in Figure 10.1. For this 
example, we consider access time to be slotted, and the channels to be constant over a 
time slot. For demonstration, let us assume that each channel is either ON or OFF with equal
probability, and that the channels of different users are independent. Thus, in this system, there are eight
possible (instantaneous) channel states for the three independent users, ranging from (ON, 
ON, ON) to (OFF, OFF, OFF). When a user's channel is ON, one packet can be transmitted 
successfully to the mobile user during the time slot. The system is assumed to be a TDM 
system; thus, the base station can transmit to only one user on each time slot. The associated 
scheduling problem is to decide which user is allowed access to the channel during each 
particular time slot. A naive scheduling rule would be to employ the well-known round- 
robin mechanism. In such a scheme, the users are periodically given access to the channel, 
with each user getting one-third of the slots over time. As the channel of each user is equally 
likely to be ON or OFF in each time slot, it follows that, over time, on average, each user 
will get a data rate of (1/2)(1/3) = 1/6 packets/slot.

On the other hand, suppose the base station uses knowledge of the instantaneous channel 
state. Then a simple policy would be to schedule and transmit to a user whose channel is 
in the ON state. If more than one user's channel is in the ON state, the scheduler could 
pick a user randomly (equally likely) among those users whose channels are ON, and send 
data to the selected user. This simple example assumes that all users have identical traffic
demands. For the case where some users have greater needs than others, high-demand users
would be assigned channel access with greater likelihood. In the symmetric case considered
here, the base station sends no data if and only if all users' channels are OFF (which occurs
on average only one-eighth of the time). Thus, the total data rate achieved by this
state-dependent rule is 1 - 1/8 = 7/8 packets/slot. As this rule is symmetric across users, it
follows that on average, the data rate per user is (1/3)(7/8) = 7/24 packets/slot, which is
almost twice the 1/6 packets/slot provided by the round-robin scheme. In addition, the base station does
not radiate power during bad channel conditions, thereby decreasing interference levels in 
the wireless network. 
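
The two per-user rates can be checked by enumerating the eight channel states directly. The following Python sketch does this with exact fractions; the function name and its parameters are illustrative assumptions, and the model is exactly the ON/OFF, independent-channel, one-packet-per-slot model of the example.

```python
from fractions import Fraction
from itertools import product


def per_user_rates(num_users: int = 3, p_on: Fraction = Fraction(1, 2)):
    """Return (round_robin, channel_aware) per-user rates in packets/slot."""
    # Round-robin: a user owns 1/num_users of the slots and succeeds when ON.
    round_robin = p_on / num_users

    # Channel-aware: enumerate all ON/OFF states. A packet is sent unless
    # every channel is OFF; the scheduler picks uniformly among ON users.
    total = Fraction(0)
    for state in product((True, False), repeat=num_users):
        prob = Fraction(1)
        for on in state:
            prob *= p_on if on else 1 - p_on
        if any(state):
            total += prob
    channel_aware = total / num_users  # the rule is symmetric across users
    return round_robin, channel_aware
```

With the default arguments the function returns 1/6 and 7/24 packets/slot, a ratio of 7/4. Increasing `num_users` shows how the total rate of the channel-aware rule approaches one packet per slot as more independent channels become available.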

This gain achieved due to channel-state-dependent scheduling is called the multi-user di- 
versity gain. This example illustrates the significance of multi-user diversity gain in network 
scheduling. However, a central question is how to design online algorithms that achieve this 
gain, while also supporting diverse QoS requirements for various users for realistic channel 
scenarios. While this question is not yet completely answered, there has been extensive 
research on various aspects of the problem [2]. 
10.2 A CROSS-LAYER ARCHITECTURE FOR VIDEO DELIVERY 

With time-varying wireless link quality, providing QoS for video applications in the form
of absolute guarantees [51, 52] may not be feasible. It is therefore more reasonable to
provide QoS in the form of soft (or 'elastic') guarantees, which allow QoS parameters in
the priority transmission system to be adjusted as channel conditions change. The relative
QoS differentiation discussed in References [52-54] is one of the possible solutions for a
4G adaptive QoS system. Similarly, on the application layer, it is desirable to have a
video bitstream that adapts to changing channel conditions. Among several possible
approaches to video quality adaptation [55, 56], scalable video [50] seems attractive owing
to its low complexity and high flexibility in rate adaptation.

Figure 10.2 shows a cross-layer QoS mapping architecture for video delivery over single- 
hop wireless networks. This architecture represents an end-to-end delivery system for a 
video source from the sender to the receiver, which includes source video encoding module, 
cross-layer QoS mapping and adaptation module, link-layer packet transmission module, 
wireless channel (time-varying and non-stationary), adaptive wireless channel modeling 
module, and video decoder/output at the receiver. In this context, the wireless channel is 
modeled at the link layer (instead of the physical layer) since the link layer modeling is more 
amenable for analysis and simulations of the QoS provisioning system (e.g. delay bound 
or packet loss rate). For these details see Chapter 8 and the discussion on effective channel 
capacity. In the link-layer transmission control module in the architecture, a class-based 
buffering and scheduling mechanism is employed to achieve differentiated services. In 
particular, K QoS priority classes are maintained, with each class of traffic held in a
separate buffer. A strict (nonpre-emptive) priority scheduling policy is employed to serve
packets among the classes. At the video application layer, each video packet is characterized
based on its loss and delay properties, which contribute to the end-to-end video quality and
service. These video packets are then classified and optimally mapped to the classes of the
link transmission module under the rate constraint [57, 58]. The video application layer
QoS and link-layer QoS are allowed to interact with each other and adapt along with the
wireless channel condition. The objective of this interaction and adaptation is to find
a satisfactory QoS tradeoff so that each end user's video service can be supported with
available transmission resources.

Figure 10.3 GOP structure of MPEG-4 PFGS (progressive fine granularity scalable) scalable
video. (Reproduced by permission of IEEE [50].)

Figure 10.3 shows a group-of-picture (GOP) structure of MPEG-4 PFGS (progressive
fine granularity scalable) [56]. Suppose that there are $N$ video frames and $M$ video
layers in one GOP. Owing to the interdependency within the encoding structure, the loss of
any video portion will affect the end-to-end video quality. Each video layer is packetized
into several fixed-size packets before transmission. A video packet cannot contain video
data from more than one video layer or video frame. Packets from the same video layer
are put into the same priority class. Furthermore, suppose that the video playback frame
rate at the end user is fixed at $F$ frames/s. If the mobile terminal starts to play back the
first video frame of a GOP at time $T_p$, video frame $n$ in the same GOP should be received
and be ready to be displayed before time $T_j(n) = T_p + (n - 1)/F$ for uninterrupted playback.

Let $p = (p_1, p_2, \ldots, p_M)$ be the mapping policy from $M$ video layers to $K$ priority
classes, where $p_j \in \{0, 1, \ldots, K\}$ is the priority class over which video layer $j$ is
transmitted; $p_j = 0$ means that video layer $j$ is not transmitted. The overall expected
distortion under the mapping policy $p$ can be derived using the dependency structure
of scalable video [50, 57], as shown in Figure 10.3, and can be expressed as follows:

$$D(p) = D_0 - \sum_{j=1}^{M} \Delta D_{p,j} \qquad (10.1)$$

where

$$\Delta D_{p,j} = \Delta D_j \prod_{j' \preceq j} \left[ 1 - \beta_{j'}(\theta_{p_{j'}}, \phi_{p_{j'}}) \right] \qquad (10.2)$$

$D(p)$ is the total expected distortion from mapping $N$ scalable frames to $K$ different priority
classes with an allocation policy $p$, $D_0$ is the expected distortion if no video data are
received, and $\Delta D_j$ is the distortion reduction if video layer $j$ is correctly received. The term
$\prod_{j' \preceq j} [1 - \beta_{j'}(\theta_{p_{j'}}, \phi_{p_{j'}})]$ in Equation (10.2) is the probability that video layer $j$ and all
video layers $j'$ on which video layer $j$ depends are correctly received, when video layer $j'$ is
transmitted over the priority class $p_{j'}$. The priority class $p_{j'}$ has QoS exponents $\theta_{p_{j'}}$ and $\phi_{p_{j'}}$,
which correspond to its guaranteed buffer overflow and delay bound probability, respectively
(for details see Chapter 8). On the other hand, $\beta_{j'}(\theta_{p_{j'}}, \phi_{p_{j'}})$ is the probability that video
layer $j'$ is lost due to either buffer overflow or playback deadline violation when transmitted
over priority class $p_{j'}$ (see again Chapter 8 for details).

The optimal mapping problem can be formally stated as follows. Given the set of rate
constraints under the priority transmission system described above, and the expected
channel service rate $r$, which can be considered stationary over a time period $t$ corresponding
to one GOP, what is the optimal mapping policy $p$ from one GOP with $N$ scalable frames
(coded in $M$ video layers) to $K$ priority classes such that $D(p)$ is minimized? This can be
expressed as

$$\min_{p} D(p)$$

$$\text{s.t.} \quad \sum_{\forall j,\, p_j = i} b_j^{p_j} \le \mu_i(k_i) \cdot t, \quad i = 1, \ldots, K \qquad (10.3)$$

$$\sum_{i=1}^{K} \mu_i(k_i) \le r$$

where $\mu_i(k_i)$ is the rate constraint of priority class $i$, and $b_j^{p_j}$ is the size of video layer $j$,
which will be conveyed by priority class $p_j$. The solution to the optimization problem
follows a constraint-based search that exploits the dependency among the layers [57].
This is a source coding problem and will not be discussed within this chapter.
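
To make the structure of the problem concrete, the following Python sketch evaluates Equations (10.1) and (10.2) and searches over all mappings for a very small instance. It is not the constraint-based search of [57]; plain exhaustive enumeration is substituted here purely for illustration. Several simplifications are assumed and are not taken from the source: each layer depends on all lower-numbered layers, the loss probability depends only on the priority class (`loss_prob`), the per-class budgets `capacity` stand for $\mu_i(k_i) \cdot t$ and are taken to satisfy the sum-rate constraint already, and all names are hypothetical.

```python
from itertools import product


def expected_distortion(policy, d0, delta_d, loss_prob):
    """Evaluate D(p) = D_0 - sum_j dD_j * prod_{j' <= j} (1 - beta_{j'})."""
    distortion = d0
    received = 1.0  # probability that layer j and all lower layers arrive
    for j, cls in enumerate(policy):
        beta = 1.0 if cls == 0 else loss_prob[cls]  # class 0: not transmitted
        received *= 1.0 - beta
        distortion -= delta_d[j] * received
    return distortion


def feasible(policy, sizes, capacity):
    """Check that the layers mapped to each class fit its budget mu_i * t."""
    load = {cls: 0.0 for cls in capacity}
    for j, cls in enumerate(policy):
        if cls != 0:
            load[cls] += sizes[j]
    return all(load[cls] <= capacity[cls] for cls in capacity)


def best_mapping(d0, delta_d, sizes, loss_prob, capacity):
    """Exhaustively search all mappings of M layers to classes 0..K."""
    classes = [0] + sorted(capacity)
    best = None
    for policy in product(classes, repeat=len(delta_d)):
        if not feasible(policy, sizes, capacity):
            continue
        d = expected_distortion(policy, d0, delta_d, loss_prob)
        if best is None or d < best[1]:
            best = (policy, d)
    return best
```

For example, `best_mapping(100, [50, 30, 10], [4, 4, 4], {1: 0.0, 2: 0.5}, {1: 4, 2: 8})` places the base layer in the lossless class 1 and the two enhancement layers in class 2. The search space grows as $(K+1)^M$, which is why a search that exploits the layer dependencies is preferred in practice.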

Video adaptation is based on using a set of QoS bounds to characterize the range of 
video quality requirements and transmission service capabilities. Within this set of bounds, 
QoS parameters of video and transmission service can be adjusted to cope with the time- 
varying and nonstationary wireless link quality. Owing to the time-varying characteristics 
of video content and time-varying wireless channel, the set of bounds is also time-varying. 
Specifically, the QoS bound for video application at time $t$ can be defined as the video
distortion of GOP as

$$P_i(t) = \{ \varepsilon_{i,L}(\theta_{i,L}),\ \varepsilon_{i,U}(\theta_{i,U}) \} \qquad (10.4)$$

where $\varepsilon_{i,L}(\theta_{i,L})$ and $\varepsilon_{i,U}(\theta_{i,U})$ are the respective lower and upper bounds of the
guaranteed buffer overflow probability of priority class $i$, corresponding to the QoS exponents
$\theta_{i,L}$ and $\theta_{i,U}$. Similarly, the rate constraint corresponding to the statistical QoS guarantee
can be expressed as



(10.5) 
where $\mu_i(\theta_{i,L})$ and $\mu_i(\theta_{i,U})$ are the respective rate constraints corresponding to
$\varepsilon_{i,L}(\theta_{i,L})$ and $\varepsilon_{i,U}(\theta_{i,U})$. Note that the range for guaranteed packet delay can also be
obtained from the guaranteed buffer overflow probability (see Chapter 8).

Figure 10.4 Interaction between video applications and priority networks for QoS adaptation.

Cross-layer interaction in the video QoS adaptation process consists of optimally adjusting
the video encoding behavior based on the QoS bounds. Basically, the link layer estimates the
available effective channel capacity r using the methods described in Chapter 8. Based on
this estimate, the encoder optimally maps the frames into the priority classes as defined by
Equation (10.3). The rate at which this process is updated depends on the rate of channel
variation. The process is illustrated in Figure 10.4. A PSNR improvement of the order of
3 dB has been demonstrated with such a scheme [50].
11 

Mobility Management 



11.1 INTRODUCTION 



As already indicated in Chapter 1 (see Figure 1.1), 4G wireless networks will integrate 
services of different segments such as cellular networks, WLAN, WPAN and even LEO 
satellites. Several alternative backbone networks will be used, like the public land mobile 
networks (PLMN), mobile Internet protocol (mobile IP) networks, wireless asynchronous 
transfer mode (WATM) networks, and low Earth orbit (LEO) satellite networks. Regardless 
of the network, one of the most important and challenging problems for wireless com- 
munication and computing is mobility management [1-62]. Mobility management enables 
telecommunication networks to locate roaming terminals for call delivery and to maintain 
connections as the terminal is moving into a new service area, a process known as handoff. 
The handoff may be executed between different segments (cells) of the same or different 
systems. The handoff event is caused by radio link degradation or initiated by the system 
that rearranges radio channels in order to avoid congestion. Our focus in this section is on 
the first kind of handoff, where the cause of handoff is poor radio quality due to a change 
in the environment or the movement of the wireless terminal. For example, the mobile user 
might cross cell boundaries and move to an adjacent cell while the call is in process. In this 
case, the call must be handed off to the neighboring cell in order to provide uninterrupted 
service to the mobile subscriber. If adjacent cells do not have enough channels to support 
the handoff, the call is forcibly terminated. In systems where the cell size is relatively small 
(microcellular systems), the handoff procedure has an important effect on the performance 
of the system. Here, an important issue is to limit the probability of forced call termination, 
because from the point of view of a mobile user, forced termination of an ongoing call is 
less desirable than blocking a new call. Therefore, the system must reduce the chances of 
unsuccessful handoffs by reserving some channels explicitly for handoff calls. For exam- 
ple, handoff prioritizing schemes are channel assignment strategies that allocate channels 
to handoff requests more readily than new calls. 
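
A simple instance of such a prioritizing scheme is a guard-channel rule, sketched below in Python. The sketch is an illustration under assumptions rather than a scheme taken from the text: the function name, the `guard` parameter and the two call-type labels are hypothetical, and the rule simply refuses new calls once the number of free channels has dropped to the reserved amount.

```python
def admit(call_type: str, busy: int, total: int, guard: int) -> bool:
    """Decide whether a cell with `total` channels accepts a call.

    `guard` channels are reserved for handoff calls (assumed parameter):
    a new call is accepted only while more than `guard` channels are free,
    whereas a handoff call is accepted whenever any channel is free.
    """
    free = total - busy
    if call_type == "handoff":
        return free > 0
    if call_type == "new":
        return free > guard
    raise ValueError(f"unknown call type: {call_type}")
```

With ten channels and two guard channels, a new call arriving when eight channels are busy is blocked, while a handoff call arriving at the same moment is accepted. A larger `guard` lowers the probability of forced termination at the cost of blocking more new calls.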
Figure 11.1 Components of location management process. 



Thus, mobility management supports mobile terminals, allowing users to roam while 
simultaneously offering them incoming calls and supporting calls in progress. Mobility 
management consists of location management and handoff management. 

Location management is a process that enables the network to discover the current 
attachment point of the mobile user for call delivery. The main components of the process 
are shown in Figure 11.1. The first segment is location registration (or location update). 
In this stage, the mobile terminal periodically notifies the network of its new access point, 
allowing the network to authenticate the user and revise the user's location profile. The 
second segment is call delivery. Here the network is queried for the user location profile and 
the current position of the mobile host is found. The main issues in location management 
involve database architecture design, design of messaging procedures and the transmission 
of signaling messages between various components of a signaling network. Other issues 
include: security, dynamic database updates, querying delays, terminal paging methods and 
paging delays. 

Handoff (or handover) management enables the network to maintain a user's connection 
as the mobile terminal continues to move and change its access point to the network. The 
three-stage process for handoff first involves initiation, where the user, a network agent 
or changing network conditions identify the need for handoff. The second stage is new 
connection generation, where the network must find new resources for the handoff con- 
nection and perform any additional routing operations. Under network-controlled handoff 
(NCHO) or mobile-assisted handoff (MAHO), the network generates a new connection, by 
finding new resources for the handoff and performing any additional routing operations. For 
mobile-controlled handoff (MCHO), the mobile terminal finds the new resources and the 
network approves. The final stage is data-flow control, where the delivery of the data from 
the old connection path to the new connection path is maintained according to agreed-upon 
QoS. The segments of handoff management are presented in Figure 11.2. 

Handoff management includes two conditions: intracell handoff and intercell handoff.
Intracell handoff occurs when the user moves within a service area (or cell) and experiences
signal strength deterioration below a certain threshold, which results in the transfer of the
user's calls to new radio channels of appropriate strength at the same base station (BS).
Intercell handoff occurs when the user moves into an adjacent cell and all of the terminal's
connections must be transferred to a new BS. While performing handoff, the terminal may
connect to multiple BSs simultaneously and use some form of signaling diversity to combine
the multiple signals. This is called soft handoff. On the other hand, if the terminal stays
connected to only one BS at a time, clearing the connection with the former BS immediately
before or after establishing a connection with the target BS, then the process is referred to
as hard handoff. Handoff management issues are: efficient and expedient packet processing,
minimizing the signaling load on the network, optimizing the route for each connection,
efficient bandwidth reassignment and refining quality of service for wireless connections.
Below we discuss handoff management in some of the component networks of the 4G
integrated wireless network concept suggested by Figure 1.1.

Figure 11.2 Components of handoff management.

11.1.1 Mobility management in cellular networks 

Mobile terminals (MTs) are free to travel and thus the network access point of an MT 
changes as it moves around the network coverage area. As a result, the ID of an MT does 
not implicitly provide the location information of the MT and the call delivery process 
becomes more complex. Current PLMN location management strategies 
require each MT to register its location with the network periodically. In order to perform 
the registration, update and call delivery operations described above, the network stores the 
location information of each MT in the location databases. Then the information can be 
retrieved for call delivery. 

Current schemes for PLMN location management are based on a two-level data hierarchy 
such that two types of network location database, the home location register (HLR) and 
the visitor location register (VLR), are involved in tracking an MT. In general, there is 
an HLR for each network and a user is permanently associated with an HLR in his/her 
subscribed network. Information about each user, such as the types of services subscribed 
and location information, is stored in a user profile located at the HLR. The number of 
VLRs and their placements vary among networks. Each VLR stores the information of the 
MTs (downloaded from the HLR) visiting its associated area. 

Network management functions, such as call processing and location registration, are 
achieved by the exchange of signaling messages through a signaling network. Signaling 
system 7 (SS7), described in Chapter 1 [34, 38, 63], is the protocol used for signaling 
exchange, and the signaling network is referred to as the SS7 network. 

The type of cell site switch (CSS) currently implemented for the PLMN is known as 
a mobile switching center (MSC). Figure 11.3 shows the SS7 signaling network which 
connects the HLR, the VLRs, and the MSCs in a PLMN-based network. The signal transfer 
points (STPs), as shown in Figure 11.3, are responsible for routing signaling messages. 

As mentioned previously, location management includes two major tasks: location regis- 
tration (or update) and call delivery. In order to correctly deliver calls, the PLMN must keep 
track of the location of each MT. As described previously, location information is stored 
in two types of databases, VLR and HLR. As the MTs move around the network coverage 
area, the data stored in these databases may no longer be accurate. To ensure that calls can 
be delivered successfully, the databases are periodically updated through the process called 
location registration. 

Location registration is initiated by an MT when it reports its current location to the 
network. This reporting process is referred to as location update. Current systems adopt an 
approach such that the MT performs a location update whenever it enters a new location area 
(LA). Each LA consists of a number of cells and, in general, all base station transmission 
systems (BTSs) belonging to the same LA are connected to the same MSC. 

When an MT enters an LA, if the new LA belongs to the same VLR as the old LA, the 
record at the VLR is updated to record the ID of the new LA. Otherwise, if the new LA 
belongs to a different VLR, a number of extra steps are required to: (1) register the MT at 
the new serving VLR; (2) update the HLR to record the ID of the new serving VLR; and 
(3) deregister the MT at the old serving VLR. Figure 11.4 shows the location registration 
procedure when an MT moves to a new LA. The following is the ordered list of tasks that 
are performed during location registration. 

(1) The MT enters a new LA and transmits a location update message to the new BS. 

(2) The BS forwards the location update message to the MSC which launches a regis- 
tration query to its associated VLR. 

(3) The VLR updates its record on the location of the MT. If the new LA belongs to a
different VLR, the new VLR determines the address of the HLR of the MT from
its mobile identification number (MIN). This is achieved by a table lookup procedure
called global title translation. The new VLR then sends a location registration
message to the HLR. Otherwise, location registration is complete.

(4) The HLR performs the required procedures to authenticate the MT and records the
ID of the new serving VLR of the MT. The HLR then sends a registration
acknowledgment message to the new VLR.

(5) The HLR sends a registration cancellation message to the old VLR.

(6) The old VLR removes the record of the MT and returns a cancellation
acknowledgment message to the HLR.

Figure 11.4 Location registration procedures.
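
The registration steps listed above can be sketched as two small Python classes. This is a toy model under assumptions, not an IS-41 implementation: there is a single HLR (so global title translation is omitted), authentication and acknowledgment messages are not modeled, steps (1) and (2) are collapsed into a direct call to the VLR, and all class and method names are hypothetical.

```python
class HLR:
    """Home location register: maps each MT to its serving VLR."""

    def __init__(self):
        self.serving_vlr = {}

    def register(self, mt_id, new_vlr):
        # Step 4: record the new serving VLR (authentication omitted here).
        old_vlr = self.serving_vlr.get(mt_id)
        self.serving_vlr[mt_id] = new_vlr
        # Steps 5-6: cancel the registration at the old serving VLR.
        if old_vlr is not None and old_vlr is not new_vlr:
            old_vlr.cancel(mt_id)


class VLR:
    """Visitor location register: maps each visiting MT to its current LA."""

    def __init__(self, name, hlr):
        self.name = name
        self.hlr = hlr
        self.location_area = {}

    def update(self, mt_id, la_id):
        # Step 3: update the local record; contact the HLR only if the MT
        # was not previously served by this VLR.
        is_new_visitor = mt_id not in self.location_area
        self.location_area[mt_id] = la_id
        if is_new_visitor:
            self.hlr.register(mt_id, self)

    def cancel(self, mt_id):
        self.location_area.pop(mt_id, None)
```

A move between two LAs of the same VLR touches only that VLR's record, whereas a move to an LA of a different VLR updates the HLR and removes the MT from the old VLR, which is the source of the long-distance signaling cost discussed later in this chapter.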

Call delivery consists of two major steps: (1) determining the serving VLR of the called 
MT and (2) locating the visiting cell of the called MT. Locating the serving VLR of the MT 
involves the following procedure, shown in Figure 11.5: 

(1) The calling MT sends a call initiation signal to the serving MSC of the MT through 
a nearby BS. 

(2) The MSC determines the address of the HLR of the called MT by global title 
translation and sends a location request message to the HLR. 

(3) The HLR determines the serving VLR of the called MT and sends a route request 
message to the VLR. This VLR then forwards the message to the MSC serving 
the MT. 

(4) The MSC allocates a temporary identifier called temporary local directory number 
(TLDN) to the MT and sends a reply to the HLR together with the TLDN. 

(5) The HLR forwards this information to the MSC of the calling MT. 

(6) The calling MSC requests a call setup to the called MSC through the SS7 network. 
Figure 11.5 Call delivery procedures. 
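
The message chain of the call delivery procedure can be sketched in the same spirit. The Python model below is a simplified illustration under assumptions: a single HLR is used (no global title translation), the SS7 call setup of step (6) is not modeled, the TLDN format is invented for the example, and all class and function names are hypothetical.

```python
from itertools import count


class ServingMSC:
    """MSC serving the called MT; hands out TLDNs (step 4)."""

    def __init__(self, name):
        self.name = name
        self._next = count(1)
        self.tldn_to_mt = {}

    def allocate_tldn(self, mt_id):
        # Hypothetical TLDN format: "<MSC name>-<running number>".
        tldn = f"{self.name}-{next(self._next)}"
        self.tldn_to_mt[tldn] = mt_id
        return tldn


class VLR:
    def __init__(self, msc):
        self.msc = msc

    def route_request(self, mt_id):
        # Step 3 (second half): forward the request to the serving MSC.
        return self.msc.allocate_tldn(mt_id)


class HLR:
    def __init__(self):
        self.serving_vlr = {}  # filled in by location registration

    def location_request(self, mt_id):
        # Steps 3-5: ask the serving VLR and relay the TLDN to the caller.
        return self.serving_vlr[mt_id].route_request(mt_id)


def deliver_call(hlr, called_mt):
    """Step 2 as seen from the calling MSC; returns the TLDN to dial."""
    return hlr.location_request(called_mt)
```

Every call passes through the HLR in this model, even when the calling and called MTs are served by neighboring MSCs.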

The procedure described above allows the network to set up a connection from the calling 
MT to the serving MSC of the called MT. Since each MSC is associated with an LA and 
there is more than one cell in each LA, a mechanism is therefore necessary to determine the 
cell location of the called MT. In current PLMN networks, this is achieved by a paging (or 
alerting) procedure, such that polling signals are broadcast to all cells within the residing 
LA of the called MT. On receiving the polling signal, the MT sends a reply which allows 
the MSC to determine its current residing cell. As the number of MTs increases, sending 
polling signals to all cells in an LA whenever a call arrives may consume excessive wireless 
bandwidth. We describe a number of proposed paging mechanisms for reducing the paging 
cost later. 

11.1.2 Location registration and call delivery in 4G 

Location registration involves the updating of location databases when current location 
information is available. On the other hand, call delivery involves the querying of location 
databases to determine the current location of a called MT. These can be costly processes, 
especially when the MT is located far away from its assigned HLR. For example, if the
MT is currently roaming in the USA and its HLR is in Finland, a location registration message
is transmitted from the USA to Finland whenever the MT moves to a new LA that belongs
to a different VLR. Under the same scenario, when a call for the MT originates from a
nearby MT in the USA, the MSC of the calling MT must first query the HLR in Finland before
it finds out that the called MT is located in the same area as the caller. As the number of
mobile subscribers keeps increasing, the volume of signaling traffic generated by location
management becomes extremely high. Methods for reducing the signaling traffic in 4G networks
are therefore needed.

Research in this area generally falls into two categories. In the first category, extensions 
to the existing location management strategy are developed which aim at improving the 
IS-41 (or GSM/UMTS) scheme while keeping the basic database network architecture 
unchanged. This type of solution has the advantage of easy adaptation to the current PLMN 
networks without major modification. These schemes are based on centralized database 
architectures inherited from the existing standards. Another category of research results 
lies in completely new database architectures that require a new set of schemes for location 
registration and call delivery. Most of these schemes are based on distributed database 
architectures. Some additional research efforts involve: the reverse virtual call setup - a new 
scheme for delivering mobile-terminated calls [23], an optimal routing scheme based on the 
ratio of source messaging to location update rates [61], and a single registration strategy for 
multitier PCS systems [33]. In what follows, we discuss centralized vs distributed database 
architectures. 

11.1.2.1 Centralized database architectures 

This solution consists of the two-tier database structure with additional optimizations that 
aim to reduce the location management cost. The extension to include inter-technology 
roaming is also expected. 

Dynamic hierarchical database architecture 

The first centralized database architecture is the dynamic hierarchical database architecture 
presented in Ho and Akyildiz [18]. The proposed architecture is based on that of the IS-41 
standard with the addition of a new level of databases called directory registers (DRs). Each 
DR covers the service area of a number of MSCs. The primary function of the DRs is to 
compute periodically and store the location pointer configuration for the MTs in its service 
area. Each MT has its unique pointer configuration and three types of location pointers are 
available at the DR: 

(1) A local pointer is stored at an MT's serving DR, which indicates the current serving
MSC of the MT.

(2) A direct remote pointer is stored at a remote DR, which indicates the current serving
MSC of the MT.

(3) An indirect remote pointer is stored at a remote DR, which indicates the current 
serving DR of the MT. 

In addition, the HLR of the MT may be configured to store a pointer to either the serving 
DR or the serving MSC of the MT. In some cases, it may be more cost-effective not to set 
up any pointers, and the original IS-41 scheme will be used. 

As an example, suppose that intertechnology roaming is supported, that the HLR of a 
given MT is located in Finland and that the MT is currently roaming in Chicago. If a 
significant number of the incoming calls for the MT originate from Los Angeles, a direct or 
indirect remote pointer can be set up for the MT in the DR serving the Los Angeles area. When 
the next call is initiated for this MT from Los Angeles, the calling MSC first queries the 
DR and the call can be immediately forwarded to Chicago without requiring a query at the 
HLR, which is located in Finland. This reduces the signaling overhead for call delivery. 
On the other hand, the HLR can be set up to record the ID of the serving DR (instead of 
the serving MSC) of the MT. When the MT moves to another MSC within the same LA in 
the Illinois area, only the local pointer at the serving DR of the MT has to be updated. Again, 
it is not necessary to access the HLR in Finland. This reduces the signaling overhead for 
location registration. The advantage of this scheme is that it can reduce the overhead for 
both location registration and call delivery. 
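
The call delivery path in the example above can be sketched in a few lines of Python. This is an illustration only: the class DirectoryRegister, the function locate and the encoding of a pointer as a (kind, target) pair are assumptions made here and are not taken from [18].

```python
from dataclasses import dataclass, field


@dataclass
class DirectoryRegister:
    """Hypothetical DR holding one pointer per MT: (kind, target)."""
    name: str
    pointers: dict = field(default_factory=dict)


def locate(mt, calling_dr, drs, hlr):
    """Return (serving MSC, databases queried) for a call to mt."""
    queried = [calling_dr.name]
    entry = calling_dr.pointers.get(mt)
    if entry is None:
        # No pointer at the calling DR: query the HLR as in IS-41.
        queried.append("HLR")
        entry = hlr[mt]
    kind, target = entry
    if kind == "dr":
        # Indirect pointer: the serving DR holds the local pointer.
        queried.append(target)
        kind, target = drs[target].pointers[mt]
    return target, queried


chicago = DirectoryRegister("DR-Chicago", {"mt1": ("msc", "MSC-Chicago-2")})
los_angeles = DirectoryRegister("DR-LA", {"mt1": ("dr", "DR-Chicago")})
new_york = DirectoryRegister("DR-NY")          # no pointer set up for mt1
drs = {d.name: d for d in (chicago, los_angeles, new_york)}
hlr = {"mt1": ("dr", "DR-Chicago")}            # HLR records the serving DR

via_pointer = locate("mt1", los_angeles, drs, hlr)
via_hlr = locate("mt1", new_york, drs, hlr)
```

In this sketch the call from Los Angeles is resolved by querying two DRs only, whereas the call from the DR without a pointer has to go through the HLR first.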

11.1.2.2 Multiple-copy location information strategies 

A number of different strategies have been considered to facilitate the search for the user 
location. The basic idea of the per-user location caching strategy [25] is that the volume of 
signaling and database access traffic for locating an MT can be reduced by maintaining a 
cache of location information at a nearby STP. Whenever the MT is accessed through the 
STP, an entry is added to the cache which contains a mapping from the ID of the MT to 
that of its serving VLR. When another call is initiated for an MT, the STP first checks if a 
cache entry exists for the MT. If no cache entry for the MT exists, the call delivery scheme 
as described earlier is used to locate the MT. If a cache entry exists, the STP will query 
the VLR as specified by the cache. If the MT is still residing under the same VLR, a hit 
occurs and the MT is found. If the MT has already moved to another location which is not 
associated with the same VLR, a miss occurs and the call delivery scheme is used to locate 
the MT. 

In order to reduce the number of misses, it is suggested in Lin [35] that cache entries 
should be invalidated after a certain time interval. Based on the mobility and call arrival 
parameters, the author [35] introduces a threshold scheme which determines the time when 
a particular cached location information should be cleared such that the cost for call delivery 
can be reduced. 
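
The hit, miss and invalidation cases described above can be illustrated with a minimal Python sketch. The function deliver_call, the fixed threshold ttl and the argument actual_vlr (which stands in for the network's knowledge of the MT's true location) are assumptions introduced here; the threshold computation of [35] is not reproduced.

```python
def deliver_call(cache, mt, now, ttl, actual_vlr):
    """Return the outcome of one call delivery and refresh the cache.

    cache maps an MT id to (VLR id, time the entry was stored).
    """
    entry = cache.get(mt)
    if entry is not None and now - entry[1] > ttl:
        del cache[mt]            # entry older than the threshold: invalidate
        entry = None
    if entry is None:
        outcome = "no-entry"     # standard call delivery is used
    elif entry[0] == actual_vlr:
        outcome = "hit"          # MT found with a single VLR query
    else:
        outcome = "miss"         # wasted VLR query, then standard delivery
    cache[mt] = (actual_vlr, now)   # the MT was accessed through this STP
    return outcome


cache = {}
outcomes = [
    deliver_call(cache, "mt1", 0, 10, "VLR-A"),
    deliver_call(cache, "mt1", 5, 10, "VLR-A"),
    deliver_call(cache, "mt1", 8, 10, "VLR-B"),
    deliver_call(cache, "mt1", 30, 10, "VLR-B"),
]
```

A miss costs more than having no entry at all, which is why clearing stale entries can lower the average call delivery cost.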

User profiles can be replicated at selected local databases. When a call is initiated for a 
remote MT, the network first determines if a replication of the called MT's user profile is 
available locally. If the user profile is found, no HLR query is necessary and the network 
can locate the called MT based on the location information available at the local database. 
Otherwise, the network locates the called MT following the standard procedures. When 
the MT moves to another location, the network updates all replications of the MT's user 
profile. This results in higher signaling overhead for location registration. Depending on 
the mobility rate of the MT and the call arrival rate from each location, this method may 
significantly reduce the signaling and database access overhead for location management. 

11.1.2.3 Local extensions 

Pointer forwarding and local anchoring are the strategies where only the far-end segment of 
the route is modified. The basic idea of the pointer forwarding strategy [24] is that, instead 
of reporting a location change to the HLR every time the MT moves to an area belonging 
to a different VLR, the reporting can be eliminated by simply setting up a forwarding 
pointer from the old VLR to the new VLR. When a call for the MT is initiated, the network 
locates the MT by first determining the VLR at the beginning of the pointer chain and then 
Figure 11.6 Pointer forwarding strategy. 

following the pointers to the current serving VLR of the MT. To minimize the delay in 
locating an MT, the length of the pointer chain is limited to a predefined maximum value K. 
Figure 11.6 demonstrates the operation of pointer forwarding. A slight modification of the 
above scheme is the local anchoring [19], where a VLR close to the MT is selected as its 
local anchor. Instead of transmitting registration messages to the HLR, location changes 
are reported to the local anchor. Since the local anchor is close to the MT, the signaling cost 
incurred in location registration is reduced. The HLR keeps a pointer to the local anchor. 
When an incoming call arrives, the HLR queries the local anchor of the called MT which, 
in turn, queries the serving VLR to obtain a routable address to the called MT. Figure 11.7 
demonstrates the local anchoring scheme. 
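
A minimal Python sketch of pointer forwarding with a bounded chain follows. The class ForwardingChain and its attribute names are hypothetical; the chain of pointers is modeled as a list, and it is assumed that the location is reported to the HLR once the chain already holds K pointers.

```python
class ForwardingChain:
    """Forwarding pointers for one MT, limited to max_len pointers (K)."""

    def __init__(self, first_vlr, max_len):
        self.max_len = max_len
        self.chain = [first_vlr]     # chain[0] is the VLR known to the HLR
        self.hlr_updates = 0

    def move(self, new_vlr):
        if len(self.chain) - 1 < self.max_len:
            self.chain.append(new_vlr)   # old VLR now points to the new VLR
        else:
            self.chain = [new_vlr]       # chain is full: update the HLR
            self.hlr_updates += 1

    def locate(self):
        """Return (serving VLR, number of pointers followed)."""
        return self.chain[-1], len(self.chain) - 1


fc = ForwardingChain("VLR-0", max_len=2)
fc.move("VLR-1")
fc.move("VLR-2")
before_reset = fc.locate()
fc.move("VLR-3")                     # third move exceeds K = 2
after_reset = fc.locate()
```

The value of K trades registration cost against call delivery delay: a longer chain saves HLR updates but adds hops when a call arrives.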



11.1.2.4 Distributed database architectures 

This type of solution is a further extension of the multiple copy concept and consists of 
multiple databases distributed throughout the network coverage area. In a fully distributed 
registration scheme [18] the two-level HLR/VLR database architecture is replaced by a 
large number of location databases. These location databases are organized as a tree with 
the root at the top and the leaves at the bottom. The MTs are associated with the leaf (lowest 
level) location databases and each location database contains location information of the 
MTs that are residing in its subtree. 

Database hierarchy, introduced in Anantharam et al. [10], is similar to the fully distributed 
registration scheme [58]. Here, MTs may be located at any node of the tree hierarchy (not 
limited to the leaf nodes). The root of the tree contains a database but it is not necessary 
for other nodes to have databases installed. These databases store pointers for MTs. If an 
MT is residing at the subtree of a database, a pointer is set up in this database pointing to 
the next database along the path to the MT. If there is no more database along this path, the 
pointer points to the residing node of the MT. When a call for an MT is initiated at a node 
on the tree, the called MT can be located by following the pointers of the MT. 
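
The pointer-following search in the database hierarchy can be sketched as follows. The dictionaries parent and pointers and the function locate are illustrative assumptions; in particular, it is assumed here that the search first climbs from the calling node to the nearest database holding a pointer for the MT and then descends along the pointers.

```python
def locate(mt, start, parent, pointers):
    """Return (residing node, nodes visited from the first database down).

    parent maps each node to its parent (the root maps to None).
    pointers maps each node hosting a database to {MT id: next node}.
    """
    node = start
    # Climb until a database that holds a pointer for the MT is reached.
    while mt not in pointers.get(node, {}):
        if parent[node] is None:
            raise LookupError(f"{mt} is not registered")
        node = parent[node]
    visited = [node]
    # Descend along the pointers until no further pointer exists.
    while mt in pointers.get(node, {}):
        node = pointers[node][mt]
        visited.append(node)
    return node, visited


parent = {"root": None, "A": "root", "B": "root",
          "a1": "A", "a2": "A", "b1": "B"}
pointers = {"root": {"mt1": "A"}, "A": {"mt1": "a2"}}   # B has no database

from_far = locate("mt1", "b1", parent, pointers)
from_near = locate("mt1", "a1", parent, pointers)
```

A call from a node in the same subtree as the MT never reaches the root, which is the main benefit of distributing the pointers over the tree.
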
Figure 11.7 Local anchoring scheme. 

Another form of distributed concept is partitioning. Since the mobility pattern of the 
MTs varies among locations, partitions can be generated by grouping location servers 
among which the MT moves frequently. Location registration is performed only when the 
MT enters a partition. 

11.1.2.5 Location update and terminal paging options for 4G 

Current PCS networks partition their coverage areas into a number of LAs. Each LA consists 
of a group of cells and each MT performs a location update when it enters an LA. When an 
incoming call arrives, the network locates the MT by simultaneously paging all cells within 
the LA. The main drawbacks of this location update and paging scheme are: 

(1) Requiring the network to poll all cells within the LA each time a call arrives may 
result in excessive volume of wireless broadcast traffic. 

(2) The mobility and call arrival patterns of MTs vary, and it is generally difficult to 
select an LA size that is optimal for all users. An ideal location update and paging 
mechanism should be able to adjust on a per-user basis. 

(3) Finally, excessive location updates may be performed by MTs that are located around 
LA boundaries and are making frequent movements back and forth between two LAs. 

In addition, the LA-based location update and paging scheme is a static scheme as it 
cannot be adjusted based on the parameters of an MT from time to time. Recent research 
efforts for 4G have attempted to reduce the effects of these inefficiencies. Excessive location 
updates are discussed in Rose [51] and Bar-Noy et al. [12]. A timer-based strategy that uses a 
universal timeout parameter is presented in Rose [51], while a tracking strategy for mobile 
users in PCS networks based on cell topology is explored and compared with the time- 
based strategy in Bar-Noy et al. [12]. For excessive polling, a one-way paging network 
architecture and the interfaces among paging network elements are examined in Lin [30]. 
Additional schemes attempt to reduce the cost of finding a user when the MT moves during 
the paging process [48, 62]. Many recent efforts have focused primarily on dynamic location 
update mechanisms which perform location update based on the mobility of the MTs and 
the frequency of incoming calls. In the following, a number of dynamic location update and 
paging schemes are presented. 

Location updating 

The standard LA-based location update method does not allow adaptation to the mobility 
characteristics of the MTs. The 4G solutions should allow dynamic selection of location 
update parameters, resulting in lower cost. Dynamic LA management introduces a method 
for calculating the optimal LA size given the respective costs for location update and cell 
polling. A mesh cell configuration with square-shaped cells is considered. Each LA consists 
of k × k cells arranged in a square, and the value of k is selected on a per-user basis according 
to the mobility and call arrival patterns and the cost parameters. This mechanism performs 
better than the static scheme in which LA size is fixed. However, it is generally not easy to 
use different LA sizes for different MTs as the MTs must be able to identify the boundaries of 
LAs, which are continuously changing. The implementation of this scheme is complicated 
when cells are hexagonal shaped, or in the worst case, when irregular cells are used. 
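
The trade-off behind the per-user choice of k can be illustrated with a toy cost model in Python. The model is an assumption made for this illustration and is not the one used in the cited work: the location update rate is taken to be inversely proportional to k, and every incoming call is assumed to page all k × k cells of the LA.

```python
def best_la_size(move_rate, call_rate, update_cost, poll_cost, k_max=50):
    """Return the k in 1..k_max that minimizes the toy cost per unit time."""

    def cost(k):
        updates = update_cost * move_rate / k    # fewer LA crossings as k grows
        paging = poll_cost * call_rate * k * k   # each call polls k*k cells
        return updates + paging

    return min(range(1, k_max + 1), key=cost)


# A highly mobile user with few calls versus the same user with many calls.
k_mobile = best_la_size(move_rate=10, call_rate=0.1, update_cost=10, poll_cost=1)
k_busy = best_la_size(move_rate=10, call_rate=5, update_cost=10, poll_cost=1)
```

Under these assumed parameters the first user is best served by a large LA and the second by a small one, which is the motivation for selecting k per user.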

Dynamic update schemes are also examined in Reference [13]. In a time-based scheme 
an MT performs location updates periodically at a constant time interval ΔT. Figure 11.8 
shows the path of an MT. If a location update occurred at location A at time 0, subsequent 
location updates will occur at locations B, C and D if the MT moves to these locations 
at times ΔT, 2ΔT and 3ΔT, respectively. In a movement-based scheme, an MT performs 
a location update whenever it completes a predefined number of movements across cell 
boundaries (this number is referred to as the movement threshold). Assuming a movement 
threshold of three is used, the MT performs location updates at locations B and C, as shown 
in Figure 11.9. In a distance-based scheme an MT performs a location update when its 
distance from the cell where it performed the last location update exceeds a predefined 




Figure 11.8 Time-based location update scheme. 

Figure 11.9 Movement-based location update scheme. 




Figure 11.10 Distance-based location update scheme. 

value (this distance value is referred to as the distance threshold). Figure 11.10 shows the 
same path as Figure 11.8. A location update is performed at location B where the distance 
of the MT from location A exceeds the threshold distance (solid line). 
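
The three triggers can be compared on a common path with the short Python sketch below. The path, the function names and the use of the Manhattan distance on a square grid are assumptions chosen for illustration; the path is not the one drawn in the figures.

```python
def time_based(cells, period):
    """Update at every multiple of the period (one list entry per time unit)."""
    return [t for t in range(1, len(cells)) if t % period == 0]


def movement_based(cells, threshold):
    """Update after every `threshold` cell boundary crossings."""
    updates, crossings = [], 0
    for t in range(1, len(cells)):
        if cells[t] != cells[t - 1]:
            crossings += 1
        if crossings == threshold:
            updates.append(t)
            crossings = 0
    return updates


def distance_based(cells, threshold):
    """Update when the distance from the last update cell exceeds threshold."""
    updates, last = [], cells[0]
    for t in range(1, len(cells)):
        dist = abs(cells[t][0] - last[0]) + abs(cells[t][1] - last[1])
        if dist > threshold:
            updates.append(t)
            last = cells[t]
    return updates


# Cell occupied at times 0, 1, ..., 8; the MT wanders close to its start.
path = [(0, 0), (1, 0), (1, 0), (1, 1), (0, 1), (0, 0), (0, 0), (1, 0), (2, 0)]
t_updates = time_based(path, 3)
m_updates = movement_based(path, 3)
d_updates = distance_based(path, 2)
```

On this path the distance-based trigger never fires, because the MT stays within two cells of its last update, while the other two schemes each perform two updates.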

An iterative distance-based location update scheme introduces an iterative algorithm 
that can generate the optimal threshold distance that results in the minimum cost. When 
an incoming call arrives, cells are paged in a shortest-distance-first order such that cells 
closest to the cell where the last location update occurred are polled first. The delay in 
locating an MT is, therefore, proportional to the distance traveled since the last location 
update. Results demonstrate that, depending on the mobility and call arrival parameters, 
the optimal movement threshold varies widely. This demonstrates that location update 
schemes should be per-user-based and should be dynamically adjusted according to the 
current mobility and call arrival pattern of the user. However, the number of iterations 
required for this algorithm to converge varies depending on the mobility and call arrival 
parameters considered. Determining the optimal threshold distance may require significant 
computation resources at the MT. 
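
Shortest-distance-first paging can be sketched as a ring-by-ring search around the cell of the last location update. The square grid, the Chebyshev distance used to define a ring and the function names are assumptions made for this illustration.

```python
def ring(center, r):
    """Cells at Chebyshev distance r from center on a square grid."""
    cx, cy = center
    return [(cx + dx, cy + dy)
            for dx in range(-r, r + 1)
            for dy in range(-r, r + 1)
            if max(abs(dx), abs(dy)) == r]


def page(center, mt_cell, max_ring):
    """Poll ring 0, then ring 1, and so on; return (cycles, cells polled)."""
    polled = 0
    for r in range(max_ring + 1):
        cells = ring(center, r)
        polled += len(cells)
        if mt_cell in cells:
            return r + 1, polled
    return None                      # MT not within max_ring rings


result = page(center=(0, 0), mt_cell=(2, -1), max_ring=3)
```

The number of polling cycles grows with the distance traveled since the last update, which matches the delay behavior described above.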

A dynamic time-based location update scheme is introduced in Akyildiz and Ho [8]. 
The location update time interval is determined after each movement based on the prob- 
ability distribution of the call interarrival time. This scheme does not make any specific 
assumptions on the mobility pattern of the MTs, and the shortest-distance-first paging 
scheme as described above is used. It is demonstrated that the results obtained are close to 
the optimal results given in Ho and Akyildiz [20]. The computation required by this scheme 
is low and it is, therefore, feasible for application in MTs that have limited computing 
power. The time required to locate an MT is directly proportional to the distance traveled 
since the last location update. 

Terminal paging optimization provides a trade-off between paging cost and paging delay. 
Paging subject to delay constraints is considered in Rose and Yates [52]. The authors assume 
that the network coverage area is divided into LAs, and the probability that an MT is residing 
in a LA is given. It is demonstrated that, when delay is unconstrained, the polling cost is 
minimized by sequentially searching the LAs in decreasing order of probability of containing 
the MT. For constrained delay, the authors obtain the optimal polling sequence that results 
in the minimum polling cost. However, the authors assume that the probability distribution 
of user location is provided. This probability distribution may be user-dependent. A location 
update and paging scheme that facilitates derivation of this probability distribution is needed 
in order to apply this paging scheme. The tradeoff between the costs of location update and 
paging is not considered in Rose and Yates [52]. 
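
The unconstrained-delay result can be checked numerically on a small example. In the sketch below the probabilities are made up, each LA is assumed to cost one unit to poll, and the function expected_polls is a name introduced here.

```python
from itertools import permutations


def expected_polls(probs_in_search_order):
    """Expected number of LAs polled when searching sequentially."""
    return sum(i * p for i, p in enumerate(probs_in_search_order, start=1))


probs = [0.1, 0.6, 0.3]              # assumed location probabilities per LA
best_order = min(permutations(probs), key=expected_polls)
best_cost = expected_polls(best_order)
```

The exhaustive search over all orderings selects the decreasing-probability order, in agreement with the result quoted from [52].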

Location update and paging subject to delay constraints is considered in Akyildiz and 
Ho [20]. Again, the authors consider the distance-based location update scheme. However, 
paging delay is constrained such that the time required to locate an MT is smaller than 
or equal to a predefined maximum value. When an incoming call arrives, the residing 
area of the MT is partitioned into a number of subareas. These subareas are then polled 
sequentially to locate the MT. By limiting the number of polling areas to a given value such 
as N, the time required to locate a mobile is smaller than or equal to the time required for 
N polling operations. Given the mobility and call arrival parameters, the threshold distance 
and the maximum delay, an analytical model is introduced that generates the expected cost 
of the proposed scheme. An iterative algorithm is then used to locate the optimal threshold 
distance that results in the lowest cost. It is demonstrated that the cost is the lowest when 
the maximum delay is unconstrained. However, by slightly increasing the maximum delay 
from its minimum value of 1, the cost is significantly lowered. Another scheme using the 
movement-based location update is reported in Akyildiz and Ho [7]. Similar to Ho and 
Akyildiz [20], paging delay is confined to a maximum value. Movement-based location 
update schemes have the advantage that implementation is simple. The MTs do not have to 
know the cell configuration of the network. The scheme introduced in Akyildiz and Ho [7] 
is feasible for use in current PLMN networks. 
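
One simple way to respect a maximum paging delay of N polling cycles is to group the rings around the last update cell into at most N consecutive subareas. The near-equal split used below is an assumption for illustration and is not claimed to be the partitioning used in the cited work.

```python
def partition_rings(num_rings, max_delay):
    """Split rings 0..num_rings-1 into at most max_delay consecutive groups."""
    groups = min(num_rings, max_delay)
    base, extra = divmod(num_rings, groups)
    result, start = [], 0
    for g in range(groups):
        size = base + (1 if g < extra else 0)   # spread the remainder evenly
        result.append(list(range(start, start + size)))
        start += size
    return result


two_cycles = partition_rings(num_rings=5, max_delay=2)
one_cycle = partition_rings(num_rings=4, max_delay=1)
```

With a maximum delay of 1 all rings are polled at once, which is the most expensive case; allowing a second cycle already removes the outer rings from the first poll.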

Thus as a summary, remaining open problems for the 4G PLMN-based backbone are: 
(1) research work should consider the development of dynamic schemes that limit or enhance 
the distribution of location information on a per-user basis; (2) ongoing research efforts 
should attempt to reach some middle ground between centralized database architectures 
and distributed database architectures; and (3) future research should focus on the design 
of dynamic location update and paging schemes that are simple to implement. 

11.1.2.6 Mobility management for mobile IP 

Existing standards for terminal mobility over the Internet are described in Perkins [43, 44]. 
The mobility-enabling protocol for the Internet, mobile IP, enables terminals to move from 
one subnetwork to another as packets are being sent, without interrupting this process. An 
Figure 11.11 Mobile IP architecture. 

MN is a host or router that changes its attachment point from one subnet to another without 
changing its IP address. The MN accesses the Internet via a home agent (HA) or a foreign 
agent (FA). The HA is an Internet router on the MN's home network, while the FA is a 
router on the visited network. The node at the other end of the connection is called the 
correspondent node (CN). A simple mobile IP architecture is illustrated in Figure 11.11. 
In this example, the CN sends packets to the MN via the MN's HA and the FA. [Note that 
the term mobile node (MN) is used instead of mobile terminal (MT) in order to follow 
mobile IP conventions.] As mentioned previously, network organization introduces some 
differences in the way mobility management is handled over the Internet. For example, 
mobile IP allows MNs to communicate their current reachability information to their home 
agent without the use of databases [42]. As a result, mobile IP defines new operations for 
location and handoff management: 

(1) Discovery - how an MN finds a new Internet attachment point when it moves from 
one place to another. 

(2) Registration - how an MN registers with its HA, an Internet router on the MN's home 
network. 

(3) Routing and tunneling - how an MN receives datagrams when it is away from 
home [43]. 

Registration operations include mobile agent discovery, movement detection, forming 
care-of-addresses and binding updates, whereas handoff operations include routing and 
tunneling. Figure 11.12 illustrates the analogous relationships between the location 
management operations for mobile IP and those previously described in Figure 11.1 for PLMN. 

Location registration 

When visiting any network away from home, each MN must have an HA. The MN registers 
with its home agent so that the HA can track the MN's current IP address. There are two IP addresses 
associated with each MN, one for locating and the other one for identification. In the standard 
terminology, the new IP address associated with an MN while it visits a foreign link is called 
its care-of address (CoA). The association between the current CoA and the MN's home 
address is maintained by a mobility binding, so that packets destined for the MN may 
be routed using the current CoA regardless of the MN's current point of attachment to 
the Internet. Each binding has an associated lifetime period, negotiated during the MN's 
Figure 11.13 Mobile IP location registration. 



registration, and after which time the registration is deleted. The MN must reregister within 
this period in order to continue service with this CoA. 

Depending upon its method of attachment, the MN sends location registration messages 
directly to its HA, or through an FA that forwards the registration to the HA. In either case, 
the MN exchanges registration request and registration reply messages based on IPv4, as 
described below and shown in Figure 11.13. 

(1) By using a registration request message (the request may be relayed to the HA by 
the current FA), the MN registers with its HA. 

(2) The HA creates or modifies a mobility binding for that MN with a new lifetime. 

(3) The appropriate mobile agent (HA or FA) returns a registration reply message. The 
reply message contains the necessary codes to inform the mobile node about the 
status of its request and to provide the lifetime granted by the HA [43]. 
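
The three registration steps and the binding lifetime can be sketched as follows. The class HomeAgent, its methods and the reply dictionary are hypothetical simplifications, not the message formats of the protocol; the value 0 is used here simply to mean that the request was accepted, and the addresses are taken from documentation ranges.

```python
class HomeAgent:
    """Toy HA keeping one mobility binding per home address."""

    def __init__(self, max_lifetime):
        self.max_lifetime = max_lifetime
        self.bindings = {}   # home address -> (care-of address, expiry time)

    def register(self, home_addr, coa, requested_lifetime, now):
        """Create or modify the binding and return a registration reply."""
        granted = min(requested_lifetime, self.max_lifetime)
        self.bindings[home_addr] = (coa, now + granted)
        return {"code": 0, "lifetime": granted}

    def care_of(self, home_addr, now):
        """Return the current CoA, or None once the binding has expired."""
        binding = self.bindings.get(home_addr)
        if binding is None or now >= binding[1]:
            self.bindings.pop(home_addr, None)   # registration is deleted
            return None
        return binding[0]


ha = HomeAgent(max_lifetime=300)
reply = ha.register("192.0.2.10", "198.51.100.7", requested_lifetime=600, now=0)
during = ha.care_of("192.0.2.10", now=100)
after = ha.care_of("192.0.2.10", now=300)
```

The MN in this sketch would have to call register again before time 300 in order to keep receiving packets at the same CoA.
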
Modifications in IPv6 

In IPv6, the FAs in Figure 11.13 no longer exist. The entities formerly serving as FAs are 
now access points (APs). 

Movement detection 

For the other backbone networks, the movement of the user is determined by updates 
performed when the user moves into a new LA. Since mobile IP does not use LAs to 
periodically update the network, a new feature is used to determine whether the MN has 
moved to a new subnet after changing its network AP. Mobile agents make themselves 
known by sending agent advertisement messages. The primary movement detection method 
for mobile IPv6 uses the facilities of IPv6 neighbor discovery. Two mechanisms used by 
the MN to detect movement from one subnet to another are the advertisement lifetime and 
the network prefix. 

Advertisement lifetime 

The lifetime field within the main body of the Internet control message protocol (ICMP) 
router advertisement portion of the agent advertisement is used. A mobile node records the 
lifetime received in any agent advertisements, until that lifetime expires. If the MN has not 
maintained contact with its FA, the MN must attempt to solicit a new agent [42]. 

Network prefix 

The second method uses the network prefix, a bit string that consists of some initial bits 
of an IP address, to detect movement. In some cases, an MN can determine whether or 
not a newly received agent advertisement was received on the same subnet as the MN's 
current CoA. If the prefixes differ, the MN can assume that it has moved. This method is 
not available if the MN is currently using an FA's CoA. 
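
The prefix comparison can be written with Python's standard ipaddress module. The function has_moved and the example addresses (documentation ranges) are assumptions for illustration; the sketch presumes that the prefix length is known for both the current CoA and the advertising agent.

```python
import ipaddress


def has_moved(current_coa, coa_prefix_len, agent_addr, agent_prefix_len):
    """Return True if the advertisement comes from a different subnet."""
    current = ipaddress.ip_interface(f"{current_coa}/{coa_prefix_len}").network
    advertised = ipaddress.ip_interface(f"{agent_addr}/{agent_prefix_len}").network
    return current != advertised


same_subnet = has_moved("198.51.100.7", 24, "198.51.100.1", 24)
new_subnet = has_moved("198.51.100.7", 24, "203.0.113.1", 24)
```
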

After discovering that it is on a foreign network, the MN can obtain a new CoA for this 
new network from the prefix advertised by the new router and perform location update 
procedures. For the PLMN, registration is implemented using database storage and retrieval. 
In Mobile IP, the MN's registration message creates or modifies a mobility binding at the 
home agent, associating the MN's home address with its new CoA for the specified binding 
lifetime. The procedure is outlined below and shown in Figure 11.14. 

(1) By sending a binding update, the MN registers a new CoA with its HA. 

(2) The MN notifies its CN of the MN's current binding information. 

(3) The CN and the HA send a binding request to the MN to get the MN's current 
binding information, if the binding update is allowed to expire. 

The MN responds to the binding request with its new binding update. After receiving the 
new CoA, the CN and HA send a binding acknowledgment to the MN. Once the registration 
process is complete, call delivery consists of reaching the MN via the new CoAs. A wireless 
network interface may allow an MN to be reachable on more than one link at a time (i.e. 
within the wireless transmitter range of routers on more than one separate link). This 
establishment of coexisting wireless networks can be very helpful for smooth handoff. 
Figure 11.14 Mobile IP location management operations. 




Figure 11.15 Mobile IP smooth handoff with fresh binding at previous FA. 

Handoff management 

Current routing optimization schemes in IPv4 allow the previous foreign agent (or agents) 
to maintain a binding for their former mobile visitors, showing a current CoA for each. 
Then, as packets are sent to the old CoA, the corresponding previous foreign agents can 
forward the packets to the current CoA of the MN, as demonstrated in Figure 11.15. As a 
result, an MN is able to accept packets at its old CoA while it updates its home agent and 
correspondent nodes with a new CoA on a new link. If the previous FA does not have a 
fresh binding (the binding lifetime has expired), the previous FA forwards the packets to 
the home agent of the MN, which sends the packets to the CoA from the MN's last location 
registration update, as shown in Figure 11.16. This can potentially create unnecessary traffic 
if the HA's binding still refers to the previous FA. Alternatively, the previous FA can invoke 
the use of special tunnels which forward the packets, but also indicate the need for special 
handling at the HA. 
Figure 11.16 Mobile IP smooth handoff without fresh binding at previous FA. 

When special tunnels are used, the packets that are sent to the HA are encapsulated with 
the FA's CoA as the source IP address. Upon reception of the newly encapsulated 
packets, the HA compares the source IP address with the MN's most recent CoA. Thus, if 
the two addresses match, the packets will not be circled back to the FA. However, if the 
addresses do not match, the HA can decapsulate the packets and forward them to the MN's 
current CoA, as shown in Figure 11.16 [43]. In IPv6, the smooth handoff procedure is based 
on routers (IPv6 nodes) instead of FAs. 
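
The forwarding decisions for smooth handoff in IPv4 can be summarized in a short Python sketch. The function previous_fa_route and its arguments are hypothetical; what the HA does with a packet whose tunnel source matches its recorded CoA is not specified in the text, so the sketch simply stops the path at the HA in that case.

```python
def previous_fa_route(fa_binding, now, previous_coa, ha_coa):
    """Return the hops taken by a packet that arrived at the previous FA.

    fa_binding is (new CoA, expiry time) held by the previous FA, or None.
    previous_coa is the previous FA's CoA, used as the special tunnel source.
    ha_coa is the MN's most recent CoA according to the HA.
    """
    if fa_binding is not None and now < fa_binding[1]:
        return ["previous FA", fa_binding[0]]     # fresh binding: forward
    # No fresh binding: special tunnel to the HA, source = previous_coa.
    if ha_coa == previous_coa:
        return ["previous FA", "HA"]              # do not loop back to the FA
    return ["previous FA", "HA", ha_coa]          # decapsulate and forward


fresh = previous_fa_route(("CoA-new", 50), 10, "CoA-old", "CoA-new")
stale = previous_fa_route(("CoA-new", 50), 60, "CoA-old", "CoA-new")
no_loop = previous_fa_route(None, 0, "CoA-old", "CoA-old")
```
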

The process of routing datagrams for an MN through its HA often results in the uti- 
lization of paths that are significantly longer than optimal. Route optimization techniques 
for mobile IP employ the use of tunnels, such as the special tunnels mentioned for smooth 
handoff, to minimize the inefficient path use. For example, when the HA tunnels a datagram 
to the CoA, the MN's home address is effectively shielded from intervening routers between 
its home network and its current location. Once the datagram reaches the agent, the original 
datagram is recovered and is delivered to the MN. Currently, there are two protocols for 
routing optimization and tunnel establishment: route optimization in mobile IP [64] and the 
tunnel establishment protocol [65]. 

The basic idea of route optimization is to define extensions to basic mobile IP protocols 
that allow for better routing, so that datagrams can travel from a correspondent node to a 
mobile node without going to the home agent first [64]. These extensions provide a means 
for nodes to cache the binding of an MN and then tunnel datagrams directly to the CoA 
indicated in that binding, bypassing the MN's home agent. In addition, extensions allow for 
direct forwarding to the MN's new CoA for cases such as datagrams that are in flight when 
an MN moves and datagrams that are sent based on an out-of-date cached binding. 
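
The effect of a binding cache at the correspondent node can be sketched as a single lookup. The function next_hop and the cache layout are assumptions for illustration; the actual route optimization messages of [64] are not modeled.

```python
def next_hop(binding_cache, mn_home_addr, now):
    """Return the address a CN sends to for a given MN.

    binding_cache maps a home address to (care-of address, expiry time).
    """
    entry = binding_cache.get(mn_home_addr)
    if entry is not None and now < entry[1]:
        return entry[0]          # tunnel directly to the CoA, bypassing the HA
    return mn_home_addr          # normal routing via the home network and HA


cn_cache = {"192.0.2.10": ("198.51.100.7", 120)}
direct = next_hop(cn_cache, "192.0.2.10", now=60)
via_home = next_hop(cn_cache, "192.0.2.10", now=200)
```
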

In the tunnel establishment protocol, mobile IP is modified in order to operate between 
arbitrary nodes [65]. Upon establishing a tunnel, the encapsulating agent (HA) transmits 
PDUs to the tunnel endpoint (FA) according to a set of parameters. The process of creating 
or updating tunnel parameters is called tunnel establishment. Generally the establishment 
parameters will include a network address for the MN. In order to use tunnel establishment 
to transmit PDUs, the home agent must determine the appropriate tunnel endpoint (FA) for 
the MN. This is done by consulting a table that is indexed by the MN's IP address. Each 
table entry contains the address of the appropriate tunnel endpoint, as well as any other 
necessary tunnel parameters. After receiving the packets, the foreign agent may then make 
use of any of a variety of methods to transmit the decapsulated PDUs so that they can be 
received by the MN. If the MN resides at this particular FA, no further network operations 
are required. 

4G mobile IP networks will have to additionally address several issues. Security is one of 
the most important. As mentioned for the PLMN, the authentication of the mobile becomes 
more complex as the MN's address loses its tie to a permanent access point. This allows 
for a greater opportunity for impersonating an MN in order to receive services. Thus secu- 
rity measures for the registration and update procedures, specifically protecting the CoAs 
and HAs, must be implemented in order to police terminal use [43]. Some authentication 
schemes for the MN, the HA, and the FA can be found in Troxel and Sanchez [66]. See also 
Chapter 15 of this book. 

The next issue is simultaneous binding. Since an MN can maintain several CoAs at 
one time, the HA must be prepared to tunnel packets to several endpoints. Thus, the HA is 
instructed to send duplicate encapsulated datagrams to each CoA. After the MN receives the 
packets from the CoAs, it can invoke some process to remove the duplicates. If necessary, 
the duplicate packets may be preserved in order to aid signal reconstruction. 
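
Duplicate removal at the MN can be sketched as follows. It is assumed, for illustration only, that each packet carries an identifier that is preserved through every tunnel; the tuple layout and the function deliver_once are hypothetical.

```python
def deliver_once(received):
    """Keep the first copy of each packet; received holds
    (care-of address, packet id, payload) tuples in arrival order."""
    seen, unique = set(), []
    for _coa, packet_id, payload in received:
        if packet_id not in seen:
            seen.add(packet_id)
            unique.append(payload)
    return unique


received = [("CoA-1", 1, "a"), ("CoA-2", 1, "a"), ("CoA-2", 2, "b"),
            ("CoA-1", 2, "b"), ("CoA-1", 3, "c")]
payloads = deliver_once(received)
```
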

Options for regionalized registration should also be considered, the extreme case being 
the BIONET concept discussed in Chapter 1. Currently, three major concepts have been 
identified as potential methods for limiting location update and registration cost. First, there 
is a need for schemes that manage the local connectivity available to the MN and also 
manage the buffering of datagrams to be delivered. Through this, the network can benefit 
from smooth handoffs without implementing route optimization procedures. Second, a 
multicast group of foreign agents is needed in order to allow the MN to use a multicast IP 
address as its CoA. Third, a hierarchy of foreign agents can be used in agent advertisement 
in order to localize the registrations to the lowest common FA of the CoA at the two points 
of attachment. To enable this method, the MN has to determine the tree-height required 
for its new registration message, and then arrange for the transmission of this message to 
reach each level of the hierarchy between itself and the lowest common ancestor of its new 
and previous CoA [44]. 
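
The levels that a regionalized registration has to reach can be found by computing the lowest common ancestor in the FA hierarchy. The tree, the agent names and the two functions below are assumptions made for this sketch.

```python
def ancestors(node, parent):
    """Return the chain from node up to the root, node first."""
    chain = [node]
    while parent[node] is not None:
        node = parent[node]
        chain.append(node)
    return chain


def registration_levels(old_fa, new_fa, parent):
    """FAs from the new leaf FA up to the lowest common ancestor."""
    old_chain = set(ancestors(old_fa, parent))
    levels = []
    for fa in ancestors(new_fa, parent):
        levels.append(fa)
        if fa in old_chain:
            return levels
    raise ValueError("the two FAs do not share an ancestor")


parent = {"FA-root": None, "FA-a": "FA-root", "FA-b": "FA-root",
          "FA-a1": "FA-a", "FA-a2": "FA-a", "FA-b1": "FA-b"}
local_move = registration_levels("FA-a1", "FA-a2", parent)
distant_move = registration_levels("FA-a1", "FA-b1", parent)
```

A move between two FAs under the same parent is registered at two levels only, while a move to another branch has to propagate up to the root of the hierarchy.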

As already discussed in Chapter 1, 4G is all about integrating different wireless networks. 
So intertechnology roaming is the central issue in 4G wireless networks. The first steps in 
that direction have already been undertaken. Recently, there has been some discussion 
regarding Mobile IP with respect to the third-generation IMT 2000 system. A high-level IP 
mobility architecture is described in Becker et al. [69], in which the diverse nature of today's 
wireless and wireline packet data networks is explored. To support the seamless roaming 
among the heterogeneous networks, the mobility management based on mobile IP concepts 
is extended to the current third generation IMT2000 wireless architecture. Another concept, 
referred to as simple mobile IP (SMIP) [68], searches for a simpler approach to 
support the mobility of users, compared with the asymmetric triangular approach proposed 
in IPv6. SMIP employs a more symmetric and distributed solution for location management 
based on MN connections to fixed network routers that have added mobility functions. 

Mobility management in wireless ATM deals with transitioning from ATM cell transport 
based upon widely available resources over wireline to cell transport based upon the limited 
and relatively unreliable resources over the wireless channel. For details of network 
architectures see Chapter 1. Thus, the issues such as latency, message delivery, connection 
routing and QoS [67] will be in the focus of discussion. The ATM Forum (WATM Working 
Figure 11.17 ATM location management techniques. 



Group) is developing basic mechanisms and protocol extensions for location and handoff 
management that address these issues. The Forum has specified that new procedures must 
be compatible with the current ATM standards in order to be implemented with relative 
ease and efficiency [45]. As a result, many of the procedures are also compatible with PCS, 
satellite and to a lesser degree mobile IP concepts. In this section, we discuss selected 
proposed solutions for location management, terminal paging, and handoff. 

Options considered for 4G are summarized in Figure 11.17. Proposed protocols for 
WATM implement location management using three techniques: location servers, location 
advertisement and terminal paging. 

Location servers refer to the use of databases to maintain records of the attachment point 
of MTs within the network. As discussed earlier, the storage and retrieval process can gener- 
ate excessive signaling and querying operations. Location advertisement avoids the use of 
databases by passing location information throughout the network on broadcast messages. 
Terminal paging is employed to locate MTs within the service area of its attachment point, 
as discussed previously. 

Location server techniques are based on location servers (the databases) used to store and 
retrieve a record of the current position of the mobile. They require querying operations, 
as well as signaling protocols for storage and retrieval. WATM server protocols employ 
the IS-41/GSM based techniques that were discussed for the PLMN backbone earlier in 
this section. The first method makes familiar use of the HLR/VLR database structure. The 
second algorithm, location registers (LRs), uses a hierarchy of databases. 

The two-tier database uses bilevel databases that are distributed to zones throughout 
the network, as shown in Figure 11.18. The zones, analogous to the LAs, are maintained 
by a zone manager. The zone manager, analogous to the mobility service control point 
(MSCP) of the 4G wireless architecture, controls the zone's location update procedures. 
The home tier (HLR) of the zone's database stores location information regarding MTs 
that are permanently registered within that zone, while the second tier (VLR) of the zone's 
database stores location information on MTs that are visiting the zone. Each MT has a home 
zone, i.e. a zone in which it is permanently registered. 

Figure 11.18 Two-tier database scheme. 




Figure 11.19 LR hierarchy of WATM location registers scheme. 

Upon entering a new zone, the MT detects the new zone identity broadcast from the BSs. 
The steps for registration, shown in Figure 11.19, are: 

(1) The MT transmits a registration request message to the new MSCP that contains 
its user identification number (UID), authentication data, and the identity of the 
previous zone. 

(2) The current MSCP determines the home zone of the MT from the previous zone. 

(3) The current and home MSCPs authenticate the user and update the home user profile 
with the new location information. 

(4) The home zone sends a copy of the profile to the current zone, which stores the 
profile in the VLR tier of its database. 

(5) The current MSCP sends a purge message to the previous zone so that the user's 
profile is deleted from the previous zone's VLR tier. 

Call delivery is achieved by routing the call to the last known zone first. If the MT has 
moved and has been purged, the call is immediately forwarded to the home zone. The home 
zone's HLR is queried for the current location of the MT, which is forwarded back to the 
calling switch. The calling switch can then set up a connection to the current serving switch 
of the MT. 
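
The registration steps and the call delivery rule above can be sketched in a few lines of Python. This is a minimal illustration under stated assumptions, not the WATM protocol itself: the class and method names (`Zone`, `TwoTierNetwork`, `register`, `locate`) are hypothetical, authentication is omitted, and a global `home_of` table stands in for the way the current MSCP derives the home zone from the previous zone.

```python
from dataclasses import dataclass, field


@dataclass
class Zone:
    """One zone with a two-tier (HLR/VLR) location database."""
    name: str
    hlr: dict = field(default_factory=dict)  # home tier: uid -> name of current zone
    vlr: set = field(default_factory=set)    # visitor tier: uids of visiting MTs


class TwoTierNetwork:
    def __init__(self, zone_names):
        self.zones = {n: Zone(n) for n in zone_names}
        self.home_of = {}  # uid -> home zone name (assumed global lookup)

    def subscribe(self, uid, home):
        """Permanently register an MT in its home zone."""
        self.home_of[uid] = home
        self.zones[home].hlr[uid] = home

    def register(self, uid, new_zone, previous_zone):
        """Steps (2)-(5): find home zone, update HLR, copy profile, purge old VLR."""
        home = self.home_of[uid]                    # step (2), simplified
        self.zones[home].hlr[uid] = new_zone        # step (3), authentication omitted
        if new_zone != home:
            self.zones[new_zone].vlr.add(uid)       # step (4)
        self.zones[previous_zone].vlr.discard(uid)  # step (5)

    def locate(self, uid, last_known_zone):
        """Return (current zone, number of database lookups) for an incoming call."""
        zone = self.zones[last_known_zone]
        lookups = 1
        if uid in zone.vlr or zone.hlr.get(uid) == last_known_zone:
            return last_known_zone, lookups
        # The MT has moved and was purged: forward the query to the home zone.
        home = self.zones[self.home_of[uid]]
        lookups += 1
        return home.hlr[uid], lookups
```

In this sketch a call routed to a stale zone costs exactly two lookups, which matches the bound of at most two database lookups per incoming call.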

The advantage of the two-tier scheme is that it keeps the number of queries low, requiring 
at most two database lookups for each incoming call to find the MT. However, the use of 
a centralized HLR may cause increased signaling traffic and unnecessary connection set- 
up delays if the MT makes several localized moves for an extended period of time. A 
more localized approach may reduce the need for long-distance queries and thereby reduce 
connection set-up delays. 

The LR hierarchy-based scheme, described in Veeraraghavan and Dommetry [56] 
and shown in Figure 11.19, distributes location servers throughout a hierarchical private 
network-to-network interface (PNNI) architecture. The PNNI procedure is based on a 
hierarchy of peer groups, each consisting of collections of ATM switches. Each switch can 
connect to other switches within its peer group. Special switches, designated peer group 
leaders, can also connect to a higher ranking leader in the 'parent' peer group. Each peer 
group also has its own database, or LR, used to store location information on each of the 
MTs being serviced by the peer group. 

The PNNI organization allows the network to route connections to the MT without 
requiring the parent nodes to have exact location information. Only the lowest referenced 
peer must record the exact location, and the number of LR updates then corresponds to the 
level of mobility of the MT. As an illustration, a connection being set up to an MT located at 
switch A.2.2 in Figure 11.19 is first routed according to the highest boundary peer group 
and switch A. Peer A can then route the connection to its 'child' peer group, level A.x, to 
switch A.2. Finally, the connection is routed by A.2 to the lowest peer group level to switch 
A.2.2, which resolves the connection to the MT. 

Thus, for movement within the A.2 peer group, the location update procedure can be 
localized to only the LR of that peer group. However, a movement from peer group B.1 
to peer group A.2 requires location registration of a larger scope, and the maintenance 
of a home LR to store a pointer to the current parent peer position of the MT. To limit 
signaling for the larger scale moves to the minimum necessary level, Veeraraghavan and 
Dommetry [56] use two scope-limiting parameters, S and L. The S parameter indicates 
a higher level peer group boundary for LR queries, while the L parameter designates the 
lowest group. In Figure 11.19, the current S level is level one, while the L level is level 
two. 

When the MT performs a location update by sending a registration notification message 
to the new BS, this message is relayed to the serving switch, which then stores the MT's 
location information in the peer group's LR. When the MT powers on or off, this message is 
relayed up the hierarchy until it reaches the preset boundary S. The S-level register records 
the entry and then relays the message to the MT's home LR. As an illustration, for movement 
from position B.1.2 to position A.2.2, the registration procedure, shown in Figure 11.19, is 
as follows: 

(1) The MT sends a registration notification message to the new BS/switch. 

(2) The new switch stores the MT in the peer group's LR. 

(3) The peer group relays the new location info to the higher level LRs for routing, 
stopping at the first common ancestor of the former and current peer groups. 

(4) In this case, the former S level is not a common ancestor, so a new S level is designated 
and the location info stops propagating at the new S level, level 0. 

(5) The MT's home LR (located at group A.x) is notified of the new S-level location for 
the MT. 

After the updates are complete, the new switch sends a purge message to the previous switch 
so that the former location can be removed from the LRs. 
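
The scope-limiting idea in step (3) can be illustrated with dotted switch addresses such as A.2.2. The sketch below is only an illustration under assumptions: the helper names `ancestors` and `update_scope` are hypothetical, the empty string stands for the top of the hierarchy, and the returned scope is assumed to include the first common ancestor itself.

```python
def ancestors(switch):
    """Peer groups containing a switch, from the lowest level up to the top.

    'A.2.2' -> ['A.2', 'A', ''], where '' stands for the top of the hierarchy.
    """
    parts = switch.split(".")
    return [".".join(parts[:n]) for n in range(len(parts) - 1, -1, -1)]


def update_scope(old_switch, new_switch):
    """Peer-group LRs touched by a move, lowest first.

    Propagation stops at the first common ancestor of the old and new positions.
    """
    old_groups = set(ancestors(old_switch))
    scope = []
    for group in ancestors(new_switch):
        scope.append(group)
        if group in old_groups:  # first common ancestor reached
            break
    return scope
```

With these assumptions a move inside peer group A.2 touches only the A.2 register, while the move from B.1.2 to A.2.2 propagates all the way to the top of the hierarchy.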

Call delivery 

An incoming call request can be routed to the last known peer group or switch via the 
S-level LR. If the mobile has moved, the last known switch propagates a location request, 
querying the upstream LRs until the mobile endpoint's address is recognized by an LR that 
has a pointer to the mobile's current position. Then the request is sent to the L-level LR for 
that peer group, which resolves the query and sends the location information back to the 
calling switch. 

Finally, if the call request reaches the S level before being recognized by an LR, the 
S-level LR forwards the location request directly to the home switch. Since the home LR 
keeps track of S-level changes for its mobile, the home switch can forward the request 
directly to the correct S-level switch, whose LR points to the current peer group position 
of the MT. 

Location advertisement techniques 

Although the LR hierarchy provides the advantages of simplicity, decreased computation 
costs and flexibility, the method can still require a substantial signaling and database query- 
ing load. This load can be reduced by using location advertisement. For WATM, advertise- 
ment refers to the notification of appropriate network nodes of the current location of the MT. 
The first method, mobile PNNI, uses the PNNI architecture described above by removing the 
LRs and by taking advantage of an internal broadcast mechanism [56]. The second method, 
destination-rooted virtual connection trees, advertises location information via provisioned 
virtual paths [57]. The third method, integrated location resolution, extends the signaling 
framework of ATM with location information elements that incorporate location resolution 
into the connection set-up process [4]. For more details see the references. 

The terminal paging problem has not yet been explored enough for WATM applications, 
and additional work will be needed in this area. 

11.1.2.7 WATM handoff management in 4G wireless networks 

Handoff management controls the process of maintaining each connection at a certain level 
of QoS as the MT moves into different service areas [29]. As illustrated in Figure 11.20, 
potential protocols to be used in 4G can be grouped into four categories: 

(1) full connection rerouting; 

(2) route augmentation; 

(3) partial connection rerouting; and 

(4) multicast connection rerouting. 

Figure 11.20 WATM handoff management techniques. 

Full connection rerouting maintains the connection by establishing a completely new route 
for each handoff, as if it were a brand new call [46]. Route augmentation simply extends the 
original connection via a hop to the MT's next location [46]. Partial connection rerouting 
re-establishes certain segments of the original connection, while preserving the remainder [9]. 
Finally, multicast connection rerouting combines the former three techniques but includes 
the maintenance of potential handoff connection routes to support the original connection, 
reducing the time spent in finding a new route for handoff [9]. More details can be found 
in the above references. 



11.1.2.8 Mobility management for satellite networks 

In 4G integrated wireless networks the LEO satellites would cover regions where building 
terrestrial wireless systems is economically infeasible due to rough terrain or insufficient 
user population. A satellite system could also interact with the terrestrial wireless network to 
absorb its instantaneous traffic overload. 

LEO satellites are usually those with altitudes between 500 and 1500 km above the 
Earth's surface [70-72]. This low altitude provides small end-to-end delays and low power 
requirements for both the satellites and the handheld ground terminals. In addition, inter- 
satellite links (ISL) make it possible to route a connection through the satellite network 
without using any terrestrial resources. These advantages come along with a challenge: 
in contrast to geostationary (GEO) satellites, LEO satellites change their position with 
reference to a fixed point on the Earth. Owing to this mobility, the coverage region of an 
LEO satellite is not stationary. A global coverage at any time is still possible if a certain 
number of orbits and satellites are used. The coverage area of a single satellite consists of 
small-sized cells, which are referred to as spotbeams. Different frequencies or codes are 
used in different spotbeams to achieve frequency reuse in the satellite coverage area. 

Location management in the LEO satellite network environment represents a more 
challenging problem because of the movement of satellite footprints. As a consequence, an LA 
cannot be associated with the coverage area of a satellite because of the very fast movement 
of a LEO satellite. Thus, 4G will need the development of new LA definitions for satellite 
networks as well as solutions to the signaling issues mentioned for all of the location management 
protocols. In del Re [47], LAs are defined using (gateway, spotbeam) pairs. However, the 
very fast movement of the spotbeams results in excessive signaling for location updates. 
In Ananasso and Priscoli [73], LAs are defined using only gateways. However, the paging 
problem has not been addressed in the same reference. 

Handoff management ensures that ongoing calls are not disrupted as a result of satel- 
lite movement, but rather transferred or handed off to new spotbeams or satellites when 
necessary. If a handoff is between two spotbeams served by the same satellite, handoff is 
intrasatellite. The small size of spotbeams causes frequent intrasatellite handoffs, which 
are also referred to as spotbeam handoffs [74]. If the handoff is between two satellites, it is 
referred to as intersatellite handoff. Another form of handoff occurs as a result of the change 
in the connectivity pattern of the network. Satellites near to polar regions turn off their links 
to other satellites in the neighboring orbits. Ongoing calls passing through these links need 
to be rerouted. This type of handoff is referred to as link handoff [59, 60]. Frequent link 
handoffs result in a high volume of signaling traffic. Moreover, some of the ongoing calls 
would be blocked during connection rerouting caused by link handoffs. 



11.2 CELLULAR SYSTEMS WITH PRIORITIZED HANDOFF 

The handoff attempt rate in a cellular system depends on cell radius and mobile speed 
as well as other system parameters. As a result of limited resources, some fraction of 
handoff attempts will be unsuccessful. Some calls will be forced to terminate before message 
completion. In this section we discuss analytical models to investigate these effects and 
to examine the relationships between performance characteristics and system parameters. 
For these purposes some assumptions about the traffic nature are needed. We assume that 
the new call origination rate is uniformly distributed over the mobile service area with 
the average number of new call originations per second per unit area $\Lambda_a$. A very large 
population of mobiles is assumed, thus the average call origination rate is for practical 
purposes independent of the number of calls in progress. A hexagonal cell shape is assumed. 
The cell radius $R$ for a hexagonal cell is defined as the maximum distance from the center of 
a cell to the cell boundary. With the cell radius $R$, the average new call origination rate per 
cell $\Lambda_R$ is $\Lambda_R = 3\sqrt{3} R^2 \Lambda_a / 2$. The average handoff attempt rate per cell is $\Lambda_{Rh}$. 
The ratio $\gamma_0$ of handoff attempt rate to new call origination rate (per cell) is 
$\gamma_0 = \Lambda_{Rh}/\Lambda_R$. If a fraction $P_B$ 
of new call originations is blocked and cleared from the system, the average rate at which 
new calls are carried is $\Lambda_{Rc} = \Lambda_R (1 - P_B)$. Similarly, if a fraction $P_{fh}$ of handoff attempts 
fails, the average rate at which handoff calls are carried is $\Lambda_{Rhc} = \Lambda_{Rh}(1 - P_{fh})$. The ratio 
$\gamma_c$ of the average carried handoff attempt rate to the average carried new call origination 
rate is defined as $\gamma_c = \Lambda_{Rhc}/\Lambda_{Rc} = \gamma_0 (1 - P_{fh})/(1 - P_B)$. 
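
The rate definitions above translate directly into code. The following is a small sketch for checking numbers by hand; the function names and the sample values in the usage note are illustrative assumptions, not values taken from the text.

```python
import math


def new_call_rate_per_cell(radius, lambda_a):
    """Lambda_R: hexagon area (3*sqrt(3)/2)*R^2 times the per-area origination rate."""
    return 3.0 * math.sqrt(3.0) / 2.0 * radius ** 2 * lambda_a


def carried_rates(lambda_r, lambda_rh, p_b, p_fh):
    """Return (Lambda_Rc, Lambda_Rhc, gamma_c) from offered rates and failure fractions."""
    lambda_rc = lambda_r * (1.0 - p_b)      # carried new calls
    lambda_rhc = lambda_rh * (1.0 - p_fh)   # carried handoff calls
    return lambda_rc, lambda_rhc, lambda_rhc / lambda_rc
```

For example, `carried_rates(1.0, 0.5, 0.02, 0.01)` gives the carried ratio $\gamma_c = \gamma_0 (1 - P_{fh})/(1 - P_B)$ with $\gamma_0 = 0.5$.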

The channel holding time $T_H$ in a cell is defined as the time duration between the instant 
that a channel is occupied by a call and the instant it is released by either completion of the 
call or a cell boundary crossing by the mobile. This is a function of system parameters 
such as cell size, speed and direction of mobiles, etc. To investigate the distribution of $T_H$ 
we let the random variable $T_M$ denote the message duration, that is, the time an assigned 
channel would be held if no handoff is required. The random variable $T_M$ is assumed to be 
exponentially distributed, $f_{T_M}(t) = \mu_M e^{-\mu_M t}$, with the mean value $\bar{T}_M (= 1/\mu_M)$. The speed 
in a cell is assumed to be uniformly distributed on the interval $[0, V_{max}]$. 

When a mobile crosses a cell boundary, the model assumes that vehicular speed and 
direction change. The direction of travel is also assumed to be uniformly distributed and in- 
dependent of speed. More sophisticated models would assume that the higher the speed the 
fewer changes in direction are possible. 

The random variable $T_n$ is the time a mobile resides in the cell in which the call is 
originated. The time that a mobile resides in the cell to which the call is handed off is 
denoted $T_h$. The pdfs $f_{T_n}(t)$ and $f_{T_h}(t)$ will be discussed in Section 11.3. 

When a call is originated in a cell and gets a channel, the call holds the channel until 
the call is completed in the cell or the mobile moves out of the cell. Therefore, the channel 
holding time $T_{Hn}$ is either the message duration $T_M$ or the time $T_n$ for which the mobile 
resides in the cell, whichever is less. For a call that has been handed off successfully, the 
channel is held until the call is completed in the cell or the mobile moves out of the cell 
again before call completion. 

Because of the memoryless property of the exponential distribution, the remaining 
message duration of a call after handoff has the same distribution as the message duration. 
In this case the channel holding time $T_{Hh}$ is either the remaining message duration $T_M$ or 
the mobile residing time $T_h$ in the cell, whichever is less. The random variables $T_{Hn}$ and $T_{Hh}$ 
are therefore given by 

$$T_{Hn} = \min(T_M, T_n) \quad \text{and} \quad T_{Hh} = \min(T_M, T_h) \tag{11.1}$$

The cumulative distribution functions (CDF) of $T_{Hn}$ and $T_{Hh}$ can be expressed as 

$$\begin{aligned}
F_{T_{Hn}}(t) &= F_{T_M}(t) + F_{T_n}(t)\,[1 - F_{T_M}(t)] \\
F_{T_{Hh}}(t) &= F_{T_M}(t) + F_{T_h}(t)\,[1 - F_{T_M}(t)]
\end{aligned} \tag{11.2}$$

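The distribution of channel holding time can be written as 

$$\begin{aligned}
F_{T_H}(t) &= \frac{\Lambda_{Rc}}{\Lambda_{Rc} + \Lambda_{Rhc}} F_{T_{Hn}}(t) + \frac{\Lambda_{Rhc}}{\Lambda_{Rc} + \Lambda_{Rhc}} F_{T_{Hh}}(t) \\
&= \frac{1}{1 + \gamma_c} F_{T_{Hn}}(t) + \frac{\gamma_c}{1 + \gamma_c} F_{T_{Hh}}(t) \\
&= F_{T_M}(t) + \frac{1}{1 + \gamma_c}\,[1 - F_{T_M}(t)]\,[F_{T_n}(t) + \gamma_c F_{T_h}(t)]
\end{aligned} \tag{11.3}$$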



From the initial definitions, 

$$F_{T_H}(t) = \begin{cases}
1 - e^{-\mu_M t} + \dfrac{e^{-\mu_M t}}{1 + \gamma_c}\,[F_{T_n}(t) + \gamma_c F_{T_h}(t)], & \text{for } t \ge 0 \\
0, & \text{elsewhere}
\end{cases} \tag{11.4}$$

The complementary distribution function $F^C_{T_H}(t)$ is 

$$F^C_{T_H}(t) = 1 - F_{T_H}(t) = e^{-\mu_M t} - \frac{e^{-\mu_M t}}{1 + \gamma_c}\,[F_{T_n}(t) + \gamma_c F_{T_h}(t)], \quad \text{for } t \ge 0 \tag{11.5}$$

By differentiating Equation (11.4) we get the probability density function (pdf) of $T_H$ as 

$$f_{T_H}(t) = \mu_M e^{-\mu_M t} + \frac{e^{-\mu_M t}}{1 + \gamma_c}\,[f_{T_n}(t) + \gamma_c f_{T_h}(t)] - \frac{\mu_M e^{-\mu_M t}}{1 + \gamma_c}\,[F_{T_n}(t) + \gamma_c F_{T_h}(t)] \tag{11.6}$$

To simplify the analysis the distribution of $T_H$ is approximated in References [75, 76] by 
a negative exponential distribution with mean $\bar{T}_H (= 1/\mu_H)$. From the family of negative 
exponential distribution functions, the function that best fits the distribution of $T_H$ is chosen 
by comparing $F^C_{T_H}(t)$ and $e^{-\mu_H t}$; it is defined by 

$$\mu_H \Rightarrow \min_{\mu_H} \int_0^\infty \left[F^C_{T_H}(t) - e^{-\mu_H t}\right] dt \tag{11.7}$$

Because a negative exponential distribution function is determined by its mean value, we 
choose $\bar{T}_H (= 1/\mu_H)$, which satisfies the above condition. The 'goodness of fit' for this 
approximation is measured by 

$$\frac{\displaystyle\int_0^\infty \left|F^C_{T_H}(t) - e^{-\mu_H t}\right| dt}{\displaystyle 2\int_0^\infty F^C_{T_H}(t)\, dt} \tag{11.8}$$
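
A numerical sketch of this approximation is given below. It rests on an assumption that should be read as one possible interpretation rather than the method of References [75, 76]: the exponential is fitted by matching means, i.e. $\bar{T}_H$ is set to the integral of $F^C_{T_H}(t)$, which makes the signed integral in Equation (11.7) vanish. The function names, the trapezoidal integration and the truncation point `t_max` are all illustrative choices; the residence-time CDFs are supplied by the caller.

```python
import math


def trapezoid(f, a, b, n=20000):
    """Composite trapezoidal rule on [a, b] with n sub-intervals."""
    h = (b - a) / n
    total = 0.5 * (f(a) + f(b))
    for i in range(1, n):
        total += f(a + i * h)
    return total * h


def complementary_cdf(t, mu_m, gamma_c, cdf_n, cdf_h):
    """Complement of Equation (11.4) for t >= 0."""
    mix = (cdf_n(t) + gamma_c * cdf_h(t)) / (1.0 + gamma_c)
    return math.exp(-mu_m * t) * (1.0 - mix)


def fit_exponential(mu_m, gamma_c, cdf_n, cdf_h, t_max):
    """Return (mu_H, goodness of fit), with mu_H chosen by matching means (assumption)."""
    def fc(t):
        return complementary_cdf(t, mu_m, gamma_c, cdf_n, cdf_h)

    mean = trapezoid(fc, 0.0, t_max)  # mean of T_H = integral of its complementary CDF
    mu_h = 1.0 / mean
    err = trapezoid(lambda t: abs(fc(t) - math.exp(-mu_h * t)), 0.0, t_max)
    return mu_h, err / (2.0 * mean)   # ratio of Equation (11.8)
```

If both residence times happen to be exponential with the same rate, $T_H$ is itself exponential and the fit measure returned by the sketch is close to zero.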

In the sequel the following definitions will be used: 

(1) The probability that a new call does not enter service because of unavailability of 
channels is called the blocking probability, $P_B$. 

(2) The probability that a call is ultimately forced into termination (though not blocked) 
is $P_F$. This represents the average fraction of new calls which are not blocked but 
which are eventually uncompleted. 

(3) $P_{fh}$ is the probability that a given handoff attempt fails. It represents the average 
fraction of handoff attempts that are unsuccessful. 

(4) The probability $P_N$ that a new call that is not blocked will require at least one handoff 
before completion because of the mobile crossing the cell boundary is 

$$P_N = \Pr\{T_M > T_n\} = \int_0^\infty [1 - F_{T_M}(t)]\, f_{T_n}(t)\, dt = \int_0^\infty e^{-\mu_M t} f_{T_n}(t)\, dt \tag{11.9}$$

(5) The probability $P_H$ that a call that has already been handed off successfully will 
require another handoff before completion is 

$$P_H = \Pr\{T_M > T_h\} = \int_0^\infty [1 - F_{T_M}(t)]\, f_{T_h}(t)\, dt = \int_0^\infty e^{-\mu_M t} f_{T_h}(t)\, dt \tag{11.10}$$
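
As a worked illustration of Equations (11.9) and (11.10), suppose, purely as an assumption for this example, that the residence times $T_n$ and $T_h$ are exponentially distributed with hypothetical rates $\eta_n$ and $\eta_h$ (the actual pdfs are the subject of Section 11.3). The integrals then have closed forms:

```latex
P_N = \int_0^\infty e^{-\mu_M t}\, \eta_n e^{-\eta_n t}\, dt = \frac{\eta_n}{\mu_M + \eta_n},
\qquad
P_H = \int_0^\infty e^{-\mu_M t}\, \eta_h e^{-\eta_h t}\, dt = \frac{\eta_h}{\mu_M + \eta_h}
```

For instance, with $\bar{T}_M = 120$ s and an assumed mean residence time of 60 s, $P_N = (1/60)/(1/120 + 1/60) = 2/3$.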

Let the integer random variable $K$ be the number of times that a nonblocked call is 
successfully handed off during its lifetime. The event that a mobile moves out of the mobile service 
area during the call will be ignored since the whole service area is much larger than the 
cell size. A nonblocked call will have exactly $k$ successful handoffs if all of the following 
events occur: 

(1) It is not completed in the cell in which it was first originated. 

(2) It succeeds in the first handoff attempt. 

(3) It requires and succeeds in $k - 1$ additional handoffs. 

(4) It is either completed before needing the next handoff or it is not completed but fails 
on the $(k + 1)$st handoff attempt. 

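The probability function for $K$ is therefore given by 

$$\begin{aligned}
\Pr\{K = 0\} &= (1 - P_N) + P_N P_{fh} \\
\Pr\{K = k\} &= P_N (1 - P_{fh})(1 - P_H + P_H P_{fh})\,\{P_H (1 - P_{fh})\}^{k-1}, \quad k = 1, 2, \ldots
\end{aligned} \tag{11.11}$$

and the mean value of $K$ is 

$$\bar{K} = \sum_{k=0}^{\infty} k \Pr\{K = k\} = \frac{P_N (1 - P_{fh})}{1 - P_H (1 - P_{fh})} \tag{11.12}$$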

If the entire service area has $M$ cells, the total average new call attempt rate which is not 
blocked is $M\Lambda_{Rc}$, and the total average handoff call attempt rate is $\bar{K} M \Lambda_{Rc}$. If these traffic 
components are equally distributed among cells, we have $\gamma_c = (\bar{K} M \Lambda_{Rc})/(M \Lambda_{Rc}) = \bar{K}$. 
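
The distribution of $K$ and its mean can be checked numerically with a short sketch. The function names are illustrative, and the probabilities passed in are arbitrary test values rather than results from the text.

```python
def handoff_count_pmf(k, p_n, p_h, p_fh):
    """Pr{K = k} from Equation (11.11)."""
    if k == 0:
        return (1.0 - p_n) + p_n * p_fh
    q = p_h * (1.0 - p_fh)  # probability of needing and succeeding in one more handoff
    return p_n * (1.0 - p_fh) * (1.0 - p_h + p_h * p_fh) * q ** (k - 1)


def mean_handoffs(p_n, p_h, p_fh):
    """Closed form of Equation (11.12)."""
    return p_n * (1.0 - p_fh) / (1.0 - p_h * (1.0 - p_fh))
```

Summing `k * handoff_count_pmf(k, ...)` over a long enough range of `k` reproduces `mean_handoffs(...)`, which is one way to confirm that Equations (11.11) and (11.12) agree.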



11.2.1 Channel assignment priority schemes 

The probability of forced termination can be decreased by giving priority (for channels) to 
handoff attempts (over new call attempts). In this section, two priority schemes are described, 
and the expressions for $P_B$ and $P_{fh}$ are derived. A subset of the channels allocated to a cell is 
to be exclusively used for handoff calls in both priority schemes. In the first priority scheme, 
a handoff call is terminated if no channel is immediately available in the target cell (channel 
reservation - CR handoffs). In the second priority scheme, the handoff call attempt is held 
in a queue until either a channel becomes available for it, or the received signal power level 
becomes lower than the receiver threshold level (channel reservation with queueing - CRQ 
handoffs). 



11.2.2 Channel reservation - CR handoffs 

Priority is given to handoff attempts by assigning $C_h$ channels exclusively for handoff calls 
among the $C$ channels in a cell. The remaining $C - C_h$ channels are shared by both new 
calls and handoff calls. A new call is blocked if the number of available channels in the cell 
is less than or equal to $C_h$ when the call is originated. A handoff attempt is unsuccessful if 
no channel is available in the target cell. We assume that both new and handoff call attempts 
are generated according to a Poisson point process with mean rates per cell of $\Lambda_R$ and $\Lambda_{Rh}$, 
respectively. As discussed previously, the channel holding time $T_H$ in a cell is approximated 
to have an exponential distribution with mean $\bar{T}_H (= 1/\mu_H)$. We define the state $E_j$ of a 
cell such that a total of $j$ calls is in progress for the base station of that cell. Let $P_j$ 
represent the steady-state probability that the base station is in state $E_j$; the probabilities 
can be determined in the usual way for birth-death processes discussed in Chapter 6. The 
pertinent state-transition diagram is shown in Figure 11.21. 

Figure 11.21 State-transition diagram for channel reservation - CR handoffs. 
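The state equations are 

$$P_j = \begin{cases}
\dfrac{\Lambda_R + \Lambda_{Rh}}{j \mu_H}\, P_{j-1}, & \text{for } j = 1, 2, \ldots, C - C_h \\[2ex]
\dfrac{\Lambda_{Rh}}{j \mu_H}\, P_{j-1}, & \text{for } j = C - C_h + 1, \ldots, C
\end{cases} \tag{11.13}$$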



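As in Chapter 6, by using Equation (11.13) recursively, along with the normalization 
condition $\sum_{j=0}^{C} P_j = 1$, the probability distribution $\{P_j\}$ is 

$$P_0 = \left[ \sum_{k=0}^{C-C_h} \frac{(\Lambda_R + \Lambda_{Rh})^k}{k!\, \mu_H^k} + \sum_{k=C-C_h+1}^{C} \frac{(\Lambda_R + \Lambda_{Rh})^{C-C_h}\, \Lambda_{Rh}^{k-(C-C_h)}}{k!\, \mu_H^k} \right]^{-1}$$

$$P_j = \begin{cases}
\dfrac{(\Lambda_R + \Lambda_{Rh})^j}{j!\, \mu_H^j}\, P_0, & \text{for } j = 1, 2, \ldots, C - C_h \\[2ex]
\dfrac{(\Lambda_R + \Lambda_{Rh})^{C-C_h}\, \Lambda_{Rh}^{j-(C-C_h)}}{j!\, \mu_H^j}\, P_0, & \text{for } j = C - C_h + 1, \ldots, C
\end{cases} \tag{11.14}$$

The probability of blocking a new call is $P_B = \sum_{j=C-C_h}^{C} P_j$. The probability of handoff 
attempt failure $P_{fh}$ is the probability that the state number of the base station is equal to $C$. 
Thus $P_{fh} = P_C$. 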
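
The CR scheme can be evaluated numerically by applying the recursion of Equation (11.13) and normalizing, which is equivalent to the closed form of Equation (11.14). The sketch below is illustrative only: the function names are hypothetical, and the offered rates and $\mu_H$ are treated as given inputs.

```python
def cr_state_probabilities(lam_r, lam_rh, mu_h, c, c_h):
    """Steady-state probabilities P_0..P_C of the CR scheme via Equation (11.13)."""
    weights = [1.0]  # unnormalized P_j / P_0
    for j in range(1, c + 1):
        # New calls are admitted only while at most C - C_h channels are busy.
        birth = lam_r + lam_rh if j <= c - c_h else lam_rh
        weights.append(weights[-1] * birth / (j * mu_h))
    total = sum(weights)
    return [w / total for w in weights]


def cr_performance(lam_r, lam_rh, mu_h, c, c_h):
    """Return (P_B, P_fh) for the CR scheme."""
    p = cr_state_probabilities(lam_r, lam_rh, mu_h, c, c_h)
    p_b = sum(p[c - c_h:])  # new call blocked in states C - C_h .. C
    p_fh = p[c]             # handoff fails only when all C channels are busy
    return p_b, p_fh
```

With $C_h = 0$ no channel is reserved, and both probabilities reduce to the Erlang B blocking probability, which gives a convenient sanity check.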



11.2.3 Channel reservation with queueing - CRQ handoffs 



When a mobile moves away from the base station, the received power generally decreases. 
When the received power gets lower than a handoff threshold level, the handoff procedure 
is initiated. The handoff area is defined as the area in which the average received power 
level from the base station of a mobile receiver is between the handoff threshold level 
(upper bound) and the receiver threshold level (lower bound). If the handoff attempt finds 
all channels in the target cell occupied, we consider that it can be queued. If any channel 
is released while the mobile is in the handoff area, the next queued handoff attempt is 
accomplished successfully. If the received power level from the source cell's base station 
falls below the receiver threshold level prior to the mobile being assigned a channel in the 
target cell, the call is forced into termination. When a channel is released in the cell, it is 
assigned to the next handoff call attempt waiting in the queue (if any). If more than one 
handoff call attempt is in the queue, the first-come-first-served queuing discipline is used. 
Prioritized queueing is also possible, where the fast moving (fast signal level losing) 
users may have higher priority. We assume that the queue size at the base station is unlimited. 
Figure 11.22 shows a schematic representation of the flow of call attempts through a base 
station. 

Figure 11.22 Call flow diagram for channel reservation with queueing - CRQ handoffs. 

The time for which a mobile is in the handoff area depends on system parameters such 
as the speed and direction of mobile travel and the cell size. We call it the dwell time of a 
mobile in the handoff area, $T_Q$. For simplicity of analysis, we assume that this dwell time 
is exponentially distributed with mean $\bar{T}_Q (= 1/\mu_Q)$. We define $E_j$ as the state of the base 
station when $j$ is the sum of the number of channels being used in the cell and the number of 
handoff call attempts in the queue. For those states whose state number $j$ is less than or equal 
to $C$, the state transition relation is the same as for the CR scheme. Let $X$ be the elapsed 
time from the instant a handoff attempt joins the queue to the first instant that a channel is 
released in the fully occupied target cell. For state numbers less than $C$, $X$ is equal to zero. 
Otherwise, $X$ is the minimum remaining holding time of those calls in progress in the fully 
occupied target cell. When a handoff attempt joins the queue for a given target cell, other 
handoff attempts may already be in the queue (each is associated with a particular mobile). 
When any of these first joined the queue, the time that it could remain on the queue without 
succeeding is denoted by $T_Q$ (according to our previous definition). Let $T_i$ be the remaining 
dwell time for that attempt which is in the $i$th queue position when another handoff attempt 
joins the queue. Under the memoryless assumptions here, the distributions of all $T_i$ and $T_Q$ 
are identical. Let $N(t)$ be the state number of the system at time $t$. From the description of 
this scheme and the properties of the exponential distribution it follows that 

$$\begin{aligned}
\Pr\{N(t + h) = C + k - 1 \mid N(t) = C + k\}
&= \Pr\{X \le h \text{ or } T_1 \le h \text{ or } \ldots \text{ or } T_k \le h\} \\
&= 1 - \Pr\{X > h \text{ and } T_1 > h \text{ and } \ldots \text{ and } T_k > h\} \\
&= 1 - \Pr\{X > h\} \Pr\{T_1 > h\} \cdots \Pr\{T_k > h\}
\end{aligned} \tag{11.15}$$

since the random variables $X, T_1, T_2, \ldots, T_k$ are independent. Because $X$ is exponentially 
distributed with rate $C\mu_H$ and each $T_i$ with rate $\mu_Q$, the last expression equals 
$1 - e^{-(C\mu_H + k\mu_Q)h}$, so the transition rate from state $C + k$ to state $C + k - 1$ is 
$C\mu_H + k\mu_Q$. From Equation (11.15) we see that the system follows a birth-and-death 
process, and the resulting state transition diagram is as shown in Figure 11.23. 

Figure 11.23 State-transition diagram for CRQ priority scheme. 

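As before, the probability distribution $\{P_j\}$ is easily found to be 

$$P_0 = \left[ \sum_{k=0}^{C-C_h} \frac{(\Lambda_R + \Lambda_{Rh})^k}{k!\, \mu_H^k} + \sum_{k=C-C_h+1}^{C} \frac{(\Lambda_R + \Lambda_{Rh})^{C-C_h}\, \Lambda_{Rh}^{k-(C-C_h)}}{k!\, \mu_H^k} + \sum_{k=C+1}^{\infty} \frac{(\Lambda_R + \Lambda_{Rh})^{C-C_h}\, \Lambda_{Rh}^{k-(C-C_h)}}{C!\, \mu_H^C \prod_{i=1}^{k-C} (C\mu_H + i\mu_Q)} \right]^{-1}$$

$$P_j = \begin{cases}
\dfrac{(\Lambda_R + \Lambda_{Rh})^j}{j!\, \mu_H^j}\, P_0, & \text{for } 1 \le j \le C - C_h \\[2ex]
\dfrac{(\Lambda_R + \Lambda_{Rh})^{C-C_h}\, \Lambda_{Rh}^{j-(C-C_h)}}{j!\, \mu_H^j}\, P_0, & \text{for } C - C_h + 1 \le j \le C \\[2ex]
\dfrac{(\Lambda_R + \Lambda_{Rh})^{C-C_h}\, \Lambda_{Rh}^{j-(C-C_h)}}{C!\, \mu_H^C \prod_{i=1}^{j-C} (C\mu_H + i\mu_Q)}\, P_0, & \text{for } j \ge C + 1
\end{cases} \tag{11.16}$$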



The probability of blocking is $P_B = \sum_{j=C-C_h}^{\infty} P_j$. A given handoff attempt that joins the 
queue will be successful if both of the following events occur before the mobile moves out 
of the handoff area: 

(1) All of the attempts that joined the queue earlier than the given attempt have been 
disposed of. 

(2) A channel becomes available when the given attempt is at the front of the queue. 

Thus the probability of a handoff attempt failure can be calculated as the average fraction of 
handoff attempts whose mobiles leave the handoff area prior to their coming into the queue 
front position and getting a channel. Noting that arrivals that find $k$ attempts in queue enter 
position $k + 1$, this can be expressed as 

$$P_{fh} = \sum_{k=0}^{\infty} P_{C+k}\, P_{fh|k} \tag{11.17}$$

where $P_{fh|k} = \Pr\{\text{attempt fails given it enters the queue in position } k + 1\}$. 

Since handoff success for those attempts which enter the queue in position $k + 1$ requires 
coming to the head of the queue and getting a channel, under the memoryless conditions 
assumed in this development, we have 

$$(1 - P_{fh|k}) = \left[ \prod_{i=1}^{k} P(i \mid i + 1) \right] \Pr\{\text{get channel in first position}\} \tag{11.18}$$

where $P(i \mid i + 1)$ is the probability that an attempt in position $i + 1$ moves to position $i$ 
before its mobile leaves the handoff area. 

There are two possible outcomes for an attempt in position $i + 1$. It will either be cleared 
from the system or will advance in the queue to the next (lower) position. It will advance if the 
remaining dwell time of its mobile exceeds either: 

(1) at least one of the remaining dwell times $T_j$, $j = 1, 2, \ldots, i$, for any attempt ahead 
of it in the queue; or 

(2) the minimum remaining holding time $X$ of those calls in progress in the target cell. 

Thus 

$$1 - P(i \mid i + 1) = \Pr\{T_{i+1} \le X,\ T_{i+1} \le T_j,\ j = 1, 2, \ldots, i\}, \quad i = 1, 2, \ldots \tag{11.19}$$

$$\begin{aligned}
1 - P(i \mid i + 1) &= \Pr\{T_{i+1} \le X,\ T_{i+1} \le T_1, \ldots, T_{i+1} \le T_i\} \\
&= \Pr\{T_{i+1} \le \min(X, T_1, T_2, \ldots, T_i)\} \\
&= \Pr\{T_{i+1} \le Y_i\}, \quad i = 1, 2, \ldots
\end{aligned} \tag{11.19a}$$

where $Y_i = \min(X, T_1, T_2, \ldots, T_i)$. Since the mobiles move independently of each other and 
of the channel holding times, the random variables $X$, $T_j$ $(j = 1, 2, \ldots, i)$ are statistically 
independent. Therefore, the cumulative distribution of $Y_i$ in Equation (11.19) can be written 
as 



$$F_{Y_i}(\tau) = 1 - \{1 - F_X(\tau)\}\{1 - F_{T_1}(\tau)\} \cdots \{1 - F_{T_i}(\tau)\}$$

Because of the exponentially distributed variables, this gives 

$$F_{Y_i}(\tau) = 1 - e^{-C\mu_H \tau}\, e^{-\mu_Q \tau} \cdots e^{-\mu_Q \tau} = 1 - e^{-(C\mu_H + i\mu_Q)\tau}$$

and Equation (11.19) becomes 

$$\begin{aligned}
1 - P(i \mid i + 1) &= \Pr\{T_{i+1} \le Y_i\} = \int_0^\infty \{1 - F_{Y_i}(\tau)\}\, f_{T_{i+1}}(\tau)\, d\tau \\
&= \int_0^\infty e^{-(C\mu_H + i\mu_Q)\tau}\, \mu_Q e^{-\mu_Q \tau}\, d\tau = \frac{\mu_Q}{C\mu_H + (i + 1)\mu_Q}, \quad i = 1, 2, \ldots
\end{aligned} \tag{11.20}$$



The handoff attempt at the head of the queue will get a channel (succeed) if its remaining 
dwell time $T_1$ exceeds $X$. Thus 

$$\begin{aligned}
\Pr\{\text{get channel in front position}\} &= \Pr\{T_1 > X\} \quad \text{and} \\
\Pr\{\text{does not get channel in front position}\} &= \Pr\{T_1 \le X\} = \frac{\mu_Q}{C\mu_H + \mu_Q}
\end{aligned} \tag{11.21}$$

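The probability in Equation (11.21) corresponds to letting $i = 0$ in Equation (11.20). Then 
from Equation (11.18) we have 

$$\begin{aligned}
1 - P_{fh|k} &= \left[ \prod_{i=1}^{k} P(i \mid i + 1) \right] \Pr\{\text{get channel in first position}\} \\
&= \frac{C\mu_H + \mu_Q}{C\mu_H + 2\mu_Q} \cdot \frac{C\mu_H + 2\mu_Q}{C\mu_H + 3\mu_Q} \cdots \frac{C\mu_H + k\mu_Q}{C\mu_H + (k + 1)\mu_Q} \cdot \frac{C\mu_H}{C\mu_H + \mu_Q} \\
&= \frac{C\mu_H}{C\mu_H + (k + 1)\mu_Q}
\end{aligned} \tag{11.22}$$

and 

$$P_{fh|k} = \frac{(k + 1)\mu_Q}{C\mu_H + (k + 1)\mu_Q} \tag{11.23}$$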

The above equations form a set of simultaneous nonlinear equations which can be solved
for system variables when parameters are given. Beginning with an initial guess for the
unknowns, the equations are solved numerically using the method of successive substitutions.
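
As a quick numerical check of the telescoping product in Equation (11.22), the sketch below
multiplies the per-position advance probabilities from Equation (11.20) and compares the result
with the closed form of Equation (11.23). The function names and the parameter values in the
usage note are illustrative assumptions, not part of the original analysis.

```python
def advance_prob(i, C, mu_H, mu_Q):
    """P(i | i+1): the attempt in queue position i+1 advances, from Equation (11.20)."""
    return 1.0 - mu_Q / (C * mu_H + (i + 1) * mu_Q)


def p_fh_given_k(k, C, mu_H, mu_Q):
    """P_fh|k evaluated as the explicit product in Equation (11.22)."""
    # Probability of getting a channel once at the head of the queue, Pr{T_1 > X}.
    success = C * mu_H / (C * mu_H + mu_Q)
    for i in range(1, k + 1):
        success *= advance_prob(i, C, mu_H, mu_Q)
    return 1.0 - success


def p_fh_given_k_closed(k, C, mu_H, mu_Q):
    """Closed form of Equation (11.23)."""
    return (k + 1) * mu_Q / (C * mu_H + (k + 1) * mu_Q)
```

For example, with the assumed values `C = 20`, `mu_H = 0.01` and `mu_Q = 0.1`, both functions
agree for every queue length `k`, and the failure probability grows with `k` as expected.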

A call which is not blocked will be eventually forced into termination if it succeeds in
each of the first $(l - 1)$ handoff attempts which it requires but fails on the $l$th. Therefore,

$$P_F = \sum_{l=1}^{\infty} P_{fh} P_N (1 - P_{fh})^{l-1} P_H^{l-1} = \frac{P_{fh} P_N}{1 - P_H (1 - P_{fh})} \tag{11.24}$$

where $P_N$ and $P_H$ are the probabilities of handoff demand of new and handoff calls, as
defined previously. Let $P_{nc}$ denote the fraction of new call attempts that will not be
completed because of either blocking or unsuccessful handoff. This is also an important system
performance measure. This probability $P_{nc}$ can be expressed as

$$P_{nc} = P_B + P_F (1 - P_B) = P_B + \frac{P_{fh} P_N (1 - P_B)}{1 - P_H (1 - P_{fh})} \tag{11.25}$$

where the first and second terms represent the effects of blocking and handoff attempt
failure, respectively. In Equation (11.25) we can guess roughly that, when cell size is large,
the probabilities of cell crossing $P_N$ and $P_H$ will be small and the second term of Equation
(11.25) (i.e. the effect of cell crossing) will be much smaller than the first term (i.e. effect
of blocking). However, when the cell size is decreased, $P_N$ and $P_H$ will increase. The
noncompleted call probability $P_{nc}$ can be considered as a unified measure of both blocking
and forced termination effects.

Another interesting measure of system performance is the weighted sum of $P_B$ and $P_F$

$$CF = (1 - \alpha) P_B + \alpha P_F \tag{11.26}$$

where $\alpha$ is in the interval $[0, 1]$ and indicates the relative importance of the blocking and
forced termination effects. For some applications $P_F$ may be more important than $P_B$ from
the user's point of view, and the relative cost $\alpha$ can be assigned using the system designer's
judgment.
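
The three performance measures of Equations (11.24) to (11.26) can be evaluated directly once
$P_{fh}$, $P_N$, $P_H$ and $P_B$ are known. The following is a minimal sketch; the function names are
assumptions made for illustration, and the truncated series is included only to confirm that the
geometric sum in Equation (11.24) matches its closed form.

```python
def forced_termination(P_fh, P_N, P_H):
    """Closed form of Equation (11.24)."""
    return P_fh * P_N / (1.0 - P_H * (1.0 - P_fh))


def forced_termination_series(P_fh, P_N, P_H, terms=200):
    """Truncated sum over l in Equation (11.24): fail on the l-th handoff after l-1 successes."""
    return sum(P_fh * P_N * ((1.0 - P_fh) * P_H) ** (l - 1) for l in range(1, terms + 1))


def noncompleted(P_B, P_F):
    """Equation (11.25): blocked at origination, or admitted and later forced to terminate."""
    return P_B + P_F * (1.0 - P_B)


def cost_function(P_B, P_F, alpha):
    """Equation (11.26): weighted sum with relative cost alpha in [0, 1]."""
    if not 0.0 <= alpha <= 1.0:
        raise ValueError("alpha must lie in [0, 1]")
    return (1.0 - alpha) * P_B + alpha * P_F
```

Setting `alpha` close to 1 makes the cost track forced terminations almost exclusively, which
corresponds to a designer who regards a dropped call as far worse than a blocked one.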



11.2.4 Performance examples 

For the calculations, the average message duration was taken as $\bar{T}_M = 120$ s and the maximum
speed of a mobile of $V_{max} = 60$ miles/h was used. The probabilities $P_B$ and $P_F$ as functions
of (new) call origination rate per unit area $\Lambda_a$ can be seen in Figure 11.24, with cell radius
$R$ being a parameter. A total of 20 channels per cell ($C = 20$) and one channel per cell for
handoff priority ($C_h = 1$) was assumed. The CRQ scheme was used for this figure, and the
mean dwell time for a handoff attempt $\bar{T}_Q$ was assumed to be $\bar{T}_H/10$. As can be seen, $P_F$
is much smaller than $P_B$ and the difference between them decreases as cell size decreases.
As expected, for larger $R$ the effect of handoff attempts and forced terminations on system
performance is smaller.

Figure 11.24 Blocking and forced termination probabilities for CRQ priority scheme.

Figure 11.25 Blocking and forced termination probabilities for CRQ systems with 20
channels/cell, R = 2 miles.

Figure 11.26 Blocking and forced terminations for priority CR and CRQ schemes (20
channels/cell, one handoff channel/cell, R = 2 miles).

Figure 11.25 shows $P_B$ and $P_F$ as functions of $\Lambda_a$. As the priority given to handoff
calls over new calls is increased by increasing $C_h$, $P_F$ decreases by orders of magnitude
with only a small to moderate increase in $P_B$. This exchange is important because (as was
mentioned previously) forced terminations are usually considered much less desirable than
blocked calls.

Blocking and forced termination probabilities for the two priority schemes are shown
in Figure 11.26 as functions of call origination rate density $\Lambda_a$. The forced termination
probability $P_F$ is smaller for the CRQ scheme, but almost no difference exists in blocking
probability $P_B$. The CRQ priority scheme gains this advantage by queuing the delayed
handoff attempts for the dwell time of the mobile in the handoff area.

11.3 CELL RESIDING TIME DISTRIBUTION

In this section we discuss the probability distributions of the residing times $T_n$ and $T_h$. The
random variable $T_n$ is defined as the time (duration) that a mobile resides in the cell in which
its call originated. Also $T_h$ is defined as the time a mobile resides in a cell to which its call
is handed off. To simplify analysis we approximate the hexagonal cell shape as a circle. For
a hexagonal cell having radius $R$, the approximating circle with the same area has a radius
$R_{eq}$, which is given by $R_{eq} = \sqrt{3\sqrt{3}/(2\pi)}\, R \approx 0.91R$ and illustrated in Figure 11.27. The
base station is assumed to be at the center of a cell and is indicated by a letter B in the
figure. The location of a mobile in a cell, which is indicated by a letter A in the figure, is
represented by its distance $r$ and direction $\phi$ from the base station as shown. To find the
distributions of $T_n$ and $T_h$, we assume that the mobiles are spread evenly over the area of
the cell. Then $r$ and $\phi$ are random variables with PDFs



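$$f_r(r) = \begin{cases} \dfrac{2r}{R_{eq}^2}, & 0 \le r \le R_{eq} \\ 0, & \text{elsewhere} \end{cases}
\qquad
f_\phi(\phi) = \begin{cases} \dfrac{1}{2\pi}, & 0 \le \phi \le 2\pi \\ 0, & \text{elsewhere} \end{cases} \tag{11.27}$$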



Next it is assumed that a mobile travels in any direction with equal probability and its
direction remains constant during its travel in the cell. If we define the direction of mobile
travel by the angle $\theta$ (with respect to a vector from the base station to the mobile), as
shown in the figure, the distance $Z$ from the mobile to the boundary of the approximating
circle is $Z = \sqrt{R_{eq}^2 - (r\sin\theta)^2} - r\cos\theta$. Because $\phi$ is evenly distributed in a circle, $Z$
is independent of $\phi$ and from the symmetry we can take the random variable $\theta$ to lie in the
interval $[0, \pi]$ with PDF

$$f_\theta(\theta) = \begin{cases} \dfrac{1}{\pi}, & 0 \le \theta \le \pi \\ 0, & \text{elsewhere} \end{cases} \tag{11.28}$$




Figure 11.27 Illustration of distance from point A in cell (where call is originated), to point
C on cell boundary (where mobile exits from cell).

If we define new random variables $x$, $y$ as $x = r\cos\theta$, $y = r\sin\theta$, then
$Z = \sqrt{R_{eq}^2 - y^2} - x$ and $W = x$. Since the mobile is assumed to be equally likely to be
located anywhere in the approximating circle

$$f_{XY}(x, y) = \begin{cases} \dfrac{2}{\pi R_{eq}^2}, & -R_{eq} \le x \le R_{eq},\; 0 \le x^2 + y^2 \le R_{eq}^2,\; 0 \le y \le R_{eq} \\ 0, & \text{elsewhere} \end{cases}$$



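From Equations (11.27) and (11.28), the joint density function of $Z$ and $W$ can be found by
standard methods

$$f_{ZW}(z, w) = \frac{|z + w|}{\sqrt{R_{eq}^2 - (z + w)^2}}\, f_{XY}(x, y) = \frac{2\,|z + w|}{\pi R_{eq}^2 \sqrt{R_{eq}^2 - (z + w)^2}}, \quad 0 \le z \le 2R_{eq},\; -\frac{z}{2} \le w \le -z + R_{eq}$$

The PDF of the distance $Z$ then becomes

$$f_Z(z) = \begin{cases} \displaystyle\int_{-z/2}^{R_{eq} - z} f_{ZW}(z, w)\, dw = \frac{2}{\pi R_{eq}^2} \sqrt{R_{eq}^2 - \left(\frac{z}{2}\right)^2}, & 0 \le z \le 2R_{eq} \\ 0, & \text{elsewhere} \end{cases} \tag{11.29}$$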



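If the speed $V$ of a mobile is constant during its travel in the cell and is a random variable
uniformly distributed on the interval $[0, V_{max}]$ with PDF

$$f_V(v) = \begin{cases} \dfrac{1}{V_{max}}, & 0 \le v \le V_{max} \\ 0, & \text{elsewhere} \end{cases}$$

then the time $T_n$ is expressed by $T_n = Z/V$ with PDF

$$f_{T_n}(t) = \int_{-\infty}^{\infty} |w|\, f_Z(tw)\, f_V(w)\, dw = \begin{cases} \dfrac{8R_{eq}}{3V_{max}\pi t^2} \left[1 - \left(1 - \left(\dfrac{V_{max} t}{2R_{eq}}\right)^2\right)^{3/2}\right], & 0 \le t \le \dfrac{2R_{eq}}{V_{max}} \\[2ex] \dfrac{8R_{eq}}{3V_{max}\pi t^2}, & t \ge \dfrac{2R_{eq}}{V_{max}} \end{cases} \tag{11.30}$$

and the CDF of $T_n$ is

$$F_{T_n}(t) = \begin{cases} 0, & t < 0 \\[1ex] \dfrac{2}{\pi}\arcsin\!\left(\dfrac{V_{max} t}{2R_{eq}}\right) - \dfrac{4}{3\pi}\tan\!\left[\dfrac{1}{2}\arcsin\!\left(\dfrac{V_{max} t}{2R_{eq}}\right)\right] + \dfrac{1}{3\pi}\sin\!\left[2\arcsin\!\left(\dfrac{V_{max} t}{2R_{eq}}\right)\right], & 0 \le t \le \dfrac{2R_{eq}}{V_{max}} \\[2ex] 1 - \dfrac{8R_{eq}}{3\pi V_{max} t}, & t \ge \dfrac{2R_{eq}}{V_{max}} \end{cases} \tag{11.31}$$

Figure 11.28 Illustration of distance from cell entering point (A on cell boundary), to cell
exiting point (C on cell boundary).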



To find the distribution of $T_h$, we next note that, when a handoff call is attempted, it
is always generated at the cell boundary, which is taken as the boundary of the approximating
circle. Therefore, to find $T_h$ one must recognize that the mobile will move from one point on
the boundary to another. The direction of a mobile when it crosses the boundary is indicated
by the angle $\theta$ between the direction of the mobile and the direction from the mobile to the
center of a cell as shown in Figure 11.28 [75, 76].

If the mobile moves in any direction with equal probability, the random variable $\theta$ has
PDF given by

$$f_\theta(\theta) = \begin{cases} \dfrac{1}{\pi}, & -\dfrac{\pi}{2} \le \theta \le \dfrac{\pi}{2} \\ 0, & \text{elsewhere} \end{cases}$$

The distance $Z$ as shown in Figure 11.28 is $Z = 2R_{eq}\cos\theta$, which has a CDF given by

$$F_Z(z) = \Pr\{Z \le z\} = \begin{cases} 0, & z < 0 \\ 1 - \dfrac{2}{\pi}\arccos\!\left(\dfrac{z}{2R_{eq}}\right), & 0 \le z \le 2R_{eq} \\ 1, & z > 2R_{eq} \end{cases} \tag{11.32}$$



The PDF of $Z$ is

$$f_Z(z) = \frac{d}{dz} F_Z(z) = \begin{cases} \dfrac{1}{\pi\sqrt{R_{eq}^2 - (z/2)^2}}, & 0 \le z < 2R_{eq} \\ 0, & \text{elsewhere} \end{cases}$$

The time in the cell $T_h$ is the time that a mobile travels the distance $Z$ with speed $V$, so
$T_h = Z/V$. With the same assumption about $V$, the PDF of $T_h$ is

$$f_{T_h}(t) = \begin{cases} \dfrac{4R_{eq}}{\pi V_{max} t^2} \left[1 - \sqrt{1 - \left(\dfrac{V_{max} t}{2R_{eq}}\right)^2}\,\right], & 0 \le t \le \dfrac{2R_{eq}}{V_{max}} \\[2ex] \dfrac{4R_{eq}}{\pi V_{max} t^2}, & t \ge \dfrac{2R_{eq}}{V_{max}} \end{cases}$$

and the CDF of $T_h$ is

$$F_{T_h}(t) = \begin{cases} 0, & t < 0 \\[1ex] \dfrac{2}{\pi}\arcsin\!\left(\dfrac{V_{max} t}{2R_{eq}}\right) - \dfrac{2}{\pi}\tan\!\left[\dfrac{1}{2}\arcsin\!\left(\dfrac{V_{max} t}{2R_{eq}}\right)\right], & 0 \le t \le \dfrac{2R_{eq}}{V_{max}} \\[2ex] 1 - \dfrac{4R_{eq}}{\pi V_{max} t}, & t \ge \dfrac{2R_{eq}}{V_{max}} \end{cases}$$

Figure 11.29 shows the mean channel holding time in a cell, $\bar{T}_H$. Notice that $\bar{T}_H$ becomes
smaller with smaller cell size, but sensitivity to change in cell size is smaller for larger cells.
Finally, earlier in Section 11.2 we approximated the cumulative distribution function of
the channel holding time in a cell as suggested in References [75, 76]. The goodness-of-fit
$G$ of this approximation, defined in Equation (11.8), is shown in Table 11.1 for various cell
sizes. We see that $G$ is very small for a wide range of cell radius $R$. These values support
the use of the approximation in the calculations.

Figure 11.29 Mean channel holding time (s) in cell vs R (average call duration = 120 s).

Table 11.1 The goodness-of-fit G of the approximation

Cell radius, R      G
                    0.020220
                    0.000003
8.0
10.0                0.000107
12.0                0.000086
14.0                0.000066
16.0                0.000053
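
The residing time $T_n$ of a new call can also be checked by simulation. The sketch below draws
the mobile position ($r$, $\theta$) and speed $V$ under the assumptions of this section and estimates
$\Pr\{T_n \le t\}$. For $t \ge 2R_{eq}/V_{max}$ the result is compared with the tail
$\Pr\{T_n > t\} = E[Z]/(V_{max} t)$, where $E[Z] = 8R_{eq}/(3\pi)$ follows from a uniform position and
direction in the approximating circle. The function names, sample size and seed are
illustrative assumptions.

```python
import math
import random


def sample_new_call_dwell(R_eq, V_max, rng):
    """Draw one residing time T_n = Z / V for a call originated inside the cell."""
    r = R_eq * math.sqrt(rng.random())        # PDF 2r / R_eq^2 on [0, R_eq]
    theta = math.pi * rng.random()            # uniform on [0, pi]
    z = math.sqrt(R_eq ** 2 - (r * math.sin(theta)) ** 2) - r * math.cos(theta)
    v = V_max * (1.0 - rng.random())          # uniform on (0, V_max], avoids v == 0
    return z / v


def empirical_cdf_at(t, R_eq, V_max, n=100_000, seed=1):
    """Monte Carlo estimate of Pr{T_n <= t}."""
    rng = random.Random(seed)
    hits = sum(sample_new_call_dwell(R_eq, V_max, rng) <= t for _ in range(n))
    return hits / n


def tail_cdf(t, R_eq, V_max):
    """Analytic Pr{T_n <= t}, valid only for t >= 2 * R_eq / V_max."""
    if t < 2.0 * R_eq / V_max:
        raise ValueError("closed form used here holds only for t >= 2 R_eq / V_max")
    return 1.0 - 8.0 * R_eq / (3.0 * math.pi * V_max * t)
```

With $R_{eq}$ in miles and $V_{max}$ in miles/h the time $t$ is in hours. The heavy $1/t$ tail comes
from slow mobiles, which is why the mean of $T_n$ is dominated by the call duration rather than
by the cell geometry once the cell is large.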



11.4 MOBILITY PREDICTION IN PICO- AND
MICROCELLULAR NETWORKS

It should be expected that 4G networks will further reduce the size of the cells. In a micro-
and picocellular network, resource availability varies frequently as users move from one
access point to another. In order to deterministically guarantee QoS support for a mobile
unit, the network must have prior exact knowledge of the mobile's path, along with arrival
and departure times to every cell along the path. With this knowledge, the network can
verify the feasibility of supporting the call during its lifetime, as the mobile moves across
the network. It is impractical, however, to require the users to inform the network of their
exact movement, since they may not know this information a priori. Even if the path is
known, the exact arrival and departure times to the cells along the path are still hard to
determine in advance. Therefore, it becomes crucial to have an accurate mechanism to
predict the trajectory of the mobile user.

As an example, the virtual connection tree is designed to support QoS guarantees for 
mobile units [77]. In this scheme, a number of adjacent cells are grouped into a cell cluster 
in a static fashion. Upon the admission of a call, the scheme pre-establishes a connection 
between a root switch and each base station in the cell cluster. The scheme does not take 
user mobility into consideration to predict the set of base stations which may potentially 
be visited by the mobile unit. This may result in an unnecessary resource overloading that 
may underutilize the network resources and cause severe congestion. 

The shadow cluster (SC) scheme [78] provides a distributed call-admission control 
(CAC) based on the estimated bandwidth requirements in the SC. An SC is a collection 
of base stations to which a mobile unit is likely to attach in the future. The admission 
decision is made in a distributed fashion by all the base stations within the SC. The scheme 
partitions the time into predefined intervals and verifies the feasibility of supporting calls 
over those intervals. This requires the communication of a large number of messages between
base stations during every time interval. Moreover, since bandwidth estimates are calculated 
at the beginning of each time interval and the admission decisions are made at the end of 
each time interval, admission of new calls is delayed for at least a time equal to the length 
of these predefined time intervals. 

Both of the above two schemes lack the mechanism to predict the mobile's trajectory 
and determine the future cells to which the mobile may hand off. Several techniques have 
been proposed in the literature to address this issue. In Bharghavan and Mysore [79], a 
profile-based algorithm is proposed to predict the next cell that the mobile unit will hand
off to, using a user profile and a cell profile, which are simply the aggregate values of the
handoff history. In Liu and Maguire [80], a mobile motion prediction (MMP) algorithm
is proposed to predict the future locations of the mobile unit. This algorithm is based on a 
pattern matching technique that exploits the regularity of the users' movement patterns. The 
MMP algorithm was further expanded to a two-tier hierarchical location prediction (HLP) 
algorithm [81]. In the latter case, the two-tiered prediction scheme involves both an intercell 
and an intracell tracking and prediction component. The first tier uses an approximate 
pattern matching technique to predict the global intercell direction and the second tier uses 
an extended self-learning Kalman filter to predict the trajectory within the cell using the 
measurements received by the mobile unit. 

In order to support QoS guarantees for multiple classes of services, the scheme must
integrate call admission control with the mobility profile of the mobile user. The integration
of these two components makes it possible to use mobility prediction to verify the feasibility
of admitting a new call and make sure that the required QoS can be supported during the
lifetime of the call. In other words, we should be able to predict the location (space) and the
time at which certain resources will be needed in the network. This concept will be referred to
as predictive space and time QoS, or PST-QoS. The mobility prediction algorithm must be easy
to implement and maintain, since it will be invoked on a per-user basis. Furthermore, the
admission control procedure must be invoked only when needed with minimum overhead
and in a distributed fashion, where each network cell, potentially involved in supporting the
QoS requirements of the call, participates in the decision process [82, 83].

In this section, we present such a framework, which efficiently integrates mobility
prediction and CAC, to provide support for PST-QoS guarantees, where each call is guaranteed
its QoS requirements for the time interval that the mobile unit is expected to spend within
each cell it is likely to visit during the lifetime of the call.

In this framework, efficient support of PST-QoS guarantees is achieved based on an 
accurate estimate of the mobile's trajectory as well as the arrival and departure times for each
cell along the path. Using these estimates, the network can determine if enough resources 
are available in each cell along the mobile's path to support the QoS requirements of the 
call. The framework is designed to easily accommodate dynamic variations in network 
resources. The basic components of this framework are: 

(1) a predictive service model to support timed-QoS guarantees; 

(2) a mobility model to determine the mobile's most likely cluster (MLC); the MLC 
represents a set of cells that are most likely to be visited by a mobile unit during its 
itinerary; and 

(3) a CAC model to verify the feasibility of supporting a call within the MLC. 

The service model accommodates different types of applications by supporting integral
and fractional predictive QoS guarantees over a predefined time-guarantee period. The
MLC model is used to actively predict the set of cells that are most likely to be visited by
the mobile unit. For each MLC cell, the mobile's earliest arrival time, latest arrival time and
latest departure time are estimated. These estimates are then used by the CAC to determine
the feasibility of admitting a call by verifying that enough resources are available in each
of the MLC cells during the time interval between the mobile's earliest arrival time and its
latest departure time. If available, resources are then reserved for the time interval between
the mobile's earliest arrival time and latest departure time and leased for the time interval
between the mobile's earliest and latest arrival times. If the mobile unit does not arrive before
the lease expires, the reservation is canceled and the resources are returned to the pool of
available resources. The unique feature of this framework is the ability to combine the
mobility model with the CAC model to determine the level of PST-QoS guarantees that the
network can provide to a call and dynamically adjust these guarantees as the mobile unit
moves across the network.

11.4.1 PST-QoS guarantees framework

The first approach, to achieve a high level of QoS support guarantees in mobile environments, 
is to allocate resources for the duration of the call in all future cells that the mobile unit 
will visit. This means that the resources within each cell that is to be visited will be held, 
possibly for the duration of the call, even if the mobile never moves into the cell. This 
approach is similar to the one proposed in Talukdar et al. [85] and will be referred to as a 
predictive space or PS QoS model. Clearly, such an approach will result in underutilization 
of the network resources as resources are being held, but not used by any call. 

The second approach is to only reserve resources in all future cells that the mobile unit
may visit for the time interval during which the mobile will reside in each cell. If $t_i$ and
$t_{i+1}$ represent the expected arrival and departure times of the mobile unit to cell $i$ along the
path, respectively, resources in cell $i$ will only be reserved for the time interval $[t_i, t_{i+1}]$
[84]. Unlike the first approach, this approach is likely to increase resource utilization, since
resources in every cell remain available to other calls outside the reservation intervals.
This approach, however, is only feasible if exact knowledge of the mobile path and arrival
and departure times to every cell along the path is available. Obtaining exact knowledge
of mobile mobility is not possible in most cases, due to the uncertainty of the mobile
environments and the difficulty in specifying the mobility profiles of mobile units. An
acceptable level of service guarantees, however, can be achieved if the path of the mobile
can be predicted accurately. This approach is discussed in this section and will be referred
to as predictive space and time or the PST QoS model. The model attempts to achieve
a balance between an acceptable level of service guarantees and a high level of network
resource utilization. Based on this model, the required QoS support level is guaranteed by
reserving resources in advance in each cell that is most likely to be visited by the mobile
unit. These reservations only extend for a time duration equal to the time interval the mobile
unit is expected to spend within a cell, starting from its arrival time at the cell
until its departure time from the cell. In order to characterize the set of 'most likely' cells
and capture the level of QoS guarantees requested by the application, the service model
uses the following parameters:

(1) the time guarantee period $T_G$;

(2) a cluster-reservation threshold $\tau$; and

(3) a bandwidth-reservation threshold $\gamma$.

All of these parameters are application-dependent. The parameter $T_G$ specifies the time
duration for which the required QoS level is guaranteed; $\tau$ defines the minimum percentage
of the most likely cells to be visited by the mobile unit that must support the required QoS
level for the guarantee period $T_G$. The parameter $\gamma$ represents the minimum percentage of
the required bandwidth that must be reserved in every cell that is most likely to be visited.

To accommodate different types of applications, the service model provides two types
of predictive service guarantees, namely, integral guaranteed service and fractional
guaranteed service. The integral guaranteed service ensures that all cells, which are most likely
to be visited by the mobile unit, can support the requested bandwidth requirements for the
lifetime of the call. In this case, $T_G$ must be equal to the call duration and $\tau$ and $\gamma$ are both
equal to 100 %. The fractional guaranteed service, on the other hand, guarantees that at least
$\tau$ % of these cells can support at least $\gamma$ % of the requested bandwidth requirements for
the next $T_G$ interval. A special case arises when either the value of $\tau$ or $\gamma$ is zero. In this
case, the service is referred to as best effort.
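
The three service types can be told apart from the parameters alone. The following is a
minimal sketch of that classification and of the fractional-guarantee condition; the function
names, the percentage encoding of the two thresholds and the list of per-cell supportable
bandwidths are illustrative assumptions rather than part of the framework's specification.

```python
def classify_service(tau_pct, gamma_pct, T_G, call_duration):
    """Map (tau, gamma, T_G) to the service type described in the text."""
    if not (0 <= tau_pct <= 100 and 0 <= gamma_pct <= 100):
        raise ValueError("thresholds are percentages in [0, 100]")
    if tau_pct == 0 or gamma_pct == 0:
        return "best effort"
    if tau_pct == 100 and gamma_pct == 100 and T_G == call_duration:
        return "integral"
    return "fractional"


def fractional_guarantee_met(supportable_bw, requested_bw, tau_pct, gamma_pct):
    """True if at least tau % of the cells can carry at least gamma % of the request.

    supportable_bw: bandwidth each most-likely cell can offer over the next T_G interval.
    """
    needed = gamma_pct / 100.0 * requested_bw
    good_cells = sum(1 for bw in supportable_bw if bw >= needed)
    # Compare counts with integers to avoid rounding problems in the percentage test.
    return good_cells * 100 >= tau_pct * len(supportable_bw)
```

For instance, with four candidate cells offering 10, 6, 3 and 8 units against a request of 10
units, a 50 % bandwidth threshold is met by three cells, so a 75 % cluster threshold is
satisfied while an 80 % threshold is not.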

11.4.2 Most likely cluster model

The MLC model considers that the 'most likely to be visited' property of a cell is directly 
related to the position of the cell with respect to the estimated direction of the mobile unit. 
This likelihood is referred to as directional probability. Based on this metric, cells that are 
situated along the mobile unit's direction have higher directional probabilities and are more 
likely to be visited than those that are situated outside of this direction. 

Based on the above, the MLC at any point in time during the lifetime of a call is defined as
a collection of contiguous cells, each of which is characterized by a directional probability
that exceeds a certain threshold. For each MLC cell, the expected arrival and departure
times of the mobile are estimated. Using these estimates, the feasibility of supporting the
requested level of timed-QoS guarantees during the mobile's residence time within each cell
along the path is verified. In the following, we present the method used to predict the direction
of a mobile unit and the scheme used to construct its MLC. We then describe the algorithm
used to estimate the expected times of arrival and departure of the mobile unit to a given
cell within the MLC [84].

The direction-prediction method used by MLC to predict the mobile user's direction is
based on the history of its movement. It is clear, however, that the prediction method used
should not be greatly affected by small deviations in the mobile direction. Furthermore, the
method should converge rapidly to the new direction of the mobile unit. To take the above
properties into consideration, a first-order autoregressive filter, with a smoothing factor $\alpha$,
is used. More specifically, let $D_0$ be the current direction of the mobile unit when the call is
made. Notice that, when the mobile is stationary within a cell, it is assumed that the current
cell is the only member of the MLC, so reservations are done only within the current cell.
If $D_t$ represents the observed direction of the mobile unit at time $t$ and $\hat{D}_t$ represents the
estimated direction at time $t$, the predicted direction $\hat{D}_{t+1}$ at $t + 1$ is obtained as
$\hat{D}_{t+1} = (1 - \alpha)\hat{D}_t + \alpha D_t$. In order to track the actual direction of the mobile unit more
accurately, the smoothing factor $\alpha$ is computed as $\alpha = cE_s^2/\sigma_{s+1}$, where $0 < c < 1$,
$E_s = D_s - \hat{D}_s$ is the prediction error, and $\sigma_s$ is the average of the past square prediction
errors at time $s$. $\sigma_s$ is updated recursively as $\sigma_{s+1} = cE_s^2 + (1 - c)\sigma_s$.
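
The filter above can be sketched in a few lines. The class name, the choice to start the
average squared error at zero, and the treatment of directions as plain real numbers (no
wrap-around at $2\pi$) are assumptions made to keep the example short.

```python
class DirectionPredictor:
    """First-order autoregressive direction filter with an adaptive smoothing factor."""

    def __init__(self, d0, c=0.5, sigma0=0.0):
        if not 0.0 < c < 1.0:
            raise ValueError("c must satisfy 0 < c < 1")
        self.c = c
        self.estimate = d0      # estimated direction, initialised to D_0
        self.sigma = sigma0     # running average of squared prediction errors (assumed start)

    def update(self, observed):
        """Consume one observed direction and return the prediction for the next step."""
        err = observed - self.estimate                       # E_s = D_s - estimated D_s
        self.sigma = self.c * err * err + (1.0 - self.c) * self.sigma
        # alpha = c * E_s^2 / sigma_{s+1}; it is 0 when no error has ever been observed.
        alpha = self.c * err * err / self.sigma if self.sigma > 0.0 else 0.0
        self.estimate = (1.0 - alpha) * self.estimate + alpha * observed
        return self.estimate
```

Because $\sigma_{s+1} \ge cE_s^2$, the factor $\alpha$ always lies in $[0, 1]$. A large error relative
to the recent history pushes $\alpha$ toward 1, so the estimate follows a genuine turn quickly,
while small fluctuations after a run of larger errors give a small $\alpha$ and are smoothed out.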

The directional probability, at any point in time $t$, of any cell being visited next by a
mobile unit, can be derived based on the current cell, where the mobile resides, and the
estimated direction $\hat{D}_t$ of the mobile unit at time $t$. The basic property of this probability
distribution is that for a given direction, the cell that lies on the estimated direction from
the current cell has the highest probability of being visited in the future [83]. Consider a
mobile unit currently residing at cell $i$ coming from cell $m$ and let $j = 1, 2, \ldots$ represent a
set of adjacent cells to cell $i$. Each cell $j$ is situated at an angle $\omega_{ij}$ from the x-axis passing
by the center of cell $i$, as presented in Figure 11.30. If we define the directional path from
$i$ to $j$ as the direct path from the center of cell $i$ to the center of cell $j$, the directionality
$D_{ij}$ for a given cell $j$ can be expressed as

$$D_{ij} = \begin{cases} \dfrac{\theta_{ij}}{\phi_{ij}}, & \phi_{ij} > 0 \\ \theta_{ij}, & \phi_{ij} = 0 \end{cases} \tag{11.36}$$

where $\phi_{ij}$ is an integer representing the deviation angle between the straight path to
destination and the directional path from $i$ to $j$, while $\theta_{ij}$ represents the angle between the
directional path from $m$ to $i$ and the directional path from $i$ to $j$.

Figure 11.30 Parameters used to calculate the directional probability.

Based on its directionality $D_{ij}$, the directional probability $P_{i \to j}$ of cell $j$ being visited
next by a mobile unit currently at cell $i$ can be expressed as $P_{i \to j} = D_{ij} / \sum_k D_{ik}$, where $k$ is
a cell at the same ring as $j$ with respect to $i$. A cell $k$ is said to be at ring $L$ with respect to cell
$i$ if it is located at a ring $L$ cells away from $i$. For a given cell $i$, the directional probabilities
$P_{i \to j}$ provide the basis upon which MLCs are formed as the mobile units move across the
network.
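
The normalization of directionalities into directional probabilities can be sketched as
follows. The function names and the dictionary layout are assumptions for illustration, and the
angle values in the usage note are hypothetical numbers in degrees chosen only to exercise both
branches of Equation (11.36).

```python
def directionality(theta_ij, phi_ij):
    """Equation (11.36): theta / phi for a non-zero deviation angle, theta otherwise."""
    if phi_ij < 0:
        raise ValueError("deviation angle is taken as a non-negative magnitude")
    return theta_ij / phi_ij if phi_ij > 0 else theta_ij


def directional_probabilities(ring_cells):
    """Normalise directionalities over the cells of one ring.

    ring_cells: dict mapping a cell id to its (theta_ij, phi_ij) pair.
    Returns a dict mapping each cell id to P_{i -> j}.
    """
    d = {j: directionality(theta, phi) for j, (theta, phi) in ring_cells.items()}
    total = sum(d.values())
    if total <= 0:
        raise ValueError("at least one cell must have positive directionality")
    return {j: value / total for j, value in d.items()}
```

With the hypothetical input `{"a": (180, 0), "b": (120, 60), "c": (60, 120)}` the
directionalities are 180, 2 and 0.5, so cell `a`, which has no deviation from the straight path
to the destination, receives almost all of the probability mass.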

11.4.2.1 Forming the most likely cluster

Starting from the cell where the call originated, a mobile unit is expected to progress 
toward its destination. The mobile unit, however, can temporarily deviate from its long- 
term direction to the destination, but is expected to converge back at some point in time 
toward its destination. This mobility behavior can be used to determine the cells that are 
likely to be visited by a mobile unit. 

Let us define the forward span as the set of cells situated within an angle $\delta_i$ with respect
to the estimated direction $\hat{D}_t$ of the mobile unit, as illustrated in Figure 11.31. Based on the
directional probabilities and the definition of a forward span, the MLC of a given mobile
unit $u$ currently located at cell $i$, denoted as $C_i^{MLC}(u)$, can be expressed as
$C_i^{MLC}(u) = \{\text{cells } j \mid \phi_{ij} \le \delta_i,\; j = 1, 2, \ldots\}$, where $\phi_{ij}$ is the deviation angle between
the straight path to destination and the directional path from $i$ to $j$. The angle $\delta_i$ is defined
such that $P_{i \to j} \ge \mu$, where $\mu$ represents a system-defined threshold on the likelihood that a
cell is to be visited. More specifically, $\delta_i$ can be expressed as $\delta_i = \max |\phi_{ij}|$ such that
$P_{i \to j} \ge \mu$.

The next step in the process of forming the MLC is to decide on the size of the MLC
window, $W_{MLC}$, which represents the number of adjacent rings of cells to be included in the
MLC. Define $Ring_{i,j}$ to be the ring at which cell $j$ is located with respect to cell $i$. Therefore,
a cell $k$ is included in $C_i^{MLC}(u)$ if $Ring_{i,k} \le W_{MLC}$, which gives
$C_i^{MLC}(u) = \{\text{cells } j \mid \phi_{ij} \le \delta_i \text{ and } Ring_{i,j} \le W_{MLC},\; j = 1, 2, \ldots\}$. The size of the MLC
window has a strong impact on the performance of the scheme. Increasing the MLC window
size, by including more rings, increases the likelihood of supporting the required QoS if the
mobile moves along the predicted direction $\hat{D}_t$. On the other hand, if the mobile deviates from
the predicted direction, increasing the MLC window size may not ensure the continued support
of the call, as the mobile unit may move out from the MLC. A possible approach is to reward
users who move within the predicted direction by increasing their MLC window size up
to a maximum $R_{max}$. The value of $R_{max}$ depends on the value of the guarantee period $T_G$.
Higher values of $T_G$ result in larger values of $R_{max}$.

Figure 11.31 Definition of the MLC.
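
The two membership conditions, forward span and window size, can be combined as in the sketch
below. The function names and the tuple layout (deviation angle, directional probability,
ring index) are assumptions made for illustration; the threshold and window values in the
usage note are hypothetical.

```python
def forward_span_angle(cells, mu):
    """delta_i: largest |phi_ij| among cells whose directional probability is at least mu.

    cells: dict mapping a cell id to (phi_ij, p_ij, ring_ij).
    Returns None when no cell reaches the threshold.
    """
    eligible = [abs(phi) for phi, p, _ring in cells.values() if p >= mu]
    return max(eligible) if eligible else None


def most_likely_cluster(cells, mu, w_mlc):
    """Cells inside the forward span and within w_mlc rings of the current cell."""
    delta = forward_span_angle(cells, mu)
    if delta is None:
        return set()
    return {
        j
        for j, (phi, _p, ring) in cells.items()
        if abs(phi) <= delta and ring <= w_mlc
    }
```

Given five hypothetical cells, two on ring 1 and two on ring 2 inside a 60 degree span plus one
ring 1 cell at 120 degrees with low probability, a window of one ring keeps only the two near
cells, and a window of two rings adds the two farther cells in the same span.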

When the user deviates from the estimated direction, the MLC window size is decreased
by an amount proportional to the degree of deviation. As a result, support of the predictable
users' QoS requirements can be achieved with high probability, whereas users with
unpredictable behavior do not unnecessarily overcommit the network resources. The algorithm
dynamically updates the size of the MLC window based on the observed movement
patterns of the mobile users. If $\Delta_t$ is the measure of the mobile's deviation with respect to the
estimated direction at time $t$, defined as $\Delta_{t+1} = \beta\Delta_t + (1 - \beta)|D_t - \hat{D}_t|$ with $0 < \beta < 1$
and $\Delta_0$ equal to zero, the MLC window size $W_{MLC}$ at time $t$ can be defined as follows:

(11.37)

The MLC window size is recalculated at every handoff; therefore, the window size shrinks
and grows depending on the mobile's behavior.
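
The deviation measure is an exponentially weighted average and is straightforward to compute.
The sketch below implements that update as stated in the text. The window rule that follows it
is only an illustrative stand-in that shrinks linearly with the deviation and is capped at
$R_{max}$; it is an assumption for demonstration and should not be read as the window-size
equation of the scheme.

```python
import math


def update_deviation(delta_t, observed, estimated, beta):
    """Delta_{t+1} = beta * Delta_t + (1 - beta) * |D_t - estimated D_t|."""
    if not 0.0 < beta < 1.0:
        raise ValueError("beta must satisfy 0 < beta < 1")
    return beta * delta_t + (1.0 - beta) * abs(observed - estimated)


def illustrative_window(delta_t, r_max):
    """Hypothetical window rule (an assumption, not the scheme's own equation).

    The window equals r_max for a perfectly predictable user and shrinks in
    proportion to the deviation (in radians), never dropping below one ring.
    """
    fraction = max(0.0, 1.0 - delta_t / math.pi)
    return max(1, math.ceil(fraction * r_max))
```

Under this stand-in rule a user with zero accumulated deviation keeps the full window, while a
user whose average deviation reaches $\pi$ radians is reduced to a single ring, which mirrors the
reward and penalty behavior described above.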

The method can be easily extended to a cellular network with cells of different sizes. When
a cellular network has cells of different sizes, the definition of rings is different. The rings
are imaginary circles centered at the current cell. The radius of the first ring $R_1$ is equal to
the distance from the center of the current cell to the center of the neighboring cell whose
center is farthest away. Consequently, the radius of a ring $R_i$, where $i = 1, 2, \ldots$, is equal to
$i \times R_1$. Any cell that has its center within the boundaries of a ring is considered in that ring.
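
For cells of different sizes, ring membership reduces to a distance test. The sketch below is
one possible reading, assuming that ring $i$ covers centers at distances in
$((i-1)R_1,\; iR_1]$ from the current cell; the function names and planar coordinates are
illustrative assumptions.

```python
import math


def first_ring_radius(current_center, neighbor_centers):
    """R_1: distance to the farthest neighboring cell center."""
    return max(math.dist(current_center, c) for c in neighbor_centers)


def ring_index(current_center, cell_center, r1):
    """Ring number of a cell under the assumed ((i-1) R_1, i R_1] convention.

    Returns 0 for the current cell itself.
    """
    d = math.dist(current_center, cell_center)
    return 0 if d == 0 else max(1, math.ceil(d / r1))
```

A cell whose center lies just beyond $R_1$ is therefore assigned to ring 2, even if it is a
small cell physically close to the first ring.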

The time of arrival and residence time of the mobile can be estimated for each MLC
cell. Based on these estimates, the feasibility of supporting the requested level of timed-QoS
guarantees within the residence time can then be verified. The cell residence time within cell
$j$ for a mobile unit currently in cell $i$ is characterized by three parameters, namely, expected
earliest arrival time $T_{EA}(i, j)$, expected latest arrival time $T_{LA}(i, j)$, and expected latest
departure time $T_{LD}(i, j)$. Consequently, $[T_{EA}(i, j), T_{LD}(i, j)]$ is the expected residence
time of the mobile unit within cell $j$. This interval is referred to as the resource reservation
interval (RRI), while the interval $[T_{EA}(i, j), T_{LA}(i, j)]$ is referred to as the resource leasing
interval (RLI). Resources are reserved for the entire duration of RRI. However, if the mobile
does not arrive at cell $j$ before RLI expires, all resources are released and the reservation is
canceled. This is necessary to prevent mobile units from holding resources unnecessarily.
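
The interplay between the reservation interval and the lease can be captured in a small data
structure. This is a minimal sketch under the assumption that times are plain numbers on a
common clock; the class and method names are hypothetical.

```python
from dataclasses import dataclass


@dataclass
class Reservation:
    """Resources held in one MLC cell for one mobile unit."""

    t_ea: float            # expected earliest arrival time
    t_la: float            # expected latest arrival time
    t_ld: float            # expected latest departure time
    arrived: bool = False  # set to True once the mobile enters the cell

    def __post_init__(self):
        if not self.t_ea <= self.t_la <= self.t_ld:
            raise ValueError("expected t_ea <= t_la <= t_ld")

    def rri(self):
        """Resource reservation interval [T_EA, T_LD]."""
        return (self.t_ea, self.t_ld)

    def rli(self):
        """Resource leasing interval [T_EA, T_LA]."""
        return (self.t_ea, self.t_la)

    def is_active(self, now):
        """True while resources are still held for this mobile at time `now`."""
        if now < self.t_ea or now > self.t_ld:
            return False
        if not self.arrived and now > self.t_la:
            return False   # lease expired without arrival: reservation is canceled
        return True
```

A reservation with times 10, 20 and 50 is active at time 15 regardless of arrival, lapses at
time 25 if the mobile has not shown up, and stays active until time 50 once it has.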

In order to derive these time intervals, one can adopt the method used in the SC and
consider all possible paths from the current cell to each cell in the cluster [78]. This method
can be complex, since there are many possible paths that a mobile unit may follow to reach
a cell. The approach taken in the MLC model is based on the concept of most likely paths
[84].

Consider a mobile unit $u$, currently located at cell $m$, and let $C_m^{MLC}(u)$ denote its MLC.
Define $G = (V, E)$ to be a directed graph, where $V$ is a set of vertices and $E$ a set of edges.
A vertex $v_i \in V$ represents MLC cell $i$. For each cell $i$ and $j$ in $C_m^{MLC}(u)$, an edge $(v_i, v_j)$
is in $E$ if and only if $j$ is a reachable direct neighbor of $i$. Each directed edge $(v_i, v_j)$ in
$G$ is assigned a cost $1/P_{i \to j}$.

A path $\Pi$ between MLC cells $i$ and $k$ is defined as a sequence of edges $(v_i, v_{i+1})$,
$(v_{i+1}, v_{i+2}), \ldots, (v_{k-1}, v_k)$. The cost of a path between MLC cells $i$ and $k$ is derived from
the cost of its edges so that the least costly path represents the most likely path to be followed
by the mobile. A k-shortest paths algorithm [86] is then used to obtain the set $K$ of k-most
likely paths to be followed by the mobile unit.
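
As an illustration of the graph construction, the sketch below finds the single most likely
path with Dijkstra's algorithm, which corresponds to the special case of one path rather than
the full k-shortest paths search cited above. It assumes that the path cost is the sum of the
edge costs $1/P_{i \to j}$; the function name, the dictionary layout and the cell labels are
hypothetical.

```python
import heapq


def most_likely_path(prob, source, target):
    """Least-cost path when each edge (i, j) costs 1 / P_{i -> j}.

    prob: dict mapping a directed edge (i, j) to its directional probability.
    Returns (cost, [cells along the path]) or None if the target is unreachable.
    """
    graph = {}
    for (i, j), p in prob.items():
        if p > 0:                      # edges with zero probability are not reachable
            graph.setdefault(i, []).append((j, 1.0 / p))

    best = {source: 0.0}
    heap = [(0.0, source, [source])]
    while heap:
        cost, node, path = heapq.heappop(heap)
        if node == target:
            return cost, path
        if cost > best.get(node, float("inf")):
            continue                   # stale heap entry
        for nxt, weight in graph.get(node, []):
            new_cost = cost + weight
            if new_cost < best.get(nxt, float("inf")):
                best[nxt] = new_cost
                heapq.heappush(heap, (new_cost, nxt, path + [nxt]))
    return None
```

With edge probabilities 0.5 on `i -> a` and `a -> j`, 0.25 on `i -> b`, 1.0 on `b -> j` and 0.2 on
the direct edge `i -> j`, the path through `a` costs 4 and is preferred over the two alternatives,
which both cost 5.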

For each path $\Pi \in K$ between MLC cells $i$ and $j$, we define the path residence time as
the sum of the residence time of each cell in the path. Let $\Pi_s$ and $\Pi_l$ in $K$ represent the
paths with the shortest and longest path residence time, respectively. $\Pi_s$ is used to derive the
expected earliest arrival time $T_{EA}(i, j)$, while $\Pi_l$ is used to derive the expected latest arrival
time $T_{LA}(i, j)$. So, $T_{EA}(i, j)$ and $T_{LA}(i, j)$ can be expressed, respectively, as



where S max (k) and S m i n (k) represent the average maximum and minimum speed for cell k, 
respectively. S msx (k) and Smi n (&) are provided by the network support based on the observed 
mobile units' speeds. d(m, k, n) is the main distance within cell k given that cells m, k, and 
n are three consecutive cells in the path. The value of d(m, k, n) depends on whether cell 
k is the cell where the call originated, an intermediate cell or the last cell in the path, i.e. 
cell j 



When k is the originating cell, the pdf /y(y) of the distance Y, within cell k as shown in 
Figure 1 1.32, is derived, assuming that the mobile units are evenly spread over a cell area of 




(11.38) 
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do(k, n), if k is the originating cell 

di(m, k, n), if k is the intermediate cell 

du_'(k, n), if m — n 

d\_,{rn, k), if k is the last cell, k = j 



(11.39) 
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Figure 11.32 Distance Y in originating cell k. 



352 MOBILITY MANAGEMENT 

radius R travel along a constant direction within the cell and can exit from the cell from any 
point along the border with cell n. Therefore, the position of the mobile unit is determined by 
the angle v and the distance r from the center of the cell, v is uniformly distributed between 
0 and 2rc\r is uniformly distributed between 0 and R. Since v is uniformly distributed, 
<f> is also uniformly distributed between 0 and jr. Therefore, d (k, n) is equal to the mean 
distance E [Y] of the PDF fy ( v). Based on these assumptions the PDF fy ( v)in a cell where 
the call originates can be obtained using the standard methods as described in [75]: 



When k is an intermediate cell, the PDF f Y (y) of the distance Y, within cell k, as shown in 
Figure 11.33, is derived assuming that the mobile units enter cell k from cell m at any point 
along the arc AB of cell k. This arc is defined by the angles fi\ and fa: The mobile travels 
along a constant direction within the cell and can exit from cell k to n from any point along 
the arc CD of cell k, which is defined by the angles a>\ and a>2- The direction of the mobile 
is indicated by the angle <f>, which is uniformly distributed; dim, k, n) is equal to the mean 
distance E [Y] of the PDF fy (y), which is derived in Aljadhai and Znati [84] (see also the 




which gives 



2R 




(11.40) 



o 



Cell h's border 




Figure 1 1.33 Distance Y in an intermediate cell k. 
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Appendix for details) as 



di(m, k, n) = E[Y] = 
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The mean distance in the last cell in the path is derived as follows: 
d\_{m, k) — ma\d(m, k, q) Vq adjacent to k, q ^ m 



for P2 < w\ 

(11.41) 

(11.42) 
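
The mean distances for the originating and intermediate cells can be checked numerically. The Python sketch below assumes the closed form $8R/(3\pi)$ for the originating cell and, for an intermediate cell with $\beta_1 \ge \omega_2$, the sine expression coded in `d_intermediate`. It compares the latter with a midpoint-rule average of the chord length $2R\sin((\beta - \omega)/2)$ between an entry point at angle $\beta$ and an exit point at angle $\omega$, which is what $Y = 2R\cos\phi$ reduces to when $\phi = (\pi - (\beta - \omega))/2$. Function names and sample angles are illustrative only.

```python
import math


def d_originating(R):
    """Assumed mean distance to the border for a uniformly placed mobile."""
    return 8.0 * R / (3.0 * math.pi)


def d_intermediate(R, b1, b2, w1, w2):
    """Closed-form mean distance for entry arc [b1, b2] and exit arc
    [w1, w2] (radians); this sketch covers only the case b1 >= w2."""
    s = lambda x: math.sin(x / 2.0)
    bracket = s(b2 - w2) - s(b1 - w2) - s(b2 - w1) + s(b1 - w1)
    return 8.0 * R * bracket / ((w2 - w1) * (b2 - b1))


def d_intermediate_numeric(R, b1, b2, w1, w2, n=400):
    """Midpoint-rule average of the chord length 2R*sin((b - w)/2)."""
    total = 0.0
    for i in range(n):
        b = b1 + (i + 0.5) * (b2 - b1) / n
        for j in range(n):
            w = w1 + (j + 0.5) * (w2 - w1) / n
            total += 2.0 * R * math.sin((b - w) / 2.0)
    return total / (n * n)


R = 1.0
closed = d_intermediate(R, 2.0, 3.0, 0.0, 1.0)
numeric = d_intermediate_numeric(R, 2.0, 3.0, 0.0, 1.0)
```

Agreement between `closed` and `numeric` supports the closed form only for the sampled geometry; it does not validate the underlying mobility assumptions.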



The mean distance in the cell k when the path makes a loop within cell k is derived as
follows:

$$d_{LP}(m, k, n) = 2 d_O(k, n) \qquad (11.43)$$

Similarly, the expected latest departure time $T_{LD}(i, j)$ from cell j can be computed as:

$$T_{LD}(i, j) = T_{LA}(i, j) + d(m, k)/S_{min}(k) \qquad (11.44)$$

The estimates of $T_{EA}(i, j)$, $T_{LA}(i, j)$ and $T_{LD}(i, j)$ for a mobile u currently located at cell i
are used to compute the RLI and RRI for each cell $j \in C_i^{MLC}(u)$. The CAC uses these values
to verify the feasibility of supporting u's call in each cell $j \in C_i^{MLC}(u)$.
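
A small Python sketch of how the three time estimates combine into the two intervals. It assumes each path is given as a list of (mean distance, speed) pairs, and it takes the RRI to run from the earliest arrival to the latest departure and the RLI from the earliest to the latest arrival. The function name and the numbers (metres and metres per second) are hypothetical.

```python
def reservation_intervals(path_s, path_l, d_last, s_min_last):
    """Return (RLI, RRI) as (start, end) tuples in seconds.

    path_s: (mean_distance, S_max) per cell on the shortest-residence path.
    path_l: (mean_distance, S_min) per cell on the longest-residence path.
    d_last, s_min_last: mean distance and minimum speed in the target cell.
    """
    t_ea = sum(d / s for d, s in path_s)   # expected earliest arrival
    t_la = sum(d / s for d, s in path_l)   # expected latest arrival
    t_ld = t_la + d_last / s_min_last      # expected latest departure
    return (t_ea, t_la), (t_ea, t_ld)


# Hypothetical two-cell path: 500 m then 1000 m, at 25 m/s (fast) or 20 m/s (slow).
rli, rri = reservation_intervals(
    path_s=[(500.0, 25.0), (1000.0, 25.0)],
    path_l=[(500.0, 20.0), (1000.0, 20.0)],
    d_last=1000.0,
    s_min_last=20.0,
)
```

With these inputs the mobile is expected between 60 s and 75 s, and the bandwidth would be held until 125 s unless the lease expires first.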

A good agreement between the results of the analytical model of the distance, based on 
Equations (11.40) and (11.41), and the simulation results of mobile units traveling along 
the same path, is demonstrated in Aljadhai and Znati [84].



11.4.2.2 Performance example

The MLC CAC scheme is compared with the SC scheme based on the following assumptions 
[78, 84]. 

(1) Each cell covers 1 km along a highway. The highway is covered by 10 cells. Mobile 
units can appear anywhere along a cell with equal probability. 

(2) Call holding time is exponentially distributed with mean $T_H$ = 130 and 180 s.

(3) Total bandwidth of each cell is 40 bandwidth units (BUs). Three types of calls are
used: voice, audio and video, requiring $B_{voice}$ = 1 BU, $B_{audio}$ = 5 BUs and $B_{video}$ =
10 BUs, respectively. The probabilities of each type are $P_{voice}$ = 0.7, $P_{audio}$ = 0.2
and $P_{video}$ = 0.1.

(4) Mobile units may have one of three different speeds: 70, 90 or 105 km/h. The
probability of each speed is 1/3.

(5) In the SC scheme, the time is quantized in time intervals of length 10 s.

(6) A reference scheme, referred to as clairvoyant scheme (CS), is introduced. In this
scheme, the exact behavior of every mobile unit is assumed to be known at the 
admission time. CS reserves bandwidth in exactly the cells that the mobile unit will 
visit and for the exact residence time interval in every cell. Therefore, CS produces 
the maximum utilization and minimum blocking ratio for a specific dropping ratio, 
which is zero in this case. 

Since mobile units can appear anywhere along a cell, the residence time within the initial 
cell (the cell in which the call originates) is selected to be uniformly distributed between 
zero, and a duration equal to cell length/speed. The initial direction probability is assumed 
to be 0.5 for both possible directions, i.e. left and right directions. After the first handoff, the 
direction and position of the call become known and, therefore, it is possible to determine 
the arrival and departure time in other cells. 
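
The assumptions above fix a few derived quantities that are useful when reading the results. The Python sketch below computes the mean bandwidth demand per call, the time to cross one 1 km cell at each speed, and the mean residence time in the initial cell (half the crossing time, averaged over the three equally likely speeds). Variable names are illustrative.

```python
BANDWIDTH_BU = {"voice": 1, "audio": 5, "video": 10}
CALL_PROB = {"voice": 0.7, "audio": 0.2, "video": 0.1}
SPEEDS_KMH = (70, 90, 105)
CELL_LENGTH_KM = 1.0

# Mean bandwidth requested per call, in BUs.
mean_bu = sum(CALL_PROB[t] * BANDWIDTH_BU[t] for t in BANDWIDTH_BU)

# Time to cross a full cell at each speed, in seconds.
crossing_s = {v: CELL_LENGTH_KM / v * 3600.0 for v in SPEEDS_KMH}

# Residence time in the initial cell is uniform on [0, crossing time],
# so its mean is half the crossing time; speeds are equally likely.
mean_initial_residence_s = sum(t / 2.0 for t in crossing_s.values()) / len(crossing_s)
```

A mean demand of 2.7 BUs against 40 BUs per cell means a cell carries roughly 15 average calls at full load, and every mobile crosses a cell in under a minute, so a call with a 130 s or 180 s mean holding time typically spans several handoffs.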

Figure 11.34 shows the blocking ratio and Figure 11.35 the utilization of the three schemes
as functions of the call arrival rate. As expected, CS produces the maximum utilization
and minimum blocking ratio assuming a zero dropping ratio condition. The utilization in
the MLC CAC is better than in the SC once the call arrival rate exceeds 0.06, for all mean
call holding times. Moreover, the call blocking in the MLC CAC is much lower than that of
the SC scheme. This behavior shows that, by simply reserving bandwidth between the earliest
arrival time and latest departure time at a cell, the MLC scheme accepts more calls and
increases utilization. Moreover, the increase in the utilization in the SC scheme is very slow
when the call arrival rate is greater than 0.06. The reason for this behavior is that the SC
bases its estimates on the exponential holding time PDF, which decreases as time increases.
Therefore, the bandwidth estimates decrease as the distance to a future cell increases.
As a result, the chance of dropping the call in subsequent cells is increased unless the
minimum survivability estimate is increased. In Figures 11.34 and 11.35, the MLC CAC
always outperforms the SC regardless of the mean holding time.




Figure 11.34 Blocking ratio in three systems.

Figure 11.35 Utilization in three systems.

APPENDIX: DISTANCE CALCULATION IN AN INTERMEDIATE CELL



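Given an intermediate cell on the path of the mobile unit, the PDF of the distance can be
derived based on the angles $\beta$ and $\omega$, as shown in Figure 11.33. The entry point to the
cell is assumed to be the point E, as shown in Figure 11.33. The mobile unit moves in a
direction evenly distributed leading to the next cell (Figure 11.33), where $2\psi$ is the range of
the direction angle $\phi$. The angle $\beta$ is uniformly distributed between $\beta_1$ and $\beta_2$. Therefore,

$$f_\beta(\beta) = \begin{cases} \dfrac{1}{\beta_2 - \beta_1}, & \beta_1 \le \beta \le \beta_2 \\ 0, & \text{elsewhere} \end{cases}$$

Since $\phi$ is evenly distributed, we have

$$f_\phi(\phi \mid \beta) = \begin{cases} \dfrac{2}{\psi_2 - \psi_1}, & \dfrac{\psi_1}{2} \le \phi \le \dfrac{\psi_2}{2} \\ 0, & \text{elsewhere} \end{cases}$$

The CDF of the above function can be represented as

$$F_\phi(\phi \mid \beta) = \begin{cases} 0, & \phi < \dfrac{\psi_1}{2} \\ \dfrac{2\phi - \psi_1}{\psi_2 - \psi_1}, & \dfrac{\psi_1}{2} \le \phi \le \dfrac{\psi_2}{2} \\ 1, & \phi > \dfrac{\psi_2}{2} \end{cases} \qquad (11.47)$$

where $\psi_i$ is defined as

$$\psi_i = \begin{cases} \pi - (\beta - \omega_i), & \beta \ge \omega_2 \\ \pi - (\omega_i - \beta), & \beta < \omega_1 \end{cases} \quad \text{and } i = 1, 2. \qquad (11.48)$$

If Y is the distance traveled from E to X, as in Figure 11.33, then $Y = 2R \cos\phi$,
and Equation (11.47) gives the following four cases.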



Case 1: $\psi_2/2 > \psi_1/2 > 0$

The PDF of Y is

(11.50)

(11.51)

(11.52)

The mean distance $E[Y]$ for cell i, $d(m, i, j)$, for a mobile path entering cell i from cell
m and exiting cell i to cell j is

$$d(m, i, j) = E[Y] = \int_{\beta_1}^{\beta_2} E[Y \mid \beta]\, f_\beta(\beta)\, d\beta \qquad (11.53)$$

Case 2: $\psi_1/2 < \psi_2/2 < 0$

The PDF of Y is

The mean distance $E[Y]$ for cell i, $d(m, i, j)$, for a mobile path entering cell i from cell
m and exiting cell i to cell j is

$$d(m, i, j) = E[Y] = \int_{\beta_1}^{\beta_2} E[Y \mid \beta]\, f_\beta(\beta)\, d\beta \qquad (11.59)$$

The mean distance $E[Y \mid \beta]$ is

The mean distance $E[Y]$ for cell i, $d(m, i, j)$, for a mobile path entering cell i from cell m
and exiting cell i to cell j is

Case 4: $|\psi_1/2| < \psi_2/2$

The mean distance $E[Y]$ for cell i, $d(m, i, j)$, for a mobile path entering cell i from cell m
and exiting cell i to cell j is

$$d(m, i, j) = E[Y] = \int_{\beta_1}^{\beta_2} E[Y \mid \beta]\, f_\beta(\beta)\, d\beta = \int_{\beta_1}^{\beta_2} \frac{1}{\beta_2 - \beta_1} \cdot \frac{4R}{\psi_2 - \psi_1} \left[ \sin\frac{\psi_2}{2} - \sin\frac{\psi_1}{2} \right] d\beta \qquad (11.79)$$

$$d(m, i, j) = \begin{cases} \dfrac{8R}{(\omega_2 - \omega_1)(\beta_2 - \beta_1)} \left[ \sin\dfrac{\beta_2 - \omega_2}{2} - \sin\dfrac{\beta_1 - \omega_2}{2} - \sin\dfrac{\beta_2 - \omega_1}{2} + \sin\dfrac{\beta_1 - \omega_1}{2} \right], & \beta_1 \ge \omega_2 \\ \dfrac{8R}{(\omega_2 - \omega_1)(\beta_2 - \beta_1)} \left[ \sin\dfrac{\omega_1 - \beta_1}{2} - \sin\dfrac{\omega_1 - \beta_2}{2} - \sin\dfrac{\omega_2 - \beta_1}{2} + \sin\dfrac{\omega_2 - \beta_2}{2} \right], & \beta_2 < \omega_1 \end{cases} \qquad (11.80)$$



REFERENCES 



[1] A. Acampora, Wireless ATM: A perspective on issues and prospects, IEEE Person. 
Commun., vol. 3, 1996, pp. 8-17. 

[2] A. Acampora, An architecture and methodology for mobile-executed handoff in cel- 
lular ATM networks, IEEE J. Select. Areas Commun., vol. 12, 1994, pp. 1365-1375. 

[3] A. Acharya, J. Li, F. Ansari and D. Raychaudhuri, Mobility support for IP over wireless 
ATM, IEEE Commun. Mag., vol. 36, 1998, pp. 84-88. 

[4] A. Acharya, J. Li, B. Rajagopalan and D. Raychaudhuri, Mobility management in
wireless ATM networks, IEEE Commun. Mag., vol. 35, 1997, pp. 100-109.

[5] I.F. Akyildiz, J. McNair, J.S.M. Ho, H. Uzunalioglu and W. Wang, Mobility management
in current and future communication networks, IEEE Network Mag., vol. 12,
1998, pp. 39-49.

[6] I.F. Akyildiz and J.S.M. Ho, On location management for personal communications
networks, IEEE Commun. Mag., vol. 34, 1996, pp. 138-145.

[7] I.F. Akyildiz, J.S.M. Ho and Y.B. Lin, Movement-based location update and selective
paging for PCS networks, IEEE/ACM Trans. Networking, vol. 4, 1996, pp. 629-636.

[8] I.F. Akyildiz and J.S.M. Ho, Dynamic mobile user location update for wireless PCS
networks, ACM-Baltzer J. Wireless Networks, vol. 1, no. 2, 1995, pp. 187-196.

[9] B. Akyol and D. Cox, Re-routing for handoff in a wireless ATM network, IEEE
Personal Commun., vol. 3, 1996, pp. 26-33.

[10] V. Anantharam, M.L. Honig, U. Madhow and V.K. Wei, Optimization of a database 
hierarchy for mobility tracking in a personal communications network, Performance 
Eval, vol. 20, no. 1-3, 1994, pp. 287-300. 

[11] E. Ayanoglu, K. Eng and M. Karol, Wireless ATM: Limits, challenges, and proposals,
IEEE Personal Commun., vol. 3, 1996, pp. 19-34.

[12] A. Bar-Noy, I. Kessler and M. Sidi, Topology-based tracking strategies for per- 
sonal communication networks, ACM-Baltzer J. Mobile Networks and Applications 
(MONET), vol. 1, no. 1, 1996, pp. 49-56. 

[13] A. Bar-Noy, I. Kessler, and M. Sidi, Mobile users: to update or not to update? ACM- 
Baltzer J. Wireless Networks, vol. 1, no. 2, 1995, pp. 175-186. 

[14] S. Dolev, D.K. Pradhan and J.L. Welch, Modified tree structure for location man- 
agement in mobile environments, Comput. Commun., vol. 19, no. 4, 1996, pp. 335- 
345. 






[15] F. Dosiere, T. Zein, G. Maral and J.P. Boutes, A model for the handover traffic in low 
earth-orbiting (LEO) satellite networks for personal communications, Int. J. Satellite 
Commun., vol. 11, 1993, pp. 145-149. 

[16] N. Efthymiou, Y.F. Hu and R. Sheriff, Performance of inter-segment handover protocols
in an integrated space/terrestrial-UMTS environment, IEEE Trans. Veh. Technol.,
vol. 47, 1998, pp. 1179-1199.

[17] E. Guarene, P. Fasano and V. Vercellone, IP and ATM integration perspectives, IEEE 
Commun. Mag., vol. 36, 1998, pp. 74-80. 

[18] J.S.M. Ho and I.F. Akyildiz, Dynamic hierarchical data-base architecture for location
management in PCS networks, IEEE/ACM Trans. Networking, vol. 5, no. 5, 1997,
pp. 646-661.

[19] J.S.M. Ho and I.F. Akyildiz, Local anchor scheme for reducing signaling cost in personal
communication networks, IEEE/ACM Trans. Networking, vol. 4, no. 5, 1996,
pp. 709-726.

[20] J.S.M. Ho and I.F. Akyildiz, A mobile user location update and paging mecha- 
nism under delay constraints, ACM-Baltzer J. Wireless Networks, vol. 1, no. 4, 1995, 
pp. 413-425. 

[21] D. Hong and S. Rappaport, Traffic model and performance analysis for cellular mobile
radio telephone systems with prioritized and nonprioritized handoff procedures, IEEE
Trans. Veh. Technol., vol. 35, 1986, pp. 77-92.

[22] L.-R. Hu and S. Rappaport, Adaptive location management scheme for global personal
communications, in Proc. IEEE Communications, vol. 144, no. 1, 1997, pp. 54-60.

[23] C.-L. I, G.P. Pollini and R.D. Gitlin, PCS mobility management using the reverse
virtual call setup algorithm, IEEE/ACM Trans. Networking, vol. 5, 1997, pp. 13-24.

[24] R. Jain and Y.B. Lin, An auxiliary user location strategy employing forwarding pointers
to reduce network impact of PCS, ACM-Baltzer J. Wireless Networks, vol. 1, no. 2,
1995, pp. 197-210.

[25] R. Jain, Y.B. Lin and S. Mohan, A caching strategy to reduce network impacts of PCS,
IEEE J. Select. Areas Commun., vol. 12, 1994, pp. 1434-1444.

[26] D. Johnson and D. Maltz, Protocols for adaptive wireless and mobile networking,
IEEE Personal Commun., vol. 3, 1996, pp. 34-42.

[27] S.J. Kim and C.Y. Lee, Modeling and analysis of the dynamic location registration and
paging in microcellular systems, IEEE Trans. Veh. Technol., vol. 45, 1996, pp. 82-89.

[28] P. Krishna, N. Vaidya and D.K. Pradhan, Static and adaptive location management in
mobile wireless networks, Comput. Commun., vol. 19, no. 4, 1996, pp. 321-334.

[29] B. Li, S. Jiang and D. Tsang, Subscriber-assisted handoff support in multimedia PCS,
Mobile Comput. Commun. Rev., vol. 1, no. 3, 1997, pp. 29-36.

[30] Y.B. Lin, Paging systems: network architectures and interfaces, IEEE Network, vol.
11, 1997, pp. 56-61.

[31] Y.B. Lin, Reducing location update cost in a PCS network, IEEE/ACM Trans. Net- 
working, vol. 5, 1997, pp. 25-33. 

[32] Y.-B. Lin and I. Chlamtac, Heterogeneous personal communication services: Integra- 
tion of PCS systems, IEEE Commun. Mag., vol. 34, 1996, pp. 106-113. 

[33] Y.B. Lin, F.C. Li, A. Noerpel and LP. Kun, Performance modeling of multitier PCS 
system, Int. J. Wireless Information Networks, vol. 3, no. 2, 1996, pp. 67-78. 

[34] Y.B. Lin and S.K. DeVries, PCS network signaling using SS7, IEEE Commun. Mag., 
vol. 33, 1995, pp. 44-55. 






[35] Y.B. Lin, Determining the user locations for personal communications services networks,
IEEE Trans. Veh. Technol., vol. 43, 1994, pp. 466-473.

[36] J. Markoulidakis, G. Lyberopoulos, D. Tsirkas and E. Sykas, Mobility modeling in
third-generation mobile telecommunications systems, IEEE Personal Commun., vol. 4,
1997, pp. 41-56.

[37] M. Marsan, C.-F. Chiasserini, R. Lo Cigno, M. Munafo and A. Fumagalli, Local and 
global handovers for mobility management in wireless ATM networks, IEEE Personal 
Commun., vol. 4, 1997, pp. 16-24. 

[38] A.R. Modarressi and R.A. Skoog, Signaling system 7: a tutorial, IEEE Commun. Mag.,
vol. 28, 1990, pp. 19-35.

[39] S. Mohan and R. Jain, Two user location strategies for personal communications 
services, IEEE Personal Commun., vol. 1, 1994, pp. 42-50. 

[40] R. Pandya, D. Grillo, E. Lycksell, P. Mieybegue, H. Okinaka and M. Yabusaki, IMT-
2000 standards: Network aspects, IEEE Personal Commun., 1997, pp. 20-29.

[41] C.E. Perkins, Mobile IP: Design Principles and Practices, Addison-Wesley Wireless
Communications Series. Reading, MA: Addison Wesley, 1998.

[42] C.E. Perkins, IP mobility support version 2, Internet Engineering Task Force, Internet 
draft, draft-ietf-mobileip-v2-00.text, November 1997. 

[43] C. Perkins, Mobile IP, IEEE Commun. Mag., vol. 35, 1997, pp. 84-99. 

[44] C. Perkins, Mobile-IP local registration with hierarchical foreign agents, Internet En- 
gineering Task Force, Internet draft; draft-perkins-mobileip-hierfa-00.txt, February 
1996. 

[45] B. Rajagopalan, An overview of ATM forum's wireless ATM standards activities,
ACM Mobile Comput. Commun. Rev., vol. 1, no. 3, 1997.

[46] B. Rajagopalan, Mobility management in integrated wireless-ATM networks, ACM-
Baltzer J. Mobile Networks Applicat. (MONET), vol. 1, no. 3, 1996, pp. 273-285.

[47] E. del Re, A coordinated European effort for the definition of a satellite integrated
environment for future mobile communications, IEEE Commun. Mag., vol. 34, 1996,
pp. 98-104.

[48] C. Rose, State-based paging/registration: A greedy technique, IEEE Trans. Veh.
Technol., vol. 48, 1999, pp. 166-173.

[49] C. Rose and R. Yates, Ensemble polling strategies for increased paging capacity in
mobile communication networks, ACM/Baltzer Wireless Networks J., vol. 3, no. 2,
1997, pp. 159-167.

[50] C. Rose and R. Yates, Location uncertainty in mobile networks: a theoretical framework,
IEEE Commun. Mag., vol. 35, 1997, pp. 94-101.

[51] C. Rose, Minimizing the average cost of paging and registration: a timer-based method,
ACM-Baltzer J. Wireless Networks, vol. 2, no. 2, 1996, pp. 109-116.

[52] C. Rose and R. Yates, Minimizing the average cost of paging under delay constraints, 
ACM-Baltzer J. Wireless Networks, vol. 1, no. 2, 1995, pp. 211-219. 

[53] S. Tabbane, Location management methods for 3rd generation mobile systems, IEEE 
Commun. Mag., vol. 35, 1997, pp. 72-78. 

[54] C.-K. Toh, A unifying methodology for handovers of heterogeneous connections in
wireless ATM networks, ACM SIGCOMM Comput. Commun. Rev., vol. 27, no. 1,
1997, pp. 12-30.

[55] C.-K. Toh, A hybrid handover protocol for local area wireless ATM networks, ACM- 
Baltzer J. Mobile Networks Applicat. (MONET), vol. 1, no. 3, 1996, pp. 313-334. 






[56] M. Veeraraghavan and G. Dommetry, Mobile location management in ATM networks,
IEEE J. Select. Areas Commun., vol. 15, 1997, pp. 1437-1454.

[57] M. Veeraraghavan, M. Karol and K. Eng, Mobility and connection management in a
wireless ATM LAN, IEEE J. Select. Areas Commun., vol. 15, 1997, pp. 50-68.

[58] J.Z. Wang, A fully distributed location registration strategy for universal personal
communication systems, IEEE J. Select. Areas Commun., vol. 11, 1993, pp. 850-860.

[59] M. Werner, C. Delucchi, H.-J. Vogel, G. Maral and J.-J. De Ridder, ATM-based
routing in LEO/MEO satellite networks with intersatellite links, IEEE J. Select. Areas
Commun., vol. 15, 1997, pp. 69-82.

[60] M. Werner, A. Jahn, E. Lutz and A. Bottcher, Analysis of system parameters for
LEO/ICO-satellite communication networks, IEEE J. Select. Areas Commun., vol. 13,
1995, pp. 371-381.

[61] R. Yates, C. Rose, B. Rajagopalan and B. Badrinath, Analysis of a mobile-assisted
adaptive location management strategy, ACM-Baltzer J. Mobile Networks Applicat.
(MONET), vol. 1, no. 2, 1996, pp. 105-112.

[62] A. Yener and C. Rose, Highly mobile users and paging: Optimal polling strategies,
IEEE Trans. Veh. Technol., vol. 47, 1998, pp. 1251-1257.

[63] D.R. Wilson, Signaling system no. 7, IS-41 and cellular telephony networking, Proc.
IEEE, vol. 80, 1992, pp. 644-652.

[64] C. Perkins and D. Johnson, Route optimization in mobile IP, Internet Engineering Task
Force, Internet draft; draft-ietf-mobileip-optom-07.txt, 20 November 1997.

[65] P. Calhoun and C. Perkins, Tunnel establishment protocol, Internet Engineering Task
Force, Internet draft; draft-ietfmobileip-calhoun-tep-00.txt, 21 November 1997.

[66] G. Troxel and L. Sanchez, Rapid authentication for mobile IP, Internet Engineering
Task Force, Internet draft; draft-ietf-mobileip-ra-00.txt, December 1997.

[67] R. Yuan, S.K. Biswas, L.J. French, J. Li and D. Raychaudhuri, A signaling and
control architecture for mobility support, ACM-Baltzer J. Mobile Networks Applicat.
(MONET), vol. 1, no. 3, 1996, pp. 287-298.

[68] M. Johnsson, Simple mobile IP, Internet Engineering Task Force, Internet-draft,
Ericsson; draft-ietf-mobileip-simple-00.txt, March 1999.

[69] C.B. Becker, B. Patil and E. Qaddoura, IP mobility architecture framework, Internet
Engineering Task Force, Internet draft; draft-ietf-mobileip-ipm-arch-00.txt, March
1999.

[70] J.M. Benedetto, Economy-class ion-defying IC's in orbit, IEEE Spectrum, vol. 35, 
1998, pp. 36-41. 

[71] E. Lutz, Issues in satellite personal communication systems, ACM J. Wireless Net- 
works, vol. 4, no. 2, 1998, pp. 109-124. 

[72] B. Miller, Satellite free mobile phone, IEEE Spectrum, vol. 35, 1998, pp. 26-35. 

[73] F. Ananasso and F.D. Priscoli, Issues on the evolution toward satellite personal
communication networks, in Proc. GLOBECOM'95, London, pp. 541-545.

[74] E. del Re, R. Fantacci and G. Giambene, Call blocking performance for dynamic 
channel allocation technique in future mobile satellite systems, Proc. Inst. Elect. Eng., 
Commun., vol. 143, no. 5, 1996, pp. 289-296. 

[75] D. Hong and S.S. Rappaport, Traffic model and performance analysis for cellular mobile
radio telephone systems with prioritized and non-prioritized handoff procedures,
IEEE Trans. Vehic. Technol., vol. VT-35, no. 3, 1986, pp. 77-92.






[76] CEAS Technical Report no. 773, 1 June 1999, College of Engineering and Applied
Sciences, State University of New York, Stony Brook, NY, USA.

[77] A. Acampora and M. Naghshineh, Control and quality-of-service provisioning in high
speed microcellular networks, IEEE Person. Commun., vol. 1, 1994, pp. 36-42.

[78] D. Levine, I. Akyildiz and M. Naghshineh, Resource estimation and call admission
algorithm for wireless multimedia using the shadow cluster concept, IEEE/ACM Trans.
Networking, vol. 5, no. 1, 1997, pp. 1-12.

[79] V. Bharghavan and J. Mysore, Profile based next-cell prediction in indoor wireless
LANs, in Proc. IEEE Singapore Int. Conf. Networking, April 1997, pp. 147-152.

[80] G. Liu and G.Q. Maguire Jr, Transmit activity and intermodal route planner, Technical
Report, Royal Institute of Technology, Stockholm, February 1995.

[81] P. Bahl, T. Liu and I. Chlamtac, Mobility modeling, location tracking, and trajectory
prediction in wireless ATM networks, IEEE J. Select. Areas Commun., vol. 16, 1998,
pp. 922-937.

[82] A. Aljadhai and T. Znati, A framework for call admission control and QoS support in
wireless networks, in Proc. INFOCOM'99, vol. 3, New York, March 1999, pp. 1014-
1026.

[83] A. Aljadhai and T. Znati, A predictive bandwidth allocation scheme for multimedia 
wireless networks, in Proc. Conf. Communication Networks and Distributed Systems 
Modeling and Simulation, Phoenix, AZ, January 1997, pp. 95-100. 

[84] A.R. Aljadhai and T.F. Znati, Predictive mobility support for QoS provisioning in
mobile wireless environments, IEEE J. Selected Areas Commun., vol. 19, no. 10, 2001,
pp. 1915-1931.

[85] A. Talukdar, B.R. Badrinath and A. Acharya, On accommodating mobile hosts in an
integrated services packet network, in Proc. IEEE INFOCOM, vol. 3, Kobe, Japan,
April 1997, pp. 1046-1053.

[86] S.E. Dreyfus, An appraisal of some shortest-path algorithms, Opns. Res., vol. 17, 1969,
pp. 395-412.



12 



Adaptive Resource 
Management 



12.1 CHANNEL ASSIGNMENT SCHEMES 

A given radio spectrum (or bandwidth) can be divided into a set of disjoint or noninterfering 
radio channels. All such channels can be used simultaneously while maintaining an accept- 
able received radio signal. In order to divide a given radio spectrum into such channels, 
many techniques such as frequency division (FDMA/OFDMA), time division (TDMA/TH 
UWB), or code division (CDMA/MC CDMA) can be used, as discussed in Chapter 2. In 
FDMA, the spectrum is divided into disjoint frequency bands, whereas in TDMA the chan- 
nel separation is achieved by dividing the usage of the channel into disjoint time periods 
called time slots. In CDMA, the channel separation is achieved by using different spreading 
codes. The major criterion in determining the number of channels with a certain quality that
can be used for a given wireless spectrum is the level of received signal quality that can be 
achieved in each channel. 

If $S_i(k)$ is the set (i) of wireless terminals that communicate with each other using the
same channel k, then due to the physical characteristics of the radio environment, the same 
channel k can be reused simultaneously by another set j if the members of sets i and j 
are spaced sufficiently apart. All such sets which use the same channel are referred to as 
co-channel sets or simply co-channels. The minimum distance at which co-channels can 
be reused with acceptable interference is called the 'co-channel reuse distance' D. For 
illustration see Figure 12.1. 

Figure 12.1 Examples of frequency reuse factor. (a) Uniform hexagonal cellular layout
with reuse 7. (b) Macrocell layout with reuse 3. (c) Street microcell layout
with reuse 2.

This is possible because, due to propagation path loss in the radio environment, the
average power received from a transmitter at distance d is proportional to $P_T d^{-\alpha}$, where $\alpha$
is a number in the range 3-5 depending on the physical environment, and $P_T$ is the average
transmitter power. Thus, by adjusting the transmitter power level and/or the distance d
between co-channels, a channel can be reused by a number of co-channels if the CIR
(carrier-to-interference ratio) in each co-channel is above the required value $CIR_{min}$. In
general, for a wireless station R at distance d from a transmitter T, using the same reference
radio channel as the set of other transmitters $T_i$ at distances $d_i$ from R, we have



$$CIR = \frac{P d^{-\alpha}}{\sum_i P_i d_i^{-\alpha} + N_0} \qquad (12.1)$$

where $N_0$ represents the background noise. To achieve a certain level of CIR at the reference
station R, different methods can be used.
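
As a rough illustration of the CIR expression, the following Python sketch evaluates it for a hypothetical layout. The function names, the default path-loss exponent and the choice of six equal-power interferers at a common distance D are assumptions made for the example, not values from the text.

```python
import math


def cir(p, d, interferers, alpha=4.0, n0=0.0):
    """Carrier-to-interference ratio at the reference station.

    p, d: power and distance of the desired transmitter.
    interferers: iterable of (p_i, d_i) pairs for co-channel transmitters.
    alpha: path-loss exponent; n0: background noise power.
    """
    signal = p * d ** (-alpha)
    interference = sum(p_i * d_i ** (-alpha) for p_i, d_i in interferers)
    return signal / (interference + n0)


def to_db(ratio):
    """Convert a power ratio to decibels."""
    return 10.0 * math.log10(ratio)


# Hypothetical layout: six equal-power co-channel interferers at distance D.
R, D = 1.0, 4.0
ratio = cir(1.0, R, [(1.0, D)] * 6, alpha=4.0)
```

With noise neglected, the ratio depends only on D/R and the exponent, which is why increasing the separation or reducing the interferers' power both raise the CIR.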

In general we can represent the residual interference signal power as 



$$I_r(d, \theta, f, t) = (1 - C_f)(1 - C_p)(1 - C_\theta)(1 - C_t)\, I(d, \theta, f, t) \qquad (12.1a)$$

where $C_f$, $C_p$, $C_\theta$ and $C_t$ are frequency, propagation (distance + shadowing + fading), angle
(space) and time isolation coefficients, respectively. $I(d, \theta, f, t)$ is the interference signal
power without any suppression techniques [1]. For perfect isolation at least one of these
coefficients is equal to one and the interference has no influence on the received signal. In
practice, it is rather difficult and economically impractical to reach the point where $C_i = 1$.
Instead, the product $(1 - C_f)(1 - C_p)(1 - C_\theta)(1 - C_t)$ depending on these coefficients should be
kept as low as possible with an affordable effort measured by cost, power consumption and
physical size of the hardware required for the solution.

Coefficient $C_f$ is related to frequency assignment in the cellular network while coefficient
$C_p$ is related to the propagation conditions. $C_f = 1$ if the interfering signal frequency is
different from the frequency of the useful signal. $C_p = 1$ if, due to propagation losses, the
interfering signal cannot reach the site of the useful reference signal. In general, the same
frequency can be used in two cells only if the propagation losses between the two cells are
high enough so that the interfering signals are attenuated to the acceptable level. Coefficient
$C_\theta$ is related to antenna beamforming and the possibilities of reducing the interference level
by spatial filtering. Finally, interference cancellation and equalization in time domain can
also be used to reduce the level of interference.
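
The way the four isolation coefficients combine can be sketched in a few lines of Python. The function name and the sample coefficient values are hypothetical; the code only evaluates the product of the four factors.

```python
def residual_interference(i_raw, c_f=0.0, c_p=0.0, c_theta=0.0, c_t=0.0):
    """Residual interference power after frequency, propagation, angle
    and time isolation; each coefficient must lie in [0, 1]."""
    for c in (c_f, c_p, c_theta, c_t):
        if not 0.0 <= c <= 1.0:
            raise ValueError("isolation coefficients must lie in [0, 1]")
    return (1 - c_f) * (1 - c_p) * (1 - c_theta) * (1 - c_t) * i_raw


# Hypothetical values: strong propagation isolation plus moderate beamforming.
partial = residual_interference(1.0, c_p=0.9, c_theta=0.5)

# Perfect frequency isolation removes the interference entirely.
none_left = residual_interference(1.0, c_f=1.0, c_p=0.3)
```

Because the factors multiply, moderate isolation in two domains can be as effective as very high isolation in one, which is the economic argument for spreading the effort.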

In this chapter we will focus on two methods for reducing the interference level. First, the 
distance between interfering stations using the co-channel and the reference station R can 
be increased to reduce the co-channel interference level. Many channel allocation schemes 
are based on this idea of physical separation. Another way to improve the CIR at R is
to reduce the interfering powers transmitted from interfering stations and/or to increase
the desired signal's power level P. This is the idea behind power control schemes. These
two methods represent the underlying concepts for channel assignment algorithms in cellular
systems. Each of these algorithms uses a different method to achieve $CIR_{min}$ at each mobile
terminal by separating co-channels and/or by adjusting the transmitter power.

12.1.1 Different channel allocation schemes 

Channel allocation schemes can be divided into a number of different categories depending 
on the comparison basis. For example, when channel assignment algorithms are compared 
based on the way in which co-channels are separated, they can be divided into fixed channel 
allocation (FCA), dynamic channel allocation (DCA), and hybrid channel allocation (HCA). 

In FCA schemes, the area is partitioned into a number of cells, and a number of channels 
are assigned to each cell according to some reuse pattern, depending on the desired signal 
quality. In DCA, all channels are placed in a pool and are assigned to new calls as needed
such that the $CIR_{min}$ criterion is satisfied. At the cost of higher complexity, DCA schemes provide
flexibility and traffic adaptability. However, DCA strategies are less efficient than FCA
under high load conditions. To overcome this drawback, HCA techniques were designed
by combining FCA and DCA schemes. Channel assignment schemes can be implemented
in many different ways. For example, a channel can be assigned to a radio cell based on
the coverage area of the radio cell and its adjacent cells such that the $CIR_{min}$ is maintained
with high probability in all radio cells. Channels could also be assigned by taking the local
CIR measurements of the mobile's and base station's receiver into account. That is, instead 
of allocating a channel blindly to a cell based on worst-case conditions (such as letting 
co-channels be located at the closest boundary), a channel can be allocated to a mobile 
based on its local CIR measurements [2, 3].

Channel assignment schemes can be implemented in centralized or distributed fashion. In
the centralized schemes the channel is assigned by a central controller, whereas in distributed 
schemes a channel is selected either by the local base station of the cell from which the call 
is initiated or selected autonomously by the mobile. In a system with cell-based control, 
each base station keeps information about the current available channels in its vicinity. Here 
the channel availability information is updated by exchange of status information between 
base stations. In autonomously organized distributed schemes, the mobile chooses a channel 
based on its local CIR measurements without the involvement of a central call assignment 
entity. This scheme is simpler but less efficient. The channel assignment based on local 
assignment can be done for both FCA and DCA schemes. 

12.1.2 Fixed channel allocation 

In the FCA strategy a set of nominal channels is permanently allocated to each cell for its 
exclusive use. Here a definite relationship is assumed between each channel and each cell, 
in accordance with co-channel reuse constraints [4-8]. 

The total number of available channels in the system C is divided into sets, and the 
minimum number of channel sets N (reuse factor) required to serve the entire coverage 
area is related to the reuse distance $\sigma$ as follows [5]: $N = \sigma^2/3$, for hexagonal cells. 

Here $\sigma$ is defined as $D/R$, where R is the radius of the cell and D is the physical 
distance between the two cell centres [4]. N can assume only the integer values 3, 4, 7, 
9, . . . , as generally presented by the series $(i + j)^2 - ij$, with i and j being integers [4]. 
Figure 12.1(a) and (b) give the allocation of channel sets to cells for N = 3 and N = 7, 
respectively. 
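
As a quick check of the relations above, the following sketch enumerates the reuse factors produced by the series $(i + j)^2 - ij$ and converts a reuse factor into the normalized reuse distance through $N = \sigma^2/3$. The function names are illustrative only and do not come from the cited references; the trivial case N = 1 is skipped because the text lists the values starting from 3.

```python
import math


def reuse_factors(max_shift):
    """Distinct N = (i + j)^2 - i*j for integer shifts 0 <= j <= i <= max_shift.

    The returned list is complete for N < (max_shift + 1)^2.
    """
    values = set()
    for i in range(1, max_shift + 1):
        for j in range(0, i + 1):
            n = (i + j) ** 2 - i * j
            if n > 1:  # skip the trivial single-set case N = 1
                values.add(n)
    return sorted(values)


def reuse_distance_ratio(n):
    """Normalized reuse distance sigma = D/R obtained from N = sigma^2 / 3."""
    return math.sqrt(3 * n)


print(reuse_factors(3)[:4])
print(round(reuse_distance_ratio(7), 2))
```

For N = 3 the normalized reuse distance is exactly 3 cell radii, and it grows with the square root of N for larger reuse factors.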

In the simple FCA strategy, the same number of nominal channels is allocated to each 
cell. This uniform channel distribution is efficient if the traffic distribution of the system is 
also uniform. In that case, the overall average blocking probability of the mobile system is 
the same as the call blocking probability in a cell. Because traffic in cellular systems can be 
nonuniform with temporal and spatial fluctuations, a uniform allocation of channels to cells 
may result in high blocking in some cells, while others might have a sizeable number of 
spare channels. This could result in poor channel utilization. So, the number of channels in 
a cell can match the load in it by nonuniform channel allocation [9, 10] or static borrowing 
[11,12]. 
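
The benefit of matching channels to load can be illustrated with a small numerical sketch. It assumes that each cell behaves as an independent Erlang B loss system, which is a modelling assumption introduced here and not a statement from the cited references; the two-cell loads and channel splits are hypothetical, and reuse constraints between cells are ignored.

```python
def erlang_b(channels, offered_load):
    """Erlang B blocking probability computed with the standard recursion."""
    blocking = 1.0
    for n in range(1, channels + 1):
        blocking = offered_load * blocking / (n + offered_load * blocking)
    return blocking


def average_blocking(allocation, loads):
    """Traffic-weighted average blocking probability over all cells."""
    total = sum(loads)
    return sum(a * erlang_b(c, a) for c, a in zip(allocation, loads)) / total


loads = [4.0, 12.0]  # hypothetical offered loads in Erlang
uniform = average_blocking([10, 10], loads)     # same number of channels per cell
nonuniform = average_blocking([6, 14], loads)   # more channels for the loaded cell
print(f"uniform: {uniform:.3f}, nonuniform: {nonuniform:.3f}")
```

With the same total of 20 channels, shifting channels towards the heavily loaded cell lowers the traffic-weighted average blocking in this toy setting.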

In nonuniform channel allocation the number of nominal channels allocated to each 
cell depends on the expected traffic profile in that cell. Thus, heavily loaded cells are 
assigned more channels than lightly loaded ones. In Zhang and Yum [9] an algorithm, 
called nonuniform compact pattern allocation, is proposed for allocating channels to cells 
according to their traffic distributions. The technique attempts to allocate channels to cells 
in such a way that the average blocking probability in the entire system is minimized. A 
similar technique for nonuniform channel allocation is also employed in the algorithms 
proposed in Oh et al. [10]. 

Simulation results in Zhang and Yum [9] show that the blocking probability using nonuni- 
form compact pattern allocation is always lower than the blocking probability of uniform 
channel allocation. Also, for the same blocking probability, the system can carry on average 
10 % (maximum 22 %) more traffic with the use of the nonuniform pattern allocation [9]. 

In the static borrowing schemes proposed in References [11, 12], unused channels from 
lightly loaded cells are reassigned to heavily loaded ones at distances greater than the 
minimum reuse distance $\sigma$. Although in static borrowing schemes channels are permanently 
assigned to cells, the number of nominal channels assigned in each cell may be reassigned 
periodically according to spatial inequities in the load. This can be done in a scheduled or 
predictive manner, with changes in traffic known in advance or based on measurements, 
respectively. 

12.1.3 Channel borrowing schemes 

In a channel borrowing scheme, an acceptor cell that has used all its nominal channels 
can borrow free channels from its neighboring cells (donors) to accommodate new calls. A 
channel can be borrowed by a cell if the borrowed channel does not interfere with existing 
calls. When a channel is borrowed, several other cells are prohibited from using it. This 
is called channel locking. In contrast to static borrowing, channel borrowing strategies deal 
with short-term allocation of borrowed channels to cells: once a call is completed, the 
borrowed channel is returned to its nominal cell. The proposed channel borrowing schemes 
differ in the way a free channel is selected from a donor cell to be borrowed by an acceptor 
cell. 

The channel borrowing schemes can be divided into simple and hybrid. In simple channel 
borrowing schemes, any nominal channel in a cell can be borrowed by a neighboring cell 
for temporary use. In hybrid channel borrowing strategies, the set of channels assigned to 
each cell is divided into two subsets, A (standard or local channels) and B (nonstandard or 
channels that can be borrowed). Subset A is for use only in the nominally assigned cell, 
while subset B is allowed to be lent to neighboring cells. 

The channel borrowing schemes can be categorized as: 

Simple channel borrowing: 

• simple borrowing (SB); 

• borrow from the richest (SBR); 

• basic algorithm (BA); 

• basic algorithm with reassignment (BAR); 

• borrow first available (BFA). 

Hybrid channel borrowing: 

• simple hybrid borrowing scheme (SHCB); 

• borrowing with channel ordering (BCO); 

• borrowing with directional channel locking (BDCL); 

• sharing with bias (SHB); 

• channel assignment with borrowing and reassignment (CARB); 

• ordered dynamic channel assignment with rearrangement (ODCA). 

In the next two subsections we discuss the simple and hybrid borrowing schemes in detail. 

12.1.3.1 Simple channel borrowing schemes 

In the simple borrowing (SB) strategy [11-14], a nominal channel set is assigned to a 
cell, as in the FCA case. After all nominal channels are used, an available channel from a 
neighboring cell is borrowed. To be available for borrowing, the channel must not interfere 
with existing calls. Although channel borrowing can reduce call blocking, it can cause 
interference in the donor cells from which the channel is borrowed and prevent future calls 
in these cells from being completed [15]. 

As shown in Singh et al. [14], the SB strategy gives lower blocking probability than 
static FCA under light and moderate traffic, but static FCA performs better in heavy traffic 
conditions. This is due to the fact that, in light and moderate traffic conditions, borrowing 
of channels provides a means to serve the fluctuations of offered traffic, and as long as 
the traffic intensity is low, the number of donor cells is small. In heavy traffic, the channel 
borrowing may proliferate to such an extent, due to channel locking, that the channel usage 
efficiency drops drastically, causing an increase in blocking probability and a decrease in 
channel utilization [16]. 

Because the set of channels that can be borrowed in a cell may contain more than one 
candidate channel, the way a channel is selected from the set plays an important role in the 
performance of a channel borrowing scheme. The objective of all the schemes is to reduce 
the number of locked channels caused by channel borrowing. The difference between them 
is the specific algorithm used for selecting one of the candidate channels for borrowing. 
Along these lines, several variations of the SB strategy have been proposed where channels 
are borrowed from nonadjacent cells [9, 11-14]. In the following, we discuss briefly each 
of the proposed schemes. 

12.1.3.2 Borrow from the richest (SBR) 

In this scheme, channels that are candidates for borrowing are available channels nominally 
assigned to one of the adjacent cells of the acceptor cell [11]. If more than one adjacent cell 
has channels available for borrowing, a channel is borrowed from the cell with the greatest 
number of channels available for borrowing. As discussed earlier, channel borrowing can 
cause channel locking. The SBR scheme does not take channel locking into account when 
choosing a candidate channel for borrowing. 
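
A minimal sketch of the SBR rule follows. The data layout (a mapping from each adjacent cell to its free nominal channels) and both tie-breaking rules are assumptions made for illustration; the text only specifies that the donor with the most available channels is chosen.

```python
def borrow_from_richest(free_channels_by_donor):
    """Pick a donor cell and a channel following the SBR rule.

    free_channels_by_donor maps each adjacent cell to the set of its
    nominal channels that are currently available for borrowing.
    Returns (donor, channel), or None when nothing can be borrowed.
    """
    candidates = {cell: chans
                  for cell, chans in free_channels_by_donor.items() if chans}
    if not candidates:
        return None
    # Richest donor first; ties are broken by cell name (an assumption).
    donor = min(candidates, key=lambda cell: (-len(candidates[cell]), cell))
    # Which channel of the donor to take is not specified; take the lowest.
    return donor, min(candidates[donor])


print(borrow_from_richest({"B": {3, 8}, "C": {5, 6, 9}, "D": set()}))
```

Note that nothing in this selection looks at which cells become locked by the borrowing, which is exactly the limitation of SBR described above.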

The basic algorithm (BA) is an improved version of the SBR strategy which takes channel 
locking into account when selecting a candidate channel for borrowing [11,12]. This scheme 
tries to minimize the future call blocking probability in the cell that is most affected by 
the channel borrowing. As in the SBR, channels that are candidates for borrowing are 
available channels nominally assigned to one of the adjacent cells of the acceptor cell. The 
algorithm chooses the candidate channel that maximizes the number of available nominal 
channels in the worst-case nominal cell within distance $\sigma$ of the acceptor cell. 
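
The max-min idea behind the BA can be sketched as follows. The inputs are hypothetical: `affected_cells` lists, for each candidate channel, the cells that would lose that channel if it were borrowed (the donor and the cells where it becomes locked), and `free_count` gives the number of free nominal channels in each cell. This is an illustration of the selection criterion, not the algorithm as specified in References [11, 12].

```python
def basic_algorithm_choice(affected_cells, free_count):
    """Choose the candidate channel that hurts the worst-off cell the least.

    affected_cells: dict channel -> non-empty list of cells that lose the channel.
    free_count:     dict cell -> number of currently free nominal channels.
    """
    best_channel, best_score = None, None
    for channel in sorted(affected_cells):
        # Free channels left in the worst-case affected cell after borrowing.
        score = min(free_count[cell] - 1 for cell in affected_cells[channel])
        if best_score is None or score > best_score:
            best_channel, best_score = channel, score
    return best_channel


affected = {5: ["A", "B"], 9: ["C", "D"]}
free = {"A": 4, "B": 1, "C": 3, "D": 2}
print(basic_algorithm_choice(affected, free))
```

Borrowing channel 5 would leave cell B with no free channel, so the sketch prefers channel 9, whose worst-case cell still keeps one.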

12.1.3.3 Basic algorithm with reassignment (BAR) 

This scheme [12] provides for the transfer of a call from a borrowed channel to a nomi- 
nal channel whenever a nominal channel becomes available. The choice of the particular 
borrowed channel to be freed is again made in a manner that minimizes the maximum proba- 
bility of future call blocking in the cell most affected by the borrowing, as in the BA scheme. 

12.1.3.4 Borrow first available (BFA) 

Instead of trying to optimize when borrowing, this algorithm selects the first candidate 
channel it finds [11]. Here, the philosophy of the nominal channel assignment is also 
different. Instead of assigning channels directly to cells, the channels are divided into sets, 
and then each set is assigned to cells at reuse distance $\sigma$. These sets are numbered in 
sequence. When setting up a call, channel sets are searched in a prescribed sequence to find 
a candidate channel. 

A general conclusion reached by most studies on the performance comparison of the 
previous schemes is that adopting a simple test for borrowing (e.g. borrowing the first 
available channel that satisfies the $\sigma$ constraint) yields performance results quite comparable 
to systems which perform an exhaustive and complex search method to find a candidate 
channel [9, 11-13]. SBR, BA and BFA were evaluated by simulation in Anderson [11] using 
a two-dimensional hexagonal cell layout with 360 service channels. The offered load was 
adjusted for an average blocking of 0.02. The results show that all three schemes exhibit 
nearly the same average blocking probability versus load. The BFA has an advantage over 
the other two in that its computing effort and complexity are significantly less. Here the 
complexity of each algorithm is determined based on the average number of channel tests 
per call while searching for a candidate channel to borrow. In Anderson [11], simulation 
results showed a large variation in the complexity of these algorithms depending on network 
load. For example, for a 20 % increase in the traffic, SBR requires 50 %, and the BA 100 %, 
more channel tests compared with BFA. 

12.1.4 Hybrid channel borrowing schemes 

12.1.4.1 Simple hybrid channel borrowing strategy (SHCB) 

In the SHCB strategy [4, 9, 13] the set of channels assigned to each cell is divided into two 
subsets, A (standard) and B (borrowable) channels. Subset A is nominally assigned in each 
cell, while subset B is allowed to be lent to neighboring cells. The ratio A:B is determined 
a priori, depending on an estimation of the traffic conditions, and can be adapted dynamically 
in a scheduled or predictive manner [13]. 

12.1.4.2 Borrowing with channel ordering (BCO) 

The BCO [9, 13,14] outperforms SHCB by dynamically varying the A:B channel ratio 
according to changing traffic conditions [13, 14]. In the BCO strategy, all nominal channels 
are ordered such that the first channel has the highest priority for being assigned to the 
next local call, and the last channel is given the highest priority for being borrowed by the 
neighboring cells. A variation of the BCO strategy, called BCO with reassignment, allows 
intercellular handoff, that is, immediate reallocation of a released high-rank channel to a 
call existing in a lower-rank channel in order to minimize the channel locking effect. 
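
The ordering rule of BCO can be sketched in a few lines. The sketch assumes that the nominal channels of a cell are numbered so that channel 1 has the highest priority for local calls and the largest number has the highest priority for lending; the helper names are hypothetical. In the reassignment helper, the choice of which lower-rank call to move (the one on the lowest-priority channel) is also an assumption, since the text does not fix it.

```python
def channel_for_local_call(free_nominal):
    """Local calls take the highest-priority (lowest-numbered) free channel."""
    return min(free_nominal) if free_nominal else None


def channel_to_lend(free_nominal):
    """Neighbors borrow from the other end of the ordering."""
    return max(free_nominal) if free_nominal else None


def reassign_on_release(released, busy_local):
    """BCO with reassignment: move a call from a lower-rank channel to the
    released higher-rank channel. Returns (old_channel, new_channel) or None."""
    lower_rank = [ch for ch in busy_local if ch > released]
    return (max(lower_rank), released) if lower_rank else None


free = {2, 5, 7}
print(channel_for_local_call(free), channel_to_lend(free))
print(reassign_on_release(3, busy_local={1, 4, 6}))
```

Keeping local calls packed at the low end of the ordering leaves the high end free, which is what lets the effective A:B split follow the traffic.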

12.1.4.3 Borrowing with directional channel locking (BDCL) 

In the BCO strategy, a channel is suitable for borrowing only if it is simultaneously free in three 
nearby co-channel cells. This requirement is too stringent and decreases the number of 
channels available for borrowing. In the BDCL strategy, the channel locking in the co-channel 
cells is restricted to those directions affected by the borrowing. Thus, the number 
of channels available for borrowing is greater than that in the BCO strategy. To determine 
in which case a 'locked' channel can be borrowed, 'lock directions' are specified for each 
locked channel. The scheme also incorporates reallocation of calls from borrowed to nominal 
channels and between borrowed channels in order to minimize the channel borrowing of 
future calls, especially the multiple-channel borrowing observed during heavy traffic. 

It was shown in Zhang and Yum [9] by simulation that BDCL gives the lowest blocking 
probability, followed by BCO and FCA, for both uniform and nonuniform traffic. The reduction 
of the blocking probability for BDCL and BCO over FCA for the system in Zhang and 
Yum [9] is almost uniformly 0.04 and 0.03, respectively, for the range of traffic load tested. 

The nonuniform pattern allocation FCA scheme, discussed in the previous section, can 
also be applied in the case of the hybrid channel borrowing strategies. With the use of 
nonuniform pattern allocation the relative performance of the BDCL, BCO and uniform 
FCA schemes remains the same as before, but the traffic-carrying capacity of a system can 
be increased by about 10 %. This advantage is in addition to those gained from the channel 
borrowing strategies [9]. 

12.1.4.4 Sharing with bias (SHB) 

In Yum and Wong [17] a scheme of channel borrowing with coordinated sectoring is de- 
scribed. The SHB strategy is similar to the join-biased queue rule, which is a simple but 
effective way to balance the load of servers in the presence of unbalanced traffic. Each cell 
in the system is divided in three sectors, X, Y, Z, as shown in Figure 12.2. Only calls initi- 
ated in one of these sectors can borrow channels from the two adjacent cells neighboring it 
(donor cells). In addition, the nominal channels in donor cells are divided into two subsets, 
A and B, as in the SHCB case. Channels from set A can only be used inside the donor cell, 
while channels in set B can be loaned to an acceptor cell. 

For the example shown in Figure 12.2, a call initiated in sector X of cell number 3 can 
only borrow a channel from set B of the cells numbered 1 and 2. 




Figure 12.2 Sharing with bias. 



12.1.4.5 Channel assignment with borrowing and reassignment (CARB) 

The scheme was proposed in Engel and Peritsky [12] and is statistically optimum in a 
certain min-max sense. Here channels are borrowed on the basis of causing the least harm 
to neighboring cells in terms of future call blocking probability. Likewise, reassignment of 
borrowed channels is done in such a way as to cause maximum relief to neighboring cells. 

12.1.4.6 Ordered channel assignment scheme with rearrangement (ODCA) 

This scheme, proposed in Kuek [18], combines the merits of CARB and BCO with im- 
provements to yield higher performance. In ODCA, when a call requests service, the base 
station of the cell checks to see if there are any nominal channels available. If there are 
channels available, the user will be assigned one on an ordered basis, as in BCO. Here all 
channels are numbered in predetermined order according to the same criterion as in the 
CARB scheme, and the lowest-numbered available idle channel is always selected. If all 
nominal channels are busy, the cell may borrow a nonstandard channel from a neighboring 
cell. Once a nonstandard channel is assigned, the availability lists of all affected cells where 
the assigned channel can cause interference are updated. Whenever a channel is no longer 
required, the availability lists of the affected cells are updated accordingly. Whenever a 
standard channel is available, the channel reassignment procedure is initiated to ensure 
efficient utilization. If there is a nonstandard channel in use in the cell, the call served by 
that channel is switched to the newly freed standard channel; the necessary availability 
lists are also updated. If no nonstandard channels are used in the cell, a call served by a 
standard channel with lower priority than the newly freed one is switched to the newly freed 
channel [18]. 

The performance of ODCA was studied in Kuek [18] for a highway microcellular environment 
with nonuniform teletraffic load. Performance comparison with the FCA and 
CARB shows significant improvement. The ODCA scheme exhibits better channel utilization 
compared with the CARB and FCA; the ODCA scheme also performs better than 
CARB and FCA at blocking probabilities below 0.1. For example, at a blocking probability 
of 0.05, ODCA is capable of supporting 4 % more traffic than CARB and 35 % more traffic 
than FCA [18]. However, the ODCA scheme incurs a higher computational overhead in 
assigning and reassigning channels, and more frequent switching of channels due to the 
reassignment propagation effect. 

12.1.5 Dynamic channel allocation 

Owing to short-term temporal and spatial variations of traffic in cellular systems, FCA 
schemes are not able to attain high channel efficiency. To overcome this, DCA schemes 
have been studied during the past 20 years. In contrast to FCA, there is no fixed relationship 
between channels and cells in DCA. All channels are kept in a central pool and are assigned 
dynamically to radio cells as new calls arrive in the system [19]. After a call is completed, 
its channel is returned to the central pool. 

In DCA, a channel is eligible for use in any cell provided that signal interference con- 
straints are satisfied. Because, in general, more than one channel might be available in the 
central pool to be assigned to a cell that requires a channel, some strategy must be applied 
to select the assigned channel [8]. The main idea of all DCA schemes is to evaluate the 
cost of using each candidate channel, and select the one with the minimum cost provided 
that certain interference constraints are satisfied. The selection of the cost function is what 
differentiates DCA schemes [8]. 

The selected cost function might depend on the future blocking probability in the 
vicinity of the cell, the usage frequency of the candidate channel, the reuse distance, chan- 
nel occupancy distribution under current traffic conditions, radio channel measurements of 
individual mobile users or the average blocking probability of the system [16]. 
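
The common structure of DCA schemes described above, filtering candidates by the interference constraint and then minimizing a cost, can be written as a small generic routine. The callback names `is_admissible` and `cost` are placeholders introduced here; each concrete scheme would supply its own definitions.

```python
def select_channel(pool, is_admissible, cost):
    """Return the admissible channel with minimum cost, or None if blocked.

    pool:          iterable of channel identifiers in the central pool.
    is_admissible: function channel -> bool (interference constraints hold).
    cost:          function channel -> number (scheme-specific cost).
    """
    candidates = [ch for ch in pool if is_admissible(ch)]
    if not candidates:
        return None  # no channel satisfies the constraints: the call is blocked
    return min(candidates, key=cost)


# Toy example: channel 2 violates the constraint; costs are made up.
toy_cost = {1: 5.0, 2: 0.0, 3: 2.0}
choice = select_channel([1, 2, 3], lambda ch: ch != 2, lambda ch: toy_cost[ch])
print(choice)
```

Swapping the `cost` callback is all that distinguishes one scheme from another in this view.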

Based on information used for channel assignment, DCA strategies could be classified 
either as call-by-call DCA or adaptive DCA schemes [20, 23, 27]. In the call-by-call DCA, 
the channel assignment is based only on current channel usage conditions in the service area, 
while in adaptive DCA the channel assignment is adaptively carried out using information 
on the previous as well as present channel usage conditions [20]. DCA schemes can also be 
divided into centralized and distributed schemes with respect to the type of control they 
employ. In general the schemes can be classified into the following groups: 

(1) Centralized DCA 

• First available (FA) 

• Locally optimized dynamic assignment (LODA) 

• Selection with maximum usage on the reuse ring (RING) 

• Mean square (MSQ) 

• Nearest neighbor (NN) 

• Nearest neighbor + 1 (NN + 1) 

• 1-clique 

(2) Distributed DCA 

• Locally packing distributed DCA (LP-DDCA) 

• LP-DDCA with ACI constraint 

• Moving direction (MD) 

(3) CIR measurement DCA schemes 

• Sequential channel search (SCS) 

• MSIR 

• Dynamic channel selection (DCS) 

• Channel segregation 

(4) One-dimensional systems 

• MINMAX 

• Minimum interference (MI) 

• Random minimum interference (RMI) 

• Random minimum interference with reassignment (RMIR) 

• Sequential minimum interference (SMI) 

12.1.6 Centralized DCA schemes 

In centralized DCA schemes, a channel from the central pool is assigned to a call for 
temporary use by a centralized controller. The difference between these schemes is the 
specific cost function used for selecting one of the candidate channels for assignment. 
First available (FA) is the simplest of the DCA strategies. In FA the first available 
channel within the reuse distance found during a channel search is assigned to the call. The 
FA strategy minimizes the system computational time; and, as shown by simulation in Cox 
and Reudink [8] for a linear cellular mobile system, it provides an increase of 20 % in the 
total handled traffic compared with FCA for low and moderate traffic loads. 

Locally optimized dynamic assignment (LODA) uses a cost function based on the future blocking 
probability in the vicinity of the cell in which a call is initiated [9, 13]. 

12.1.6.1 Channel reuse optimization schemes 

The objective of any mobile system is to maximize the efficiency of the system. Maximum 
efficiency is equivalent to maximum utilization of every channel in the system. It is obvious 
that the shorter the channel reuse distance, the greater the channel reuse over the whole 
service area. The cost functions selected in the following schemes attempt to maximize the 
efficiency of the system by optimizing the reuse of a channel in the system area. 

Selection with maximum usage on the reuse ring (RING) selects a candidate channel 
which is in use in the most cells in the co-channel set. If more than one channel has this 
maximum usage, an arbitrary selection among such channels is made to serve the call. If 
none is available, the selection is made based on the FA scheme. 
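
A possible reading of the RING rule is sketched below. The inputs are assumptions: `available` is the list of admissible channels in search order, and `usage` counts, for each channel, how many cells of the co-channel set currently use it. The arbitrary choice among ties is implemented here as taking the first such channel in search order.

```python
def ring_select(available, usage):
    """Select the channel used in the most cells of the co-channel set.

    Falls back to first available (FA) when no candidate is in use
    anywhere in the co-channel set. Returns None if nothing is available.
    """
    if not available:
        return None
    best = max(usage.get(ch, 0) for ch in available)
    if best == 0:
        return available[0]  # FA fallback
    # Ties are resolved by search order (one arbitrary but repeatable choice).
    return next(ch for ch in available if usage.get(ch, 0) == best)


print(ring_select([4, 7, 9], {4: 1, 7: 3, 9: 3}))
```

Preferring channels that are already heavily reused on the ring keeps the reuse pattern compact, which is the stated aim of this group of schemes.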

12.1.6.2 Mean square, nearest neighbor, nearest neighbor plus one 

The MSQ scheme selects the available channel that minimizes the mean square of the 
distance among the cells using the same channel. The NN strategy selects the available 
channel occupied in the nearest cell in distance $\ge \sigma$, while the NN + 1 scheme selects an 
eligible channel occupied in the nearest cell within distance $\ge \sigma + 1$ or distance $\sigma$ if an 
available channel is not found in distance $\sigma + 1$ [8]. 
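
The NN and NN + 1 rules can be sketched as follows, assuming that for every candidate channel the distances (in cell units) from the requesting cell to the cells currently using that channel are known. The dictionary layout and function names are hypothetical, and channels that are not in use anywhere are simply ignored in this simplified version.

```python
def nn_select(distances, min_distance):
    """Channel whose nearest current user is closest, but not closer than
    min_distance. distances: dict channel -> list of distances to its users."""
    eligible = {ch: min(d) for ch, d in distances.items()
                if d and min(d) >= min_distance}
    if not eligible:
        return None
    return min(eligible, key=lambda ch: (eligible[ch], ch))


def nn_plus_one_select(distances, sigma):
    """NN + 1: try distance >= sigma + 1 first, then fall back to sigma."""
    choice = nn_select(distances, sigma + 1)
    return choice if choice is not None else nn_select(distances, sigma)


usage = {1: [3.0, 6.0], 2: [4.0], 3: [2.0, 5.0]}
print(nn_select(usage, 3), nn_plus_one_select(usage, 3))
```

In this toy case channel 3 is rejected because one of its users is closer than the reuse distance; NN picks channel 1 and NN + 1 picks channel 2, whose nearest user is one cell further away.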

Computer simulations of FCA, MSQ, NN and NN + 1 strategies show that, under light 
traffic conditions, NN exhibits the lowest blocking rate, followed by MSQ, FA and NN + 1 
[20]. When applied to a microcellular system the NN + 1 strategy leads to lower forced 
call termination and channel changing because the mobile unit is more likely to keep the 
same channel when it moves to an adjacent cell [21]. In addition, simulation results of FA, 
RING and NN [8, 22] show that, for both one- and two-dimensional mobile systems, all 
of the above schemes operate at very low blocking rates until the offered traffic reaches 
some threshold. A small increase in the offered traffic above this threshold produces a 
considerable increase in the blocking probability of new calls and results in very little 
increase in the traffic carried by the system. The load at which blocking begins to occur in 
one-dimensional systems [22] is somewhat greater than that in two-dimensional systems 
[8]. Finally, the simulation results in Cox and Reudink [22] show that strategies like RING 
and NN, which use a channel reuse optimization approach, are able to carry 5 % more traffic 
at a given blocking rate of 3 % compared with a channel assignment strategy like FA, which 
does not employ any channel reuse optimization. 

12.1.6.3 1-clique 

All four previous schemes employ local channel reuse optimization schemes. A global 
channel reuse optimization approach is used in the 1-clique strategy. The 1-clique scheme 
uses a set of graphs, one for each channel, expressing the non-co-channel interference 
structure over the whole service area for that channel. In each graph a vertex represents a 
cell, and cells without co-channel interference are connected with edges. So, each graph 
reflects the results of a possible channel assignment. A channel is assigned from the several 
possibilities such that as many vertices as possible still remain available after the assignment. 
This scheme shows a low probability of blocking, but when there are a lot of cells the required 
computational time makes quick channel selection difficult [19]. 

12.1.6.4 Schemes with channel rearrangement 

Compared with FCA schemes, DCA schemes do not carry as much traffic at high blocking 
rates because they are not able to maximize channel reuse as they serve the randomly 
offered call attempts. In order to improve the performance of DCA schemes at high traffic 
density, channel reassignment techniques have been suggested [5, 8]. The basic goal of 
channel reassignment is to switch calls already in process, whenever possible, from the 
channels these calls are using to other channels, with the objective of keeping the distance 
between cells using the same channel simultaneously to a minimum. Thus, channel reuse 
is more concentrated, and more traffic can be carried per channel at a given blocking 
rate. 

12.1.6.5 Distributed DCA schemes 

Microcellular systems have shown great potential for capacity improvement in high-density 
personal communication networks. However, propagation characteristics will be less pre- 
dictable and network control requirements more complex than in the other systems. Cen- 
tralized DCA schemes can produce near-optimum channel allocation, but at the expense of 
a high centralization overhead. Distributed schemes are therefore more attractive for imple- 
mentation in microcellular systems, owing to the simplicity of the assignment algorithm in 
each base station. 

The proposed distributed DCA schemes use either local information about the current 
available channels in the cell's vicinity (cell-based) [24] or signal strength measurements 
[25]. In cell-based schemes a channel is allocated to a call by the base station at which 
the call is initiated. The difference with the centralized approach is that each base sta- 
tion keeps information about the current available channels in its vicinity. The channel 
pattern information is updated by exchanging status information between base stations. 
The cell-based scheme provides near-optimum channel allocation at the expense of exces- 
sive exchange of status information between base stations, especially under heavy traffic 
loads. 

Particularly appealing are the DCA interference adaptation schemes that rely on signal 
strength measurements where a base station uses only local information, without the need to 
communicate with any other base station in the network. Thus, the system is self-organizing, 
and channels can be placed or added everywhere, as needed, to increase capacity or to im- 
prove radio coverage in a distributed fashion. These schemes allow fast real-time processing 
and maximal channel packing at the expense of increased co-channel interference proba- 
bility with respect to ongoing calls in adjacent cells, which may lead to undesirable effects 
such as interruption, deadlock, and instability. 

Table 12.1 ACO matrix at base station i 

12.1.7 Cell-based distributed DCA schemes 

Local packing dynamic distributed channel assignment (LP-DDCA) is a scheme where 
each base station assigns channels to calls using the augmented channel occupancy (ACO) 
matrix, which contains necessary and sufficient local information for the base station 
to make a channel assignment decision. For M available channels in the system and $k_i$ 
neighboring cells to cell i within the co-channel interference distance, the ACO matrix 
is shown in Table 12.1. It has M + 1 columns and $k_i + 1$ rows. The first M columns 
correspond to the M channels. The first row indicates the channel occupancy in cell i 
and the remaining $k_i$ rows indicate the channel occupancy pattern in the neighborhood 
of i, as obtained from neighboring base stations. The last column of the matrix corresponds 
to the number of current available channels for each of the $k_i + 1$ co-channel cells. 
So, an empty column indicates an idle channel which can be assigned to cell i. When a 
call requests service from cell i, its base station will use the ACO matrix to assign the 
first channel with an empty column. The content of the ACO table is updated by collecting 
channel occupancy information from interfering cells. Whenever a change of channel 
occupancy happens in one cell, the base station of the cell informs the base stations of 
all the interfering cells about the change in order to update the data in the local ACO 
matrices. 
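
The channel search on the ACO matrix can be sketched as below. The matrix is represented as a list of rows of 0/1 occupancy flags, with row 0 for cell i and one further row per interfering neighbor; this representation is an assumption for illustration, and the last column with the per-cell counts of available channels is omitted because it is not needed for the search itself.

```python
def first_idle_channel(aco):
    """Return the index of the first channel whose column is empty,
    that is, unused in cell i and in all of its interfering neighbors."""
    for channel, column in enumerate(zip(*aco)):
        if not any(column):
            return channel
    return None  # every channel is occupied somewhere: the call is blocked


aco = [
    [1, 0, 0, 0],  # cell i
    [0, 1, 0, 0],  # neighbor 1
    [0, 0, 0, 1],  # neighbor 2
]
print(first_idle_channel(aco))
```

In this example channel index 2 is the first one with an empty column. A real base station would also mark the chosen entry as occupied and notify the interfering cells so that their own matrices stay consistent.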

In addition to constraining co-channel interference, the design of a wireless cellular 
system must also include measures to limit adjacent channel interference (ACI). In Cox 
and Reudink [8] a modified version of the LP-DDCA scheme was proposed that incorporates 
the ACI constraint. 

The moving direction (MD) strategy was proposed in Okada and Kubota [24] for one-dimensional 
microcellular systems. In these systems, forced call termination and channel 
changing occur frequently because of their small cell size [24]. The MD strategy uses 
information on moving directions of the mobile units to decrease both the probability of 
forced call termination and the frequency of channel changing. An available channel is selected 
among those assigned to mobile units that are elsewhere in the service area and moving in 
the same direction as the mobile in question. The search for such a channel starts from the 
nearest noninterfering cell to the one where the new call was initiated, and stops at the cell 
that is $\alpha$ reuse distances away, where $\alpha$ is a parameter. 

A channel assignment example is given in Figure 12.3, where b, c, d and e are the available 
channels, and DR is the minimum reuse distance. For this example the parameter $\alpha$ is set to 
one. The new call attempt is assigned channel b because the mobile requesting the channel 
is moving in the same direction as the mobile in cell number 5. 

Figure 12.3 Moving direction strategy illustration. 

In this way, sets of mobiles that move in the same direction and are assigned the same channel 
are formed. When a mobile of such a set crosses a cell boundary, it is likely that the other mobiles 
of the same set have also crossed out of their cells into the next ones. In this manner a mobile can use 
the same channel after handoff with higher probability. This lowers the probability of both 
changing channels and forced call termination. The strategy is efficient in systems where 
mobiles move at nearly the same speed through the cells laid along a road or a highway and 
for one-dimensional microcellular systems. The simulation results in Okada and Kubota 
[24] for a one-dimensional system show that the MD strategy provides lower probability 
of forced call termination compared with the NN, NN + 1 and FCA strategies. Although 
the MD scheme has attractive features, it is not obvious how it could be expanded to a 
two-dimensional system. 



12.1.8 Signal strength measurement-based distributed DCA schemes 

A large body of research has been published on the performance analysis of traditional 
channel allocation schemes, both FCA and DCA [2, 4, 26], in which knowledge of the 
mobile's locations is not taken into account. In all of these schemes, channels are allocated to 
cells based on the assumption that the mobile may be located anywhere within the boundary 
of the cell. Thus, the packing of channels is not maximal. These schemes suffer from the 
fact that the selected fixed reusability distance might be too pessimistic. In the interference 
adaptation schemes, mobiles measure the amount of co-channel interference to determine 
the reusability of the channel. If a mechanism is assumed to exist by which mobiles and 
base stations can measure the amount of interference, then maximal channel packing could 
be achieved. An example of a system based on this principle is the Digital European Cordless 
Telecommunications (DECT) standard. However, local decisions can lead to suboptimal 
allocation. In interference adaptation DCA schemes, mobiles and base stations estimate 
CIR and allocate a channel to a call when predicted CIRs are above a threshold. It is 
possible that this allocation will cause the CIR of established calls to deteriorate, in which 
case a service interrupt occurs. If the interrupted call cannot find an acceptable new channel 
immediately, the result is a premature service termination, referred to as deadlock. Even if
the interrupted call finds an acceptable channel, setting up a link using the new channel can
cause interruption of another established link. These successive interruptions are referred 
to as instability. If no channel is available for the initial call request, the call is blocked. 

12.1.8.1 Sequential channel search

The simplest of the interference adaptation DCA schemes is the sequential channel search (SCS) strategy,
where all mobile-base station pairs examine channels in the same order and choose the first 
available with acceptable CIR. It is expected that SCS will support a volume of traffic by 
suboptimal channel packing at the expense of causing many interruptions. 
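
The selection rule of SCS can be sketched in a few lines. This is only an illustration under stated assumptions: the function name, the use of dB values, the convention that None marks a channel already in use at this base station, and the treatment of a CIR exactly equal to the threshold as acceptable are all hypothetical choices, not part of the scheme as published.

```python
from typing import Optional, Sequence


def sequential_channel_search(cir_db: Sequence[Optional[float]],
                              threshold_db: float) -> Optional[int]:
    """Return the first channel, in the common search order, with acceptable CIR.

    cir_db[k] is the predicted CIR (dB) on channel k, or None if channel k is
    already in use at this base station (an assumed convention).
    Returns None when no channel qualifies, i.e. the call is blocked.
    """
    for channel, cir in enumerate(cir_db):
        if cir is not None and cir >= threshold_db:
            return channel
    return None
```

Because every mobile-base station pair scans in the same order, low-numbered channels are packed first, which is also why a new allocation is likely to disturb established calls on those channels.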

Dynamic channel selection (DCS) is a fully distributed algorithm for flexible mobile 
cellular radio resource sharing based on the assumption that mobiles are able to measure 
the amount of interference they experience in each channel. In DCS, each mobile station 
estimates the interference probability and selects the base station which minimizes its value. 
The interference probability is a function of a number of parameters, such as the received 
signal power from base stations, the availability of channels, and co-channel interference. 
In order to evaluate the interference probability, specific models for each of the above 
parameters should be developed. In Hong and Rappaport [28], models are developed to 
calculate probabilities of channel availability, desired carrier power and the CIR for constant 
traffic load. 

The channel segregation strategy was proposed in Akaiwa and Andoh [25] as a
self-organized dynamic channel assignment scheme. By scanning all channels, each cell selects
a vacant channel with an acceptable co-channel interference level. The scanning order is 
formed independently for each cell in accordance with the probability of channel selectabil- 
ity, P(i), which is renewed by learning. For every channel i in the system, each cell keeps 
the current value of P(i). When a call request arrives at the base station, the
channel with the highest value of P(i) under observation is selected. Subsequently, the
received power level of the selected channel is measured in order to determine whether the
channel is used or not. If the measured power level is below (or above) a threshold value,
the channel is considered to be idle (or busy). If the channel is idle, the base station starts
communication using the channel, and its priority is increased. If the channel is busy, the
priority of the channel is decreased and the next-highest-priority channel is tried. If all
channels are busy, the call is blocked. The value of P(i) and the update mechanism determine
the performance of the algorithm. In Furuya and Akaiwa [29], P(i) is updated to show the
successful transmission probability on channel i as follows:

$$P(i) = \frac{P(i)N(i) + 1}{N(i) + 1} \ \text{and} \ N(i) = N(i) + 1 \quad \text{if the channel is idle}$$

$$P(i) = \frac{P(i)N(i)}{N(i) + 1} \ \text{and} \ N(i) = N(i) + 1 \quad \text{if the channel is busy}$$

Here N(i) is the number of times channel i is accessed. In Akaiwa and Andoh [25] the
update mechanism for P(i) is defined as P(i) = Ns(i)/N(i), where Ns(i) is the number
of successful uses of channel i. Simulation results show that interference due to carrier
sense error is reduced by 1:10 to 1:100 with channel segregation [29]. Also, the blocking
probability is greatly reduced compared with FCA and DCA schemes.

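
The priority update of Furuya and Akaiwa [29] lends itself to a short sketch. The class below is a minimal illustration, not the authors' implementation: the class and method names, the initial priority of 0.5, the dBm units and the decision to treat a power level exactly at the threshold as busy are assumptions made for the example.

```python
from typing import List, Optional, Sequence


class ChannelSegregation:
    """Per-cell priority table P(i) with access counters N(i)."""

    def __init__(self, num_channels: int, threshold_dbm: float) -> None:
        self.p: List[float] = [0.5] * num_channels  # assumed initial priority
        self.n: List[int] = [0] * num_channels      # times each channel was accessed
        self.threshold_dbm = threshold_dbm

    def select(self, sensed_power_dbm: Sequence[float]) -> Optional[int]:
        """Scan channels in decreasing P(i); return the first idle one or None."""
        order = sorted(range(len(self.p)), key=lambda i: -self.p[i])
        for i in order:
            idle = sensed_power_dbm[i] < self.threshold_dbm
            # P(i) = [P(i)N(i) + 1]/[N(i) + 1] if idle, [P(i)N(i)]/[N(i) + 1] if busy
            self.p[i] = (self.p[i] * self.n[i] + (1.0 if idle else 0.0)) / (self.n[i] + 1)
            self.n[i] += 1
            if idle:
                return i
        return None  # all channels busy: the call is blocked
```

With this update, P(i) is simply the running fraction of accesses on which channel i was found idle, so the initial value is overwritten on the first access. Cells that interfere with one another tend to learn different favourite channels, which is the segregation effect.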
12.1.9 One-dimensional cellular systems

All the DDCA schemes described in this section are applicable for one-dimensional cellular 
mobile systems. One-dimensional structures can be identified in cases such as streets with 
tall buildings shielding interference on either side, see Figure 12.1(c). 

The minimum interference (MI) scheme is well known and among the simplest for one- 
dimensional cellular systems. It is incorporated in the enhanced cordless telephone (CT-2) 
and DECT systems. In an MI scheme, a mobile signals its need for a channel to its nearest 
base station. The base station then measures the interfering signal power on all channels not 
already assigned to other mobiles. The mobile is assigned the channel with the minimum 
interference. The order in which mobiles are assigned channels affects the efficiency of 
channel reuse. Taking into consideration the order of service, we discuss three variations of 
the MI scheme: 

• Random minimum interference (RMI) is the scheme where the mobiles are served ac- 
cording to the MI scheme in a random order or, equivalently, in the order in which calls 
arrive in the system. 

• Random minimum interference with reassignment (RMIR) is a modification where mo- 
biles are first served according to the RMI scheme. Each mobile is then reassigned a 
channel by its base station according to the MI scheme. Those mobiles denied service by 
the initial RMI scheme also try to obtain a channel again. The order in which mobiles are 
reassigned is random. The number of times this procedure is carried out is the number of 
reassignments, R. 

• Sequential minimum interference (SMI) is a modification where mobiles are assigned
channels according to the MI scheme in a sequential order. The sequence followed is 
such that any mobile is served only after all the mobiles that are ahead of it have had 
a chance to be served. This procedure would require some coordination between base 
stations because of the sequential order of service. 
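
The channel choice shared by all three variants can be sketched as follows. The function name and the data layout (a list of measured interference powers indexed by channel and a set of channels already assigned at the base station) are assumptions for illustration. RMI, RMIR and SMI differ only in the order in which mobiles are passed to such a routine and in whether it is run again.

```python
from typing import Optional, Sequence, Set


def mi_channel(interference_dbm: Sequence[float],
               in_use: Set[int]) -> Optional[int]:
    """Pick the unassigned channel with the least measured interference.

    Returns None if every channel is already assigned to another mobile.
    """
    candidates = [ch for ch in range(len(interference_dbm)) if ch not in in_use]
    if not candidates:
        return None
    return min(candidates, key=lambda ch: interference_dbm[ch])
```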

In general, there is a trade-off between quality of service, the implementation complexity
of the channel allocation algorithms and spectrum utilization efficiency. Simulation results
[4, 7, 8] show that, under low traffic intensity, DCA strategies perform better. However, FCA
schemes become superior at high offered traffic, especially in the case of uniform traffic. In
the case of nonuniform traffic and light to moderate loads, it is believed that the DCA scheme
will perform better due to the fact that, under low traffic intensity, DCA uses channels more
efficiently than FCA.

In FCA, the assignment control is made independently in each cell by selecting a vacant
channel among those allocated to that cell in advance. In DCA, the knowledge of occupied
channels in other cells as well as in the cell in question is necessary. The amount of control 
is different in each DCA strategy. If the DCA requires a lot of processing and complete 
knowledge of the state of the entire system, the call setup delay would be significantly long 
without high-speed computing and signaling. The implementation complexity of the DCA 
is discussed in Vucetic [30]. 

Hybrid channel assignment schemes are a mixture of the FCA and DCA techniques. In 
HCA, the total number of channels available for service is divided into fixed and dynamic 
sets. The fixed set contains a number of nominal channels that are assigned to cells as in 
the FCA schemes and in all cases are to be preferred for use in their respective cells. The
second set of channels is shared by all users in the system to increase flexibility. When a
call requires service from a cell and all of its nominal channels are busy, a channel from the 
dynamic set is assigned to the call. The channel assignment procedure from the dynamic 
set follows any of the DCA strategies described in the previous section. For example, in the 
studies presented in References [4, 31], the FA and RING strategies are used, respectively, 
for DCA. Variations of the main HCA schemes include HCA with channel reordering [31]
and HCA schemes where calls that cannot find an available channel are queued instead of 
blocked [5]. The call blocking probability for an HCA scheme is defined as the probability 
that a call arriving at a cell finds both the fixed and dynamic channels busy. Performance 
evaluation results of different HCA schemes have been presented in References [4-6, 32]. 
In Kahwa and Georganas [4], a study is presented for an HCA scheme with Erlang-b service 
discipline for cells of uniform size and shape where traffic is uniformly distributed over the 
whole system. The measure of interest is the probability of blocking as the load increases 
for different ratios of fixed to dynamic channels. As shown in Kahwa and Georganas [4], for
a system with fixed to dynamic channel ratio 3:1, the HCA gives a better grade of service 
than FCA for load increases up to 50 %. Beyond this load HCA performs better in all cases 
studied in Kahwa and Georganas [4]. A similar pattern of behaviour is obtained for an HCA
scheme with an FA DCA scheme and Erlang-c service discipline (calls that cannot find
an available channel are queued instead of blocked). In addition, the HCA scheme with 
Erlang-c service discipline [5] has a lower probability of blocking than the HCA scheme 
with Erlang-b service discipline [4]. This phenomenon is expected because in the former 
case calls are allowed to be queued until they can be served. 
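
The order of preference in HCA (nominal channels first, dynamic set second, blocking only when both are exhausted) can be sketched as below. This is a minimal illustration with hypothetical names: `free_nominal` stands for the idle nominal channels of the cell, and `dca_select` is a placeholder callable for whichever DCA strategy manages the dynamic set.

```python
from typing import Callable, Optional, Set, Tuple


def hca_assign(free_nominal: Set[int],
               dca_select: Callable[[], Optional[int]]) -> Optional[Tuple[str, int]]:
    """Serve one call: nominal (fixed) channels first, then the dynamic set."""
    if free_nominal:
        channel = min(free_nominal)   # any idle nominal channel would do
        free_nominal.remove(channel)
        return ("fixed", channel)
    channel = dca_select()            # delegate to the DCA strategy in use
    if channel is not None:
        return ("dynamic", channel)
    return None                       # both sets busy: the call is blocked
```

In the queued (Erlang-c) variant the final branch would place the call in a waiting queue instead of returning None.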

In Yue [32], two different approximating models were presented. In the first model the 
traffic offered in the dynamic channels is modeled as an interrupted Poisson process, while 
the second used GI/M/m(m). The blocking probability vs the arrival rate for both models 
presents the same pattern of behavior as the simulation results of References [4, 5]. 

Flexible channel allocation (FlCA) schemes divide the set of available channels into
fixed and flexible sets. Each cell is assigned a set of fixed channels that typically suffices 
under a light traffic load. The flexible channels are assigned to those cells whose channels 
have become inadequate under increasing traffic loads. The assignment of these emergency 
channels among the cells is done in either a scheduled or predictive manner [33]. In the 
literature, proposed FlCA techniques differ according to the time at which and the basis on
which additional channels are assigned. 

In the predictive strategy, the traffic intensity or, equivalently, the blocking probability
is constantly measured at every cell site so that the reallocation of the flexible channels
can be carried out at any point in time [16]. Fixed and flexible channels are determined
and assigned (or released) to (or from) each cell according to the change in traffic in- 
tensity or blocking probability measured in each cell. The number of dynamic channels 
required in a cell is determined according to the increase in measured traffic intensity. 
The acquired flexible channels can be used in a manner identical to the fixed channels in 
a cell as long as the cell possesses the channels. As long as a cell has several free fixed 
channels, no flexible channels are assigned to it if the traffic intensity is below a certain 
threshold [33]. 

If the flexible channels are assigned on a scheduled basis, it is assumed that the variations
of traffic, such as the movement of traffic peaks in time and space, are estimated a priori.
The change in assignment of flexible channels is then made at the predetermined peaks of
traffic change [16]. Flexible assignment strategies use centralized control and require the
central controller to have up-to-date information about the traffic pattern in its area in order
to manage the assignment of the flexible channels [16].

Figure 12.4 Concentric sub-cells.

Reuse partitioning (RUP) is an effective concept to obtain high spectrum efficiency in 
cellular systems. In RUP, as shown in Figure 12.4, each cell in the system is divided into 
two or more concentric subcells (zones). Because the inner zones are closer to the base 
station located at the center of the cell, the power level required to achieve a desired CIR in 
the inner zones can be much lower compared with the outer zones. Thus, the channel reuse 
distance (i.e. the distance between cells using the same channel) can be smaller for the inner 
zones than for the outer ones, resulting in higher spectrum efficiency. Reuse partitioning 
schemes could be divided into fixed and adaptive. 

Fixed reuse partitioning 

• Simple reuse partitioning. 

• Simple sorting channel assignment algorithm. 

Adaptive reuse partitioning 

• Autonomous reuse partitioning (ARP).

• Flexible reuse (FRU).

• DDCA.

• All-channel concentric allocation (ACCA).

• Self-organized reuse partitioning (SORP).

12.1.10 Fixed reuse partitioning 

Simple reuse partitioning was introduced in Halpern [35]. In this scheme, available channels
are split among several overlaid cell plans with different reuse distances. The underlying 
principle behind RUP is to reduce signal-to-interference ratio (SIR) for those units that 
already have more than adequate transmission quality while offering greater protection to 
those units that require it. The goal is to produce an overall SIR distribution that satisfies
system quality objectives while bringing about a general increase in system capacity. For
the same SIR objective, reuse partitioning has the potential to obtain a significant increase
in system capacity when compared with a system that uses only a single reuse factor [35].

Simple RUP can be implemented by dividing the spectrum allocation into two [35] or 
more [34] groups of mutually exclusive channels. Channel assignment within the ith group
is then determined by the reuse factor N for that group. Mobile units with the best received 
signal quality will be assigned to the group of channels with the smallest reuse factor value, 
while those with the poorest received signal quality will be assigned to the group of channels 
with the largest reuse factor value. As the received signal quality for a mobile unit changes, 
it can be handed off to 

(1) a channel that belongs to a different reuse group on the same zone at the same cell; 

(2) a channel that belongs to the same or to a different group on another zone at the 
same cell; 

(3) a channel belonging to the same or a different group at another cell. 

Typically, the mobile units closer to a cell site will be served by channels from a group 
having a small value of N [35]. 

There are two main design issues related to the simple RUP concept. The first issue is 
the capacity allocation problem, which is to decide how many channels should be assigned
to each zone. The second issue is the actual assignment of channels to calls. In Zande 
and Frodigh [34] the performance limits of the RUP concept were explored, and methods 
for allocating capacity to the different cell zones as well as optimum real-time channel 
assignment schemes have been presented. 

The simple sorting channel assignment algorithm also divides each cell into a number 
of concentric zones and assigns a number of channels, as in simple RUP. For each mobile
in the cell, the base station measures the level of SIR and places the measurements in a 
descending order. Then it assigns channels to the set of at most M mobiles with the largest 
values of SIR, where M is the number of available channels in the entire cell. The mobile 
in the set with the smallest value of SIR is assigned a channel from the outer cell zone. 
The assignment of mobile channels according to ascending values of SIR continues until 
all channels from the outer zone are used. The base station continues to assign channels in 
the next zone, and so on, until all mobiles in the set have been assigned channels. 
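
The sorting step can be sketched as follows. This is an illustration under assumptions: the function name, the dictionary of SIR measurements keyed by a mobile identifier, and the list `zone_capacity` (channels per zone, innermost zone first) are hypothetical. As in the description above, the result states only which zone each mobile is placed in, not which channel it receives.

```python
from typing import Dict, List


def simple_sorting(sir_db: Dict[str, float],
                   zone_capacity: List[int]) -> Dict[str, int]:
    """Map each served mobile to a zone index (0 = innermost zone).

    Only the M mobiles with the largest SIR are served, where M is the total
    number of channels in the cell. They are then taken in ascending SIR
    order and fill the outermost zone first, moving inward.
    """
    total_channels = sum(zone_capacity)
    served = sorted(sir_db, key=sir_db.get, reverse=True)[:total_channels]

    assignment: Dict[str, int] = {}
    zone = len(zone_capacity) - 1           # start with the outermost zone
    remaining = zone_capacity[zone]
    for mobile in reversed(served):         # ascending SIR
        while remaining == 0:               # zone exhausted: move inward
            zone -= 1
            remaining = zone_capacity[zone]
        assignment[mobile] = zone
        remaining -= 1
    return assignment
```

For example, with SIR values a = 30, e = 25, c = 20, b = 12, d = 8 dB and two channels in each of two zones, d is not served, b and c are placed in the outer zone, and e and a in the inner zone.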

The simple sorting channel algorithm achieves almost optimum performance. It also 
allows 1.4-3 times more traffic than the FCA scheme. An important remaining issue is that 
the sorting scheme only determines which cell plan each mobile should use. It does not 
assign actual channels, which must be done with some care. In addition, if all cells using a 
certain channel group started the channel assignment by using the first channel in the group, 
we would get an exceptionally high interference level on that particular channel. A random 
selection procedure would be one way to solve this problem. 

12.1.11 Adaptive channel allocation reuse partitioning (ACA RUP) 

In this case any channel in the system can be used by any base station, as long as the required 
CIR is maintained. Autonomous reuse partitioning (ARP) is based on the RUP concept and
real-time CIR measurements. In this technique, all the channels are viewed in the same order
by all base stations, and the first channel which satisfies the threshold condition is allocated
to the mobile attempting the call. Therefore, each channel is reused at a minimum distance
with respect to the strength of the received desired signal. In ARP, base stations conduct
their allocations independent of one another, and no cooperative control is necessary. 

As compared with simple FCA, ARP doubles the traffic handling capacity of the system 
and decreases the co-channel interference by 1:4 [36]. ARP improves the traffic handling at 
the cost of the SIR margin in each channel. This creates problems for fast moving mobile 
stations such as car-mounted units, which suffer from rapid fluctuations in signal level. 

The flexible reuse (FRU) scheme is based on the principle that, whenever a call requests 
service, the channel with the smallest CIR margin among those available is selected. If there 
is no available channel, the call is blocked. Simulations in Onoe and Yasuda [37] showed 
that FRU can effectively improve system capacity, especially for users with portable units. 
More specifically, a capacity gain of 2.3-2.7 of FRU over FCA was observed. However, 
the FRU strategy requires a large number of CIR measurements, which makes it virtually 
impractical for high-density microcellular systems. 
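
The FRU selection rule differs from the first-fit rule of ARP only in which acceptable channel is taken. A minimal sketch follows, assuming that a channel is "available" when its measured CIR is at or above the threshold and that None marks a channel already in use; the function name and these conventions are illustrative assumptions.

```python
from typing import Optional, Sequence


def flexible_reuse(cir_db: Sequence[Optional[float]],
                   threshold_db: float) -> Optional[int]:
    """Return the available channel with the smallest CIR margin, or None."""
    best: Optional[int] = None
    for channel, cir in enumerate(cir_db):
        if cir is None or cir < threshold_db:
            continue                        # in use, or CIR not acceptable
        if best is None or cir < cir_db[best]:
            best = channel                  # smaller margin above the threshold
    return best
```

Note that every candidate channel has to be measured before a decision is made, which is the measurement burden mentioned above.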

The self-organized reuse partitioning scheme (SORP) uses the method where each base 
station has a table in which average power measurements for each channel in its cell and 
the surrounding cells are stored. When a call arrives, the base station measures the received 
power of the calling mobile station (in order to define at which subcell the mobile station 
is located) and selects a channel, which shows the average power closest to the measured 
power. The channel is used if available, otherwise the second closest candidate is tried. 
The content of the table for the chosen channel is updated with the average value of the 
measured power and the power of the mobile stations using the same channel. The power 
level of the other mobile stations is broadcast by their base station. As a consequence of 
this procedure, in each base station channels that correspond to the same power are grouped 
autonomously for self-organized partitioning. 
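
The table lookup and update in SORP can be sketched as below. This is a simplified illustration, not the published algorithm: the class name, the running-mean update with a per-channel sample counter, the averaging of dBm values directly, and the `is_available` callback are all assumptions. The broadcast power levels of other mobile stations on the same channel are not modelled.

```python
from typing import Callable, List, Optional, Sequence


class SorpTable:
    """Per-base-station table of average received power per channel."""

    def __init__(self, initial_avg_dbm: Sequence[float]) -> None:
        self.avg: List[float] = list(initial_avg_dbm)
        self.count: List[int] = [1] * len(self.avg)  # samples behind each average

    def select(self, measured_dbm: float,
               is_available: Callable[[int], bool]) -> Optional[int]:
        """Try channels in order of closeness of their average to the measurement."""
        order = sorted(range(len(self.avg)),
                       key=lambda ch: abs(self.avg[ch] - measured_dbm))
        for ch in order:
            if is_available(ch):
                self.count[ch] += 1
                # running mean of the powers observed on this channel
                self.avg[ch] += (measured_dbm - self.avg[ch]) / self.count[ch]
                return ch
        return None
```

Over time each channel's average drifts toward the power level of the mobiles that use it, so channels sort themselves into groups that correspond to the concentric subcells.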

In Chawla and Qiu [85], a performance comparison is made between SORP, conventional 
ARP and random DCA schemes. The simulation analysis showed that SORP and ARP show 
almost the same performance, which is far superior to random DCA. SORP can reduce the 
occurrence of intracell handoff and can reach a desired channel quickly, while achieving 
high traffic capacity. The essential difference between ARP and SORP is that ARP always 
senses the channels in the same order until one is available, while SORP learns which 
channel is proper for the calling mobile, so it can find a desired channel more quickly. 

All channel concentric allocation (ACCA) is an extension of the RUP concept. All radio 
channels of a system are now allocated nominally in the same manner for each cell, as in 
Figure 12.5. Each cell is divided into N concentric regions. Each region has its own channel 
allocation. Here, each channel is assigned to a mobile belonging to the concentric region in
which that channel is allocated, and has a specific desired signal level corresponding to 
the channel location. Therefore, each channel has its own reuse distance determined from 
the desired signal level. Thus, ACCA accomplishes effective channel allocation in a global 
sense, although it is a self-organizing distributed control algorithm. Computer simulations 
have shown that the system capacity at a blocking rate of 3 % is improved by a factor of 
2.5 compared with the FCA [38]. If, in addition, a transmitter power control is implemented 
on top of ACCA, the system accomplishes a capacity 3.4 times greater than FCA. 

Figure 12.5 Principle of the all-channel concentric allocation.

Figure 12.6 DCCA cell structure.

Distributed control channel allocation (DCCA) is a dynamic channel allocation scheme
based on the ARP concept. In this scheme all cells are identical, and channels are ordered
in the same manner, starting with channel number one, by all the base stations in the
network. The decision to allocate a channel is made locally, based on CIR measurements.
The architecture of a cell in DCCA is shown in Figure 12.6. It consists of an omnidirectional 
central station connected to six symmetrically oriented substations [39, 40]. The substations 
are simple transceivers, and can be switched on and off under the control of the main station. 
When the traffic density of the cell is low, all the substations are off and the only operating 
station is the main station, at the center of the cell covering the entire cell area. Gradually, 
as call traffic increases, forced call blocking will occur due to an unacceptable level of 
co-channel interference or the unavailability of resources. In this case, the main base station 
switches on the nearest substation to the mobile unit demanding access. This in effect 
relocates the main base station closer to the mobile requesting service: therefore, CIR 
measurements will now be higher, thus improving the probability of finding an acceptable 
channel. If the traffic is reduced, the main station switches off a number of substations. 
The system therefore automatically adapts itself to time-variant cell traffic density. As a 
result, an improvement in both system efficiency and traffic capacity can be achieved. As
discussed in Madani and Aghvami [40], the DCCA system results in lower probability of
forced termination of calls. Computer simulation showed a drastic reduction in the number 
of handoffs and almost 50 % less forced termination of calls compared with the ARP scheme. 



12.2 RESOURCE MANAGEMENT IN 4G 

Based on the discussion presented so far in Chapters 11 and 12, an integrated bandwidth
management approach is now presented that can be implemented in next generation wireless 
networks that support multimedia services (data, voice, video, etc.). The main principles of 
the approach are summarized as follows: 

(1) The system supports multiple classes of service that require various levels of quality 
that may range from strict to flexible and soft QoS requirements. 

(2) Advance bandwidth reservation is performed in the cell toward which a mobile user is
moving in order to assist and support a seamless handoff process.

(3) User mobility is introduced into the reservation scheme and process in order to 
optimize the efficiency of handoff mechanisms and minimize, if not eliminate, the 
unnecessary reservation of resources and, therefore, improve the system capacity 
and throughput. 

(4) Bandwidth reconfiguration processes are used that may allow the efficient resource 
redistribution in a cell to satisfy the QoS requirements of all the mobile users in 
the cell, especially when users with flexible QoS requirements are supported in the 
system. 

(5) A mobile agent based framework is used to facilitate the efficient implementation 
of the above integrated approach. 

One of the elements in future resource management is mobile location and tracking. Position
location technology can be loosely classified into two major categories: mobile-based
solutions and network-based solutions. The global positioning system (GPS) is a worldwide
radio-navigation system formed from a constellation of 24 satellites and their ground sta- 
tions. As an alternative to mobile-based approaches, cellular networks can be used as the 
sole means of providing location services, where the MSs are located by measuring the 
signals traveling to and from a set of fixed cellular BSs. The signal measurements are used, 
for example, to determine the length and/or direction of the individual radio paths, and 
then the MS position is computed from geometric relationships. Basically, radiolocation 
systems can be implemented based on either signal strength, angle of arrival (AOA), or time 
of arrival (TOA) measurements or their combinations [41, 42].
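
As an illustration of computing the MS position from geometric relationships, the sketch below solves the two-dimensional case with exactly three base stations and noise-free range estimates. These are simplifying assumptions for the example; the function name and argument layout are hypothetical. With TOA measurements each range would be obtained as the propagation speed multiplied by the measured delay. Subtracting the first circle equation from the other two removes the quadratic terms and leaves a 2 x 2 linear system.

```python
import math
from typing import Sequence, Tuple


def trilaterate(bs: Sequence[Tuple[float, float]],
                dist: Sequence[float]) -> Tuple[float, float]:
    """Locate a point from its distances to three non-collinear base stations."""
    (x1, y1), (x2, y2), (x3, y3) = bs
    d1, d2, d3 = dist

    # Linear system A [x, y]^T = b from (circle i) minus (circle 1), i = 2, 3
    a11, a12 = 2.0 * (x2 - x1), 2.0 * (y2 - y1)
    a21, a22 = 2.0 * (x3 - x1), 2.0 * (y3 - y1)
    b1 = d1**2 - d2**2 + x2**2 + y2**2 - x1**2 - y1**2
    b2 = d1**2 - d3**2 + x3**2 + y3**2 - x1**2 - y1**2

    det = a11 * a22 - a12 * a21
    if math.isclose(det, 0.0, abs_tol=1e-12):
        raise ValueError("base stations are collinear; position is ambiguous")

    # Cramer's rule for the 2 x 2 system
    x = (b1 * a22 - a12 * b2) / det
    y = (a11 * b2 - a21 * b1) / det
    return x, y
```

In practice the ranges are noisy and more than three base stations may be heard, in which case a least-squares solution of the same linear system would be the natural extension.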

However, there are several problems associated with the existing network-based geolo- 
cation systems which limit and prevent the use of location information of mobile users for 
network management purposes. First, the triangulation measurement data collecting pro- 
cedure is quite complicated. At least two or three BSs should be involved in locating the 
position of a mobile user. Each BS will perform some triangulation measurement. Then, all 
this data has to be sent to some device to carry out the calculation. This procedure requires 
some specialized protocol to support the information exchange. This also would waste some
bandwidth in data transmission and introduce some delay to the overall location algorithm.
Nowadays, for some practical systems, it may take several seconds to locate a mobile user.
So, if the user moves with high speed, we cannot obtain accurate position information
in real time. Furthermore, performing the calculation at the BS instead of the MS raises some
difficulties in obtaining accurate radio parameters of an MS that may vary with time and 
position. Finally, if all the mobile users in the network need geolocation information, the 
computational load at one BS becomes very heavy. Such an overload condition may lead 
to high processing delays or failures of the location module in the BSs. 

By moving the measurement and calculation functionality to the mobile station, all the 
information needed for the geolocation can be gathered almost at one time and then be 
treated locally. In order to allow the geolocation system to have more flexibility so that the 
MS can smoothly roam among different networks with different geolocation approaches, 
a mobile agent-based system framework can be used. The use of an approach based on
code-on-demand increases the flexibility of the system and keeps agents simple and
small at the same time. This is an element of a broader concept of active networks that will
be discussed later in Chapter 16. In this section, one of the basic ideas introduced is that,
when the MS is turned on or roams into a network with a different geolocation approach, it
sends a message agent (MA) to the BS. Then, the BS will generate a geolocation agent (GA) 
and send it to the mobile station. The GA should contain the signal processing algorithm 
for signal measurement and the triangulation algorithm. By using this framework, mobile 
stations can obtain all the data for triangulation calculation almost at the same time and 
carry out the geolocation calculation locally. 



12.3 MOBILE AGENT-BASED RESOURCE MANAGEMENT 

In general, a major incentive for an agent-based approach is that policies can be implemented 
dynamically, allowing for a resource-state-based call admission, reservation and network 
management strategy. Agents are used to discover resources available inside the network 
and claim resources on behalf of customers according to some 'figures of merit' [43], 
which represent tradeoffs between bandwidth claimed and loss risk incurred due to high 
utilization. Agents are able to trigger adaptation of applications inside the network on behalf 
of customers. This allows for an immediate response to resource shortages, decreases the 
amount of useless data transported, and reduces signaling overhead. Mobile agents provide 
the highest possible degree of flexibility and can carry application-specific knowledge 
into the network to locations where it is needed. In this section, we provide a detailed 
description of the mobility-assisted handoff/bandwidth reservation/call admission control 
scheme which can provide a flexible QoS management strategy in wireless networks. We 
also present a mobile agent-based framework and demonstrate its use in the implementation 
of the integrated strategy. 

The agents used in the system and their corresponding roles are as follows: 

• Message agents (MAs) are used to exchange information and management data among 
agents and managers. They are created and received by message managers (MMs). MMs
can also forward messages contained in MAs to the corresponding managers or agents
in the same network element.

• Geolocation agents (GAs) are created by geolocation managers (GMs) in base stations
and are sent to mobile stations. They contain the signal processing algorithm for signal 
measurement and the triangulation algorithm used in the current networks. 

• Bandwidth reservation agents (BRAs) are created by bandwidth reservation managers 
(BRMs) and are sent to mobile stations. They contain the bandwidth reservation algorithm 
used in the current networks. 

• Call admission control agents (CACAs) are created by call admission control managers 
(CACMs) embedded in base stations and are sent to mobile stations. The CAC strategy 
can be flexibly deployed in this way. CACAs will collect customer requirements and send
necessary information by MAs to CACMs. The CAC algorithm is carried out by CACMs.

When a mobile station is turned on, an MA is sent out to the base station containing 
initial information about the mobile station. MM in the base station relays the information 
in MA to CACM, GM and BRM. CACM will create a new CACA and send it back to the
mobile station. CACA will collect user information about the QoS requirement and send it 
with call-in requirement back to CACM by MA. The CAC algorithm embedded in CACM 
will make the decision of accepting the new call or not. Once the new call is accepted, GM 
and BRM also will create a new GA and a new BRA, respectively, and send them back to the 
mobile station. Within the mobile station, GA can output the position of mobile station itself 
periodically and this position information is input into the position information processing 
module. This module then provides mobility information about mobile stations to BRA. 
BRA can use mobility information to calculate the bandwidth required for handoff purposes. 
The bandwidth reservation is also sent to the BS by MA. Finally, BRM at base station will 
summarize the bandwidth needed by each of its neighboring cells and send each neighboring 
cell an MA containing the corresponding bandwidth reservation requirements. BRM at 
base station in the neighboring cell can use this result to perform the bandwidth reservation 
process. In the case of handoff, a handoff requirement and also the QoS requirement will be 
sent by MA to the CACM embedded in the target base station. The pool used by bandwidth 
reconfiguration procedure is maintained by CACM. After bandwidth reallocation, CACM 
will send out message agents to MSs informing them about the reconfiguration results. 

For the mobility predictive resource management scheme in a centralized management 
architecture, the following two modules are required: geolocation module and bandwidth 
reservation computation module. In the CS (traditional centralized client-server) approach, 
the management primitives are often low level and fixed, and no semantic compression is 
allowed to be performed. As a result, these two modules work independently. The only 
entity that these two modules can operate with is the management information base (MIB) 
embedded in the mobile station. The geolocation module periodically updates the position 
information in MIB, while the bandwidth reservation module will access the MIB for 
position information to carry out the prediction and calculate the bandwidth required by 
the mobile devices under consideration. In the mobile agent based approach, the agent has 
the ability to perform the semantic compression. This means that all the simple steps that 
should be processed one by one in CS mode now can be integrated into some agent that can 
carry out a more complicated task autonomously. The agents move to the mobile station 
and carry out geolocation and bandwidth prediction locally. As a result, we 
should expect that the interactive traffic between mobile station and network management 
station can be reduced dramatically. 

The various steps involved in the centralized management approach and the corresponding 
control messages involved are summarized as follows: 

(1) Geolocation - whenever a mobile station needs to update its position information in 
the MIB, it sends out a request $I_{mn}$ to the corresponding NMS (network management 
station), which in turn sends out requests $I_{nb}$ to $\beta$ different base stations near the 
mobile station (the value of $\beta$ depends on the geolocation technology). After each 
one of these base stations obtains the corresponding measurement results, they return 
the measurements $R_{bn}$ to the NMS. The NMS then uses the results to perform the 
triangulation computation and estimate the position of the mobile station. Finally, 
the position result $R_{nm}$ is returned to the mobile station for MIB update. The traffic 
generated in this phase is: $C_{geo} = I_{mn} + \beta I_{nb} + \beta R_{bn} + R_{nm}$. 

(2) Position prediction and bandwidth reservation calculation - when the NMS wants to 
predict the mobility of each mobile station, the current position and a few historical 
positions of the MS are required. Since in the CS approach no semantic compression 
is allowed to be performed, every request for the position information (current or 
historical) of mobile stations will generate an inquiry $I_{nm}$ to the mobile station to search 
its MIB. The mobile station then sends back the results $R_{mn}$ one by one to the NMS. 
Finally, the NMS can use this information to do the prediction and calculate the 
bandwidth that needs to be reserved in neighboring cells. 

So in this phase, the traffic generated by CS mode management is: $C_{pos} = \alpha I_{nm} + \alpha R_{mn}$, 
where $\alpha$ is the total number of new updates (recent positions and current position in the MIB 
of the mobile station) required for prediction. Note that the prediction and reservation interval 
may be different from the position calculation/update interval and, as a result, it is possible 
that some recent positions (stored in the MIB of the mobile node) along with the current 
one may have to be sent to the NMS as required by the bandwidth reservation algorithm. 

Assuming that the average call life time is $T_l$ and the geolocation/prediction update 
interval is $t_u$, then in the CS mode the total control traffic generated for the resource 
management is: 

$$C(cs) = (C_{geo} + C_{pos}) \frac{T_l}{t_u} = (I_{mn} + \beta I_{nb} + \beta R_{bn} + R_{nm} + \alpha I_{nm} + \alpha R_{mn}) \frac{T_l}{t_u}$$

Similarly, the various steps involved in the mobile agent-based approach and the corre- 
sponding control messages involved can be summarized as follows: 

• Initial phase - agent(s) transport. When a mobile station is initialized, or whenever the 
mobile station enters a new network domain that supports a different geolocation method or 
bandwidth calculation/reservation method, the resource management agents (including 
geolocation agent $A_{geo}$, bandwidth reservation agent $A_{bwrsv}$ and call admission control 
agent $A_{cac}$) will be downloaded from the NMS. The total traffic introduced in this initial 
phase is equal to: $A_{geo} + A_{bwrsv} + A_{cac}$. 

• Phase 1 - geolocation phase. Since the geolocation module is included in the agent and 
is executed locally, no traffic is generated in the network. The geolocation results can be 
written into the MIB locally. 

• Phase 2 - prediction phase. Since the prediction module has already been included in 
the agent code, the prediction can be made locally without generating any traffic in the 
network. 

• Phase 3 - bandwidth reservation. The bandwidth reservation agent calculates the bandwidth 
needed to be reserved in a certain cell and sends the reservation result $R_{mb}$ via the 
message agents to the BS. 

Therefore, after the initialization phase of agent download, the total communications control 
traffic generated for resource management purposes is only: $C(ma) = T_l R_{mb}/t_u$. We should 
note here that, once the agents are embedded (downloaded) in the mobile station, unless a 
new resource management strategy is needed (when a management system is upgraded or 
the mobile station travels to a new foreign domain), we do not need to transport the code of 
the agent again. Since all the involved agents are, in general, very small in size (up to a few 
kbytes), except for the graphical interface which, however, can vary from implementation 
to implementation and in any case does not need to migrate, it is expected that the migration 
times (e.g. message agents) and corresponding consumed energy are very low as well. Based 
on the above discussion and arguing that the size of $R_{mb}$ is of the same order as that of message 
$R_{mn}$, we can easily conclude that, after the initialization phase, the control traffic generated 
between the mobile stations and base stations is reduced significantly by taking advantage 
of the mobile agent technology. 
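
The comparison above can be made concrete with a small numerical sketch. It assumes that the per-update cost of the CS mode is multiplied by the number of updates during a call (call life time divided by update interval), in the same way as in the expression for the agent-based mode. The function names and all message sizes below are hypothetical and chosen only for illustration.

```python
def cs_traffic(i_mn, i_nb, r_bn, r_nm, i_nm, r_mn,
               beta, alpha, call_life, update_interval):
    """Control traffic of the client-server mode over one call."""
    # Geolocation: one request to the NMS, beta requests to base stations,
    # beta measurement reports and one position result.
    c_geo = i_mn + beta * i_nb + beta * r_bn + r_nm
    # Prediction: alpha inquiries to the MIB and alpha replies.
    c_pos = alpha * i_nm + alpha * r_mn
    # Assumption: this cost is paid once per update interval.
    return (c_geo + c_pos) * call_life / update_interval


def ma_traffic(r_mb, call_life, update_interval):
    """Control traffic of the mobile agent mode after agent download."""
    return r_mb * call_life / update_interval


MSG = 100  # hypothetical size of every request/response, in bytes
cs = cs_traffic(MSG, MSG, MSG, MSG, MSG, MSG,
                beta=3, alpha=3, call_life=240.0, update_interval=10.0)
ma = ma_traffic(MSG, call_life=240.0, update_interval=10.0)
agent_download = 3 * 2000  # hypothetical one-off cost of three 2-kbyte agents
```

With these illustrative numbers the CS mode exchanges 33 600 bytes per call, against 2400 bytes for the agent mode plus a one-off 6000 bytes of agent download, so the download is amortized within the first call.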

Security is another crucial aspect of agent systems. An adequate security model is required 
in order to ensure a high level of protection for the agents and the nodes, so that nodes 
can be protected from the attacks of unsafe agents, and the agents can run on the nodes of 
the network without being damaged [44]. These issues are discussed in Chapter 15. 

12.3.1 Advanced resource management system 

In this section we discuss performance of an advanced resource management system (arm) 
using: 

(1) The predictive mobility-based bandwidth reservation scheme (PMBBR), as described 
in Chapter 11; and 

(2) QoS management with flexible bandwidth sharing (FBS). 

For the system with two classes of traffic, FBS operates as follows: 

• Class 1 (higher priority) - the desired bandwidth for this kind of traffic is $BW_1^u$. If class 1 
traffic cannot obtain the desired bandwidth, it may have the option to continue at a lower 
bandwidth requirement $BW_1^l$. For instance, this can be achieved by adjusting the coding 
rate so that the video/audio quality is still acceptable (i.e., real-time traffic). 

• Class 2 (lower priority) - the desired bandwidth for this kind of traffic is $BW_2^u$ and there 
are no strict QoS requirements. However, some flexible QoS requirements are defined 
for such a service. The user could specify a set of acceptable QoS levels that correspond 
to bandwidth requirements that range from a lower bound bandwidth requirement $BW_2^l$ 
to a maximum bandwidth requirement $BW_2^u$ and expect a QoS varying in the specified 
range (i.e. nonreal-time traffic). 

Table 12.2 Simulation parameters 

Parameter       Value      Description 
$BW_{total}$    1000 kbs   Total bandwidth capacity of a cell 
$BW_1^u$        30 kbs     Desired bandwidth requirement for class 1 users 
$BW_1^l$        25 kbs     Lower bound bandwidth requirement for class 1 users 
$BW_2^u$        50 kbs     Desired bandwidth requirement for class 2 users 
$BW_2^l$        5 kbs      Lower bound bandwidth requirement for class 2 users 

Such an arm system is compared with the corresponding results of a conventional system 
where the fixed bandwidth reservation is implemented, in terms of achievable new call ($P_{nb}$) 
and handoff ($P_{hb}$) call blocking probabilities. Instead of two or more classes of users, the 
concept of flexible bandwidth sharing will be extended in 4G to different systems or even 
different operators. The wireless network used in this study is composed of 37 hexagonal 
cells. The cell radius is set to be 1000 m. In order to approximate the performance of a large 
cellular system, the cells are wrapped around to eliminate the border effect. The arrival of 
new calls initiated in each cell forms a Poisson process with rate $\lambda$. The life time of each 
call is exponentially distributed with mean 240 s. 
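
The traffic model just described (Poisson arrivals, exponentially distributed call life times) can be sketched as follows. This is only an illustration of the stochastic model, not the simulator used in the study; the function name, the arrival rate of 0.1 calls/s and the one-hour horizon are assumptions made for the example.

```python
import random


def generate_calls(arrival_rate, mean_lifetime, horizon, rng):
    """Return (start, end) times of calls that arrive in [0, horizon).

    Inter-arrival times are exponential with mean 1/arrival_rate, which
    yields a Poisson arrival process; call life times are exponential
    with the given mean.
    """
    calls = []
    t = rng.expovariate(arrival_rate)
    while t < horizon:
        duration = rng.expovariate(1.0 / mean_lifetime)
        calls.append((t, t + duration))
        t += rng.expovariate(arrival_rate)
    return calls


rng = random.Random(1)  # fixed seed so that a run can be repeated
calls = generate_calls(arrival_rate=0.1, mean_lifetime=240.0,
                       horizon=3600.0, rng=rng)

# Offered load per cell in Erlangs: arrival rate times mean call life time.
offered_load = 0.1 * 240.0
```

For an arrival rate of 0.1 calls/s and the 240 s mean life time, the offered load is 24 Erlangs per cell.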

Additional system and traffic parameters are summarized in Table 12.2 [122]. It should 
be noted here that the arm framework aims to suggest a general approach that supports 
the seamless operation and provides flexibility for the resource management in the next- 
generation wireless networks that support multimedia services. The bandwidth values for 
the different classes of service were chosen for illustration purposes. New voice coding 
technologies and emerging data applications may bring different bandwidth requirements 
to the wireless networks, which, however, can be easily fit into the framework. Throughout 
the simulation study, it was assumed that the desired bandwidth requirement for class 1 
(e.g. voice) users is 30 kbs and that for class 2 (data) users is 50 kbs. 

The mobility model used in this study is as follows: when a new call is initiated, the 
corresponding MS is assigned a random initial position inside the cell, a random 
moving direction and an initial moving speed which is chosen according to a uniform 
distribution in the interval $[0, V_{max}]$ mile/h. The speed ($v$) and direction ($\theta$) are updated to 
$v'$ and $\theta'$ every time interval $\Delta t$ according to the following model: 

$$v' = \begin{cases} \min\{\max[v + \Delta v, 0], V_{max}\}, & p < 0.9 \\ 0, & \text{otherwise} \end{cases}$$

$$\theta' = \theta + \Delta\theta$$

where $\Delta v$ models the acceleration/deceleration of the mobile user and is a uniformly 
distributed random variable over the interval [-5 mile/h, 5 mile/h], $\Delta\theta$ characterizes the 
user's change in moving direction and is a uniformly distributed variable over the interval 
$[-\Delta\theta_{max}, \Delta\theta_{max}]$ and $p$ is a uniformly distributed random variable over the interval 
[0, 1]. The use of variable $p$ allows us to simulate the situation where a mobile user 
may stop occasionally during the course of moving. Based on this mobility model, two 
mobility patterns are considered in this study: high-speed pattern and low-speed pattern. 
In the high-speed pattern, the mobility parameters are $V_{max}$ = 60 mile/h, $\Delta\theta_{max} = \pi/4$, 
which correspond to highly directional, fast-moving traffic (e.g. highway traffic); for the 
low-speed pattern, $V_{max}$ = 30 mile/h, $\Delta\theta_{max} = \pi/2$, which corresponds to less directional, 
slow-moving traffic (e.g. downtown traffic). The mobility update interval ($\Delta t$) is chosen to 
be 10 s. 

Numerical results are presented for two different traffic profiles (TP) regarding the 
composition of class 1 and class 2 traffic. In profile TP1, 10% of the new call attempts are 
class 2 calls, while, in profile TP2, 50% of the new calls are class 2 calls. Note that in the 
proposed CAC and resource reconfiguration scheme only class 2 calls can lend bandwidth 
to handoff calls and, therefore, the number of ongoing class 2 calls in a cell will influence 
the performance of the system. 
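
A minimal sketch of the mobility update and of the class selection implied by the traffic profiles is given below. It follows the update rule stated above (speed clipped to the allowed range with probability 0.9, zero otherwise, and a uniformly distributed change of direction). The function names, the conversion from mile/h to metres and the fixed random seed are assumptions introduced for the example.

```python
import math
import random

MILE_PER_HOUR_IN_M_PER_S = 1609.344 / 3600.0


def update_motion(v, theta, v_max, dtheta_max, rng, dv_max=5.0, p_move=0.9):
    """One mobility update: returns the new speed (mile/h) and direction (rad)."""
    p = rng.random()
    if p < p_move:
        dv = rng.uniform(-dv_max, dv_max)
        v_new = min(max(v + dv, 0.0), v_max)
    else:
        v_new = 0.0  # the user stops for this interval
    dtheta = rng.uniform(-dtheta_max, dtheta_max)
    return v_new, theta + dtheta


def advance(x, y, v, theta, dt):
    """Move the user for dt seconds at speed v (mile/h); positions in metres."""
    distance = v * MILE_PER_HOUR_IN_M_PER_S * dt
    return x + distance * math.cos(theta), y + distance * math.sin(theta)


def draw_call_class(rng, class2_fraction):
    """Return 2 with probability class2_fraction (0.1 for TP1, 0.5 for TP2)."""
    return 2 if rng.random() < class2_fraction else 1


# High-speed pattern: V_max = 60 mile/h, max direction change pi/4, dt = 10 s.
rng = random.Random(0)
v, theta = rng.uniform(0.0, 60.0), rng.uniform(0.0, 2.0 * math.pi)
x, y = 0.0, 0.0
speeds = []
for _ in range(24):  # 24 updates cover one mean call life time of 240 s
    v, theta = update_motion(v, theta, 60.0, math.pi / 4, rng)
    x, y = advance(x, y, v, theta, 10.0)
    speeds.append(v)
```

Passing 30.0 and `math.pi / 2` instead gives the low-speed pattern.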

Figure 12.7 compares the handoff call and new call blocking probabilities of the arm 
system with the corresponding results of the conventional system vs $\lambda$, under TP1, with users 
moving in the high-speed pattern. In the figures, 'c1' and 'c2' stand for class 1 and class 
2 users, respectively. The conventional system uses the fixed bandwidth reservation, where 
the reservation value represents a fixed percentage of the total capacity of the cell. In the 
following study, the corresponding reservation value of the conventional system for class 1 
users is $BW_{res1}$ = 30 kbs and for class 2 users is $BW_{res2}$ = 50 kbs. The reservation values are 
selected based on experimentation with the objective of keeping similar handoff blocking 
probability for both the arm system and the conventional system. The call admission control 
procedure for a conventional system is the same except that the bandwidth reconfiguration 
is not used. From the figure, we observe that, for the given parameters, the arm system and 
the conventional system have similar handoff call blocking probabilities. 

Figure 12.7 Call blocking probabilities for high mobility pattern under traffic profile TP1. 
(a) Call blocking probabilities for class 1 users. (b) Call blocking probabilities 
for class 2 users. 

Table 12.3 Average bandwidth used by each class 2 user 

$\lambda$             0.05   0.06   0.07   0.08   0.09   0.10   0.11 
arm (kbs)             50.00  49.60  49.00  48.10  46.85  45.56  42.50 
Conventional (kbs)    50.00  50.00  49.90  49.77  49.75  49.67  49.42 

However, the arm system can significantly decrease the new call blocking probabilities 
for both user classes, which demonstrates that it can admit more users than the conventional 
system while still guaranteeing the same level of QoS for handoff calls. One should bear in 
mind that the connection level QoS improvement achieved by the arm system is obtained 
at the cost of slightly decreasing the bandwidth actually used by class 2 users. Table 12.3 
lists the average bandwidth used by each class 2 user. 

Figure 12.8 presents the corresponding numerical results for TP2 where 50% of the new 
calls belong to class 2 traffic. Under this traffic configuration, the arm system can achieve 
an even better performance improvement. 

Figure 12.9 shows the corresponding blocking probabilities that can be achieved by the 
arm system for different mobility patterns (high vs low) under TP1. From the figure, we 
observe that the arm scheme is capable of achieving good performance even when the users 
move in the low-speed pattern. The arm scheme achieves very low handoff failure rates for 
both user classes, which is similar to the results obtained under the high-speed mobility 
pattern. 

Figure 12.8 Call blocking probabilities with high mobility pattern under TP2. (a) Call 
blocking probabilities for class 1 users. (b) Call blocking probabilities for 
class 2 users. 

Figure 12.9 Call blocking probabilities in arm system for different mobility patterns 
under TP1. (a) Call blocking probabilities for class 1 users. (b) Call blocking 
probabilities for class 2 users. 

12.4 CDMA CELLULAR MULTIMEDIA WIRELESS NETWORKS 

In cellular segments of wireless IP networks, there is a need for simple radio resource and 
teletraffic management solutions that allow operators to customize various network and 
user profiles and to optimize the grade of service (GoS) while fulfilling the QoS requirements. 
In such solutions, CAC for real-time (RT) services and packet access control for nonreal-time 
(NRT) services play key roles. This section 

(1) presents a comparative study of CAC policies in the uplink (UL) of WCDMA cellular 
networks used as segments of the wireless IP networks; 

(2) introduces a simple, effective and robust soft-decision CAC policy that exploits the 
UL interference distributions to compensate for fluctuations of the local average SIRs 
dominating QoS while expanding the call-admissible region to maximize the system 
capacity; the method requires neither complex measurements nor mutual exchange 
of information between adjacent CDMA cells about the state of the network; it 
is therefore especially applicable in wireless IP networks consisting of a number of 
segments or domains, where the simplicity and the scalability requirements favor 
localized radio resource control; 

(3) introduces multiple call-blocking thresholds or load-based fracturing factors for 
different types of calls, services and subscription classes to provide operators a 
flexible tool for QoS differentiation and GoS management; 

(4) presents a reliable dimensioning of the free capacity for NRT packet-data access and 
estimation of the upper-limit UL data throughput for different packet-lengths and 
data-rates based on asymptotic and quasi-stationary analysis of the RT traffic; 

(5) introduces a dynamic feedback packet access control scheme that adapts to the UL 
free capacity left by the RT traffic with optimal throughput-delay tradeoff; 

(6) provides a simple and accurate analytical method for evaluating the teletraffic 
performance of the system with different CAC policies and QoS differentiation, taking 
into account priority handoffs and user mobility equilibrium. 

The results are important for optimizing the overall Erlang capacity and throughputs with 
differentiated services in WCDMA segments or domains of wireless IP networks for 
multimedia applications. 

A variety of emerging technologies and protocol enhancements has been investigated 
and designed to provide IP-based multimedia services to mobile users. These research and 
standardization efforts are summarized in References [45-47]. By using the radio resource 
management framework, in this section we address some simple solutions for future network 
operators to customize and optimize the performance of their multimedia networks serving 
various user and network profiles. 

The traffic-engineering framework for wireless multimedia networks must be simple and 
flexible for supporting a variety of services with QoS differentiation to satisfy a wide range 
of customer demands and to maximize revenues. The operators need to have an in-hand 
tool to manage their network operation, much like the airlines' customer-service 
systems, where passengers are categorized into classes receiving different QoS on 
any required routes, but all share the expensive, limited space of the plane. The limitation of 
extremely expensive radio resources is also the bottleneck of future wireless networks. The 
key in providing the QoS for users, while efficiently sharing radio resources to optimize the 
GoS for networks, is the access control at the air interface acting as ticket-selling and check- 
in processes. The access control herein includes CAC for the RT connection-oriented service 
domain and packet access control for the NRT connectionless service domain. Hence, by 
using intelligent access control strategies, we can have an effective solution to the two- 
fold objective: (1) to optimize the network utility; and (2) to provide operators freedom in 
controlling pay-load and group behavior of traffic classes regarding required services as 
well as subscriber potential classes, e.g. gold, silver and bronze. 

To date, a significant number of papers and standards dealing with various issues of 
CDMA wireless networks have been published. Major effort has been devoted to studying 
detailed features and specific algorithms, e.g. receivers, power control, handoff, etc. 
Investigations of the CDMA system capacity, radio resource and teletraffic management have 
been mostly based on single-service 'worst case' scenarios, or high-cost simulations [50]. In 
more sophisticated scenarios, the simplicity requirement is usually ignored despite the fact 
that only this condition can ensure system efficiency and guarantee fast response to mul- 
tiple QoS requirements from the users [52]. Examples are References [44, 47, 59], where 
valuable results are presented for measurement-based adaptive admission control and band- 
width management schemes. However, it has been pointed out, e.g. in Reference [47], that 
the price of better performance offered by these schemes is an increased complexity of the 
implementations in both hardware and software. The results on CAC and packet access 
control in CDMA wireless networks are summarized extensively in References [49, 50, 51, 
52, 54]. 

This section emphasizes practical operating aspects of multimedia CDMA mobile cel- 
lular networks, used as segments of the future wireless IP, and the need for simple and 
effective radio resource and traffic handling mechanisms. This is investigated in contexts of 
the access control, which is mainly viewed as an optimal interference management problem 
in CDMA systems. Various CAC policies are considered simultaneously and an effective 
soft-decision mechanism, namely SCAC, is presented. The motivation behind SCAC is to 
use probabilistic functions, adapted to inter-cell interference distributions, for making deci- 
sion upon receipt of a call request in order to optimize the radio resource utilization (RRU) of 
interference-limited CDMA systems, while keeping overall communication quality under 
acceptable outage levels. This method requires neither complex measurements nor mutual 
exchange of information between the adjacent cells about the state of the network. Thus, 
the complexity and the cost of implementations can be significantly reduced. This is of im- 
portance in the wireless IP networks where a number of segments or domains are involved. 
The scalability also favors distributed or localized control of radio resources. The normal- 
ized load factors, which essentially characterize spread spectrum CDMA transmissions, are 
used to represent the resource consumption, similarly to the well-known effective band- 
width concepts [50, 68]. Because attributes of the required service and adequate transmit 
power control (TPC) schemes can provide and ensure a priori knowledge and stationary 
behavior of the resource consumption [41-44, 46, 53], SCAC can be promising in provid- 
ing good multiplexing gain and robustness. Multiple call-blocking thresholds or load-based 
fracturing factors are introduced that give operators a simple and flexible means of differ- 
entiating and/or prioritizing different types of calls (e.g. new call and handoff call), services 
and/or subscription classes for desired serving customs. The performance of CAC policies 
is evaluated with respect to the following: 

(1) effectiveness represented by tradeoffs between the simplicity of implementation and the 
ability to autoconfigure for various user QoS and network GoS profiles in order to 
maximize system capacity or revenues; performance is measured by the probabilities 
of call blocking and handoff failure or some additional cost functions; 

(2) reliability measured by the probability of losing communication quality during ser- 
vices, which is also known as the outage probability when the received interference 
exceeds the tolerable level; 

(3) robustness seen against a need for redesign due to uncertainty and changes of the 
system parameters. 

With respect to controlling NRT packet radio access in the UL, using the free capacity left by 
the RT traffic, this section provides 

(1) a precise dimensioning of the UL radio resources available for packet access; 

(2) estimates of the average upper-limit UL data throughput for different bit-rates and 
packet-lengths that can be used for tradeoff of the design parameters for packet 
access control schemes; 

(3) the dynamic feedback information-based multiple access scheme (DFIMA), which 
is suitable for QoS differentiation in background services. 

These issues are increasingly important for the efficiency of the RRU in serving mobile 
messaging services. Examples are short message service or wireless e-mail, which have 
been taking the consumer market by storm recently. 

The analytical models are presented based on modified product-form loss network models 
and results of the stochastic knapsack problem [64-67]. The priority handoff and the cell- 
condition of user mobility equilibrium are considered. The UL dynamic resources and 
upper-limit throughput for NRT packet transmissions are estimated using asymptotic and 
quasi-stationary analysis of the RT traffic. This provides a reliable and flexible tool for 
designers and operators in teletraffic engineering of the networks. Only the UL direction 
is chosen to present the results. The traffic asymmetry between UL and DL is therefore 
not taken into consideration. The RT services are of CBR classes, whereas the NRT packet 
services are of background classes having a lower priority that can tolerate moderate delays. 
VBR and ABR interactive services and pre-emption in QoS differentiation are not 
considered. Despite these limitations, such a practical and comprehensive system model 
and analysis for radio resource and teletraffic management of multimedia CDMA cellular 
networks, with discussion of simple and effective SCAC and DFIMA, should be of interest 
to system designers. Moreover, this section lays the groundwork for investigating more 
advanced control mechanisms as well as more sophisticated traffic models for future wireless 
IP networks. 



12.4.1 Principles of SCAC 

An overview of publications on QoS and resource management including CAC techniques 
in 3G multimedia networks can be found in References [52, 54]. In general, CAC policies 
can be divided into two categories: static and dynamic [59]. From the implementation 
point of view, these can be further divided into modeling-based and measurement-based 
policies [57]. Under the static CAC category, a call request is accepted only when sufficient 
resources are available to meet its required QoS and to maintain the QoS of ongoing calls. 
The admission controller must decide which resource combinations can be accepted into 
the network. In FDMA/TDMA cellular systems, the fixed channel assignment-based policy 
[55] is often adopted because of its simplicity and capacity maximization in line with a fixed 
threshold of the number of channels (cell-capacity). In CDMA cellular systems, such a fixed 
threshold is hard to determine efficiently because the qualitative and quantitative features 
of CDMA systems appear to be heavily inter-twined and mutually supportive. Under the 
dynamic CAC category, a call request can be rejected even if a set of new calls may meet 
their QoS requirements. On the other hand, a new call can be accepted into the network 
even if the instantaneous QoS may be violated, but when averaged over states, the service 
quality is met. 

Therefore, the network utility can be increased and various cost-effective and traffic- 
shaping policies can be adopted. The reinforcement learning-based CAC, described by a 
semi-Markov decision process, is believed to be a good solution for a class of adaptive 
policies [56, 59]. Later in this section, a much simpler and more effective dynamic SCAC 
is presented by exploiting the soft-capacity feature of CDMA radios. 
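
For contrast with the soft-decision approach, the static policy described above can be sketched as a fixed-threshold test on the carried load. This is not SCAC, only a baseline illustration. It assumes that each class consumes a fixed share of a normalized cell capacity; the per-class shares and the capacity are the mean values listed in Table 12.4, while the function names and the stricter 90 % threshold for new calls are hypothetical.

```python
def carried_load(occupancy, load):
    """Normalized load of the calls in progress: sum of n_k * w_k."""
    return sum(n * w for n, w in zip(occupancy, load))


def admit(occupancy, load, call_class, threshold):
    """Static CAC: accept only if the load after admission stays within threshold."""
    return carried_load(occupancy, load) + load[call_class] <= threshold


LOAD = [0.0050, 0.0304, 0.0561]   # mean load per connection, classes 1 to 3
CAPACITY = 0.6429                 # mean normalized cell capacity
NEW_CALL_FRACTION = 0.9           # hypothetical stricter threshold for new calls

occupancy = [100, 2, 1]           # calls in progress per class
handoff_ok = admit(occupancy, LOAD, 0, CAPACITY)
new_call_ok = admit(occupancy, LOAD, 0, NEW_CALL_FRACTION * CAPACITY)
```

In this state the carried load is about 0.617, so a class 1 handoff call is accepted against the full threshold while a class 1 new call is blocked by the stricter one, which is the kind of prioritization that multiple thresholds provide.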

Consider the UL of a cell in a CDMA cellular system consisting of multiple, identical 
and independent cells using omni-directional antennas in BS. This network is supporting 
M independent connection-oriented RT service classes, each with a constant bit-rate and a 
target value of signal-to-interference ratio (SIR) for meeting its QoS requirements. 
Reference [53] describes the spectral efficiency of an interference-limited CDMA air-interface 
in terms of load factors. 

The starting point in this concept is the target signal-to-noise ratio for a given user $k$, 
defined as 

$$\left(\frac{E_b}{N_0}\right)_k = \frac{G_k P_k}{\rho_k (I_{tot} - P_k)} = \frac{W P_k}{\rho_k r_k (I_{tot} - P_k)} = \gamma_{target,k} = \gamma_k$$

where $\rho_k$ is the voice activity factor, $r_k$ the data rate, $P_k$ the received power, $W$ the signal 
bandwidth, $G_k = W/r_k$ the processing gain and $I_{tot}$ includes overall interference, from its own 
cell, other cells and noise. The previous relation can also be represented as 

$$P_k = \frac{I_{tot}}{1 + \dfrac{W}{\gamma_k r_k \rho_k}} = w_k I_{tot} \qquad (12.1)$$

where $w_k$ is referred to as the load factor. 

Let $K = \{1, 2, \ldots, M\}$. Define: $n = (n_k, k \in K)$ is the occupancy vector, where $n_k$ 
is the number of class-$k$ calls in progress; $w = (w_k, k \in K)$ is the load vector, where $w_k$ 
is the average load factor of a class-$k$ connection. For all $k \in K$, let $r_k$ be the bit-rate, 
$\gamma_{target,k}$ the target SIR and $\rho_k$ the activity factor of class-$k$ data-sources. The most common 
way to define the average capacity of a cell in CDMA cellular systems is based on the load equation 
in the boundary condition of the cell tolerable (required) interference level, $I_{0req}$. Let $C_a$ be 
an average capacity of a cell. This represents the spectral efficiency and can be expressed 
as follows: 

$$C_a = \left.\frac{I_{own}}{I_{total}}\right|_{I_{total} = I_{0req}} = \sum_{k \in K} n_k w_k \qquad (12.2)$$

where $I_{own}$ is the own-cell interference level, i.e. the received power caused by signals of 
active users in the cell, and $I_{total}$ is the total interference level, i.e. the total received power in 
UL including thermal noise and interference caused by other cells. It is shown in References 
[46, 53] that the mean and the variance of other-cell interference can be approximated by 
own-cell interference multiplied by a constant coefficient $f$. Let $\eta$ be the fraction of the 
thermal noise density with respect to $I_{0req}$. At the boundary, $I_{0req} = (1 + f) I_{own} + \eta I_{0req}$, 
so the left side of Equation (12.2) can be rewritten as [46]: 

$$C_a = \frac{1 - \eta}{1 + f} \qquad (12.3)$$
Table 12.4 summarizes the parameters needed to calculate $w_k$ and $C_a$. 

For example, with $f$ equal to 40 % and $\eta$ equal to 10 %, $C_a$ is around 65 % and $w_k$ 
for a voice-service connection is around 0.5 %, which means the system can serve almost 
$n_k = C_a / w_k = 130$ voice calls simultaneously. It has been shown theoretically and 
by simulations [41-46] that the values of SIR and $f$ can vary significantly due to the 
changes of propagation parameters, TPC inaccuracy, traffic distributions, etc. For instance, 
SIR has been found in References [42, 46] as a log-normal random variable with total 
error standard deviation of the order of 2 dB. The parameter $f$ varies between 25 and 
55 % in a microcellular environment with omnidirectional antennas [53]. The fluctuations 
of local average SIR and other-cell interference distribution have essential impacts on system 
capacity resulting in the 'soft-capacity' feature of CDMA systems. 
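
The numbers in this example can be checked with a short calculation. The sketch assumes that the load factor has the form 1/(1 + W/(SIR × bit-rate × activity)) and that the mean capacity equals (1 − η)/(1 + f), which reproduces the mean values listed in Table 12.4. An activity factor of 1 for classes 2 and 3 is also an assumption, since the table gives an activity factor only for voice. Function names are illustrative.

```python
def db_to_linear(value_db):
    """Convert a ratio given in dB to linear scale."""
    return 10.0 ** (value_db / 10.0)


def load_factor(chip_rate, bit_rate, sir_db, activity):
    """Load factor of one connection: 1 / (1 + W / (gamma * r * rho))."""
    gamma = db_to_linear(sir_db)
    return 1.0 / (1.0 + chip_rate / (gamma * bit_rate * activity))


def mean_capacity(f, eta):
    """Mean normalized cell capacity: (1 - eta) / (1 + f)."""
    return (1.0 - eta) / (1.0 + f)


W = 3.84e6  # WCDMA chip-rate in chips per second

w_voice = load_factor(W, 12.2e3, 6.0, 0.4)        # class 1, EFR voice
w_video = load_factor(W, 64e3, 2.75, 1.0)         # class 2, activity assumed 1
w_multimedia = load_factor(W, 144e3, 2.0, 1.0)    # class 3, activity assumed 1

c_a = mean_capacity(f=0.40, eta=0.10)
n_voice = c_a / w_voice  # voice-only calls that fit in the mean capacity
```

Evaluating the same functions at the ends of the SIR and $f$ ranges gives the lower and upper limits in the table, for example `mean_capacity(0.55, 0.10)` for the lower capacity limit.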

Based on the knowledge of the local average SIR and interference distributions, their bounds,
means and variances, the resource consumption on a per-connection and per-cell basis can be modeled
as follows. For all $k \in \mathcal{K}$, the resource consumption of a class-$k$ connection is represented by the
average load factor of its service class, which is a stationary, independent and bounded
normal-distributed random variable having mean $w_k$ and falling between a max- and a
min-guaranteed resource consumption, $w_{u,k}$ and $w_{l,k}$. Similar to the well-known effective
bandwidth concepts, this can be easily extended for modeling the resource consumption of
a VBR or ABR connection based on extensive reported results in the analysis of, e.g., ATM
networks. The cell capacity is modeled accordingly as a bounded Gaussian random variable
having mean $C_a$ and lying between $C_l$ and $C_u$. The mean and boundary values of
resource consumption and cell capacity can be estimated using Equations (12.1) and (12.3)
together with the mean and boundary values of the local average SIR target and $f$. Further, the
constraints $C_l$ and $C_u$ can be determined in a more relaxed way. For example, $C_u$ can be set up to $(1 - \eta)$
in theory.

Table 12.4 Parameters for microcellular multiservice WCDMA system

| Parameter | Definition | Values |
|---|---|---|
| $W$ | WCDMA chip rate | 3.84 Mchip/s |
| $f$ | Coefficient of the equilibrium other-to-own cell interference | 40 ± 15 % in microcellular |
| $\eta$ | Coefficient of the thermal noise density and $I_{0\text{req}}$ | -10 dB |
| $C_a$ | Mean of stationary system capacity (SSC) | 0.6429 |
| $C_l$ | Lower limit of SSC | 0.5806 |
| $C_u$ | Upper limit of SSC | 0.7200 |
| $\Sigma^2$ | Variance of SSC | 9e-03 |
| $L$ | Cell perimeter | 200 m (microcellular) |
| $A$ | Cell area with hexagon shape | 2.5981 x 10^[illegible] m^2 |
| $E[v]$ | Mean of users' velocity | 1 m/s |
| $r_1$ | Bit-rate for class 1 | 12.2 kb/s (EFR voice) |
| $\gamma_{\text{target-1}}$ | SIR target to meet class 1 QoS (BER) | 6 ± 1 dB for BER = 10 x 10^-3 |
| $p_1$ | Voice activity factor during the call | 0.4 |
| $r_2$ | Bit-rate for class 2 | 64 kb/s (video) |
| $\gamma_{\text{target-2}}$ | SIR target to meet class 2 QoS | 2.75 ± 0.75 dB for BER = 10 x 10^-5 |
| $r_3$ | Bit-rate for class 3 | 144 kb/s (multimedia) |
| $\gamma_{\text{target-3}}$ | SIR target to meet class 3 QoS | 2 ± 0.5 dB for BER = 10 x 10^-5 |
| $w_1$ | Mean of class 1 connection-basis effective load factor (CELF) | 0.0050 |
| $w_{l,1}$ | Lower limit of class 1 CELF | 0.0040 |
| $w_{u,1}$ | Upper limit of class 1 CELF | 0.0063 |
| $\sigma_1^2$ | Variance of class 1 CELF | 2.69 x 10^-6 |
| $w_2$ | Mean of class 2 CELF | 0.0304 |
| $w_{l,2}$ | Lower limit of class 2 CELF | 0.0257 |
| $w_{u,2}$ | Upper limit of class 2 CELF | 0.0360 |
| $\sigma_2^2$ | Variance of class 2 CELF | 3 x 10^-5 |
| $w_3$ | Mean of class 3 CELF | 0.0561 |
| $w_{l,3}$ | Lower limit of class 3 CELF | 0.0503 |
| $w_{u,3}$ | Upper limit of class 3 CELF | 0.0625 |
| $\sigma_3^2$ | Variance of class 3 CELF | 7 x 10^-5 |
| $1/\mu_1$ | Mean call-holding time | 120 s |
| $\xi$ | Empirical factor in $\Theta(c)$ | 0 |

The most straightforward CAC scheme is to share $C_a$ units of the cell resources completely
and statically among calls of different service classes, where a class-$k$ call consumes
$w_k$ units of resources for its service. This leads to a product-form loss network model with
modeling-based static complete-sharing CAC (MdCAC), where a call request for class-$k$
service is accepted immediately if at most $(C_a - w_k)$ units of resources are occupied, and is
otherwise rejected. Such MdCAC in CDMA cellular systems may face the following problems:

(1) the QoS guaranteed scenario results in the 'worst case' of system capacity and RRU; 

(2) there is a need for redesign of the system-capacity constraint C a for the changes of 
CDMA radio channel parameters and traffic statistics; 

(3) the advantages of CDMA techniques, such as the 'soft capacity' feature, cannot be 
exploited. 

The results of recent research have emphasized that robust CAC policies may require 
online measurements [57, 59], resulting in far more complex and high-cost SW/HW imple- 
mentations [47, 49]. 

The principle of measurement-based CAC (MsCAC) is that a new call is accepted im- 
mediately into the system if at least the required resource consumption of its service class 
is available, otherwise rejected. The decision is made based on online measurements of 
the related statistics. Analysis and simulation results show that, when the UL traffic is less 
bursty, the measurement-based CAC has no capacity-gain over the modeling-based CAC 
described above. It may even suffer degradation due to measurement errors, which agrees 
with References [49, 57, 58]. 

To harmonize the advantages and overcome the problems of MdCAC and MsCAC with
a simple and effective SCAC policy, let us first reconsider Equations (12.2) and (12.3) in a
slightly different way. Owing to the interference-limited nature of CDMA cellular networks,
the active-user combinations forming $\mathbf{n}$ in the cell shall meet their QoS constraints if:

$$c_{\text{other}} \le (1 - \eta) - c_{\text{own}} \qquad (12.4)$$

where $c_{\text{other}}$ represents the load factor produced by transforming the other-cell interference;
$c_{\text{own}} = \mathbf{n}\mathbf{w}$ represents the load factor generated by the active-user combinations forming $\mathbf{n}$ in the
cell, $\mathbf{n}$ is the occupancy vector and $\mathbf{w}$ is the load vector defined above. Because it has been
shown that $c_{\text{other}}$ is well modeled with a Gaussian random variable [41-43, 46], we have:

$$\Pr\{c_{\text{other}} \le 1 - \eta - c_{\text{own}}\} = 1 - Q\left(\frac{(1 - \eta - c_{\text{own}}) - E[c_{\text{other}}]}{\sqrt{\mathrm{Var}[c_{\text{other}}]}}\right) \qquad (12.5)$$

where $Q(x)$ is the standard normal integral function and

$$E[c_{\text{other}}] = fE[c_{\text{own}}] \qquad (12.6)$$

$$\mathrm{Var}[c_{\text{other}}] = f\,\mathrm{Var}[c_{\text{own}}] \qquad (12.7)$$



The above relations imply that, as UL traffic increases beyond the average capacity, whether
users experience a degradation of the communication quality depends on the other-cell
interference level. Therefore, an optimal way to benefit from this soft-capacity feature
is to use a dynamic CAC mechanism whose decision functions are adapted to the interference
distributions, compensating for the fluctuation of the local average SIR while expanding the
admissible region. SCAC is such a solution and works as follows. Instead of using an
average cell capacity $C_a$, it uses an upper and a lower limit denoted by $C_u$ and $C_l$, respectively.
This improves the system capacity and at the same time enhances robustness. In principle,
the mechanism of SCAC is defined as follows:

(1) a new call of class-$k$ shall be admitted immediately into the system if the current UL
load factor is at most $(C_l - w_k)$;

(2) else if the current own-cell UL load factor is at most $(C_u - w_k)$, a new call of class-$k$
shall be admitted with a permission probability based on Equation (12.5), which is
given in detail later;

(3) otherwise, a new call of class-$k$ shall be rejected immediately;

(4) nonpre-emptive priority is given to handoff calls, i.e. handoff failure of class-$k$ calls
occurs only when more than $(C_u - w_k)$ resources have been occupied already.

The SCAC policy is expected to offer advantages over both MdCAC and MsCAC policies,
i.e. simplicity, robustness and improved capacity utilization. Soft decisions can also be combined
with measurement-based techniques to reduce the complexity of estimators, to compensate
for bias and to enhance performance. The implementation issues will be addressed later.
Moreover, one can notice that SCAC provides better traffic shaping gain than MdCAC and
MsCAC. It gives calls of high resource-consuming classes more chance to access the
system when the traffic intensity of lower classes is heavy, which cannot be achieved with the
other two.

12.4.2 QoS differentiation paradigms 

To regulate the operation of multimedia systems in the way mentioned above, the access 
control has to consider not only characteristics of requested services but also subscription 
profiles of user classes. This will provide fair decisions in resource allocation for agreed 
QoS upon accommodating a new call request. For example, a gold-class customer request 
for any services shall be served immediately as long as there are enough network resources 
for that. On the other hand, operators might reject all bronze-class requests as well as 
resource-consuming requests from silver-class if the current load has already exceeded a 
certain level, and so forth. However, once a call request is accepted, QoS in terms of BER for 
that RT connection is assured independently of its associated user class. QoS differentiation 
paradigms for CBR services specify such serving traffic patterns depending on offered traffic 
intensity distributions among classes during different busy periods of the day for shaping 
traffic and maximizing revenues. These therefore need to be simple and easy to reconfigure
and extend for a variety of user and network profile characteristics. QoS differentiation can
be viewed as traffic prioritization in CAC. Herein, we use multiple thresholds or load-based
fracturing factors for hard or softer blocking of call requests. The methods are similar to
the well-known threshold dropping or weighted fair queueing mechanisms, but no actual
queues are allowed in this system. Thus, system resources are shared noncompletely among
RT traffic classes in a blocked-call-cleared fashion. Define $c = \mathbf{n}\mathbf{w}$ as the stationary system
load state, which is identical to $c_{\text{own}}$ above. Suppose there are $J$ different user classes sorted
in decreasing order of priority, e.g. 1 is gold, 2 is silver, etc. Let $\mathcal{J} = \{1, 2, \dots, J\}$.
Taking into account $M$ different service classes, which are sorted in increasing order
of resource consumption, we introduce a $J \times M$ table of prioritized admission probabilities
given in matrix form as follows:

$$A = [a_{jk}(c)], \quad j \in \mathcal{J},\ k \in \mathcal{K},\ 0 \le a_{jk}(c) \le 1 \qquad (12.8)$$

where $a_{jk}(c)$ is the admission probability of a call request for class-$k$ service from a class-$j$
user, which depends on the system load state and needs further modification when combined
with the SCAC described above. However, it can be considered that there are $J \times M$ prioritized
traffic classes, which form group behaviors according to user class or service class.

Note that hereafter we use the term 'traffic class' or class-$(j, k)$ when QoS differentiation
is applied, to distinguish it from service class-$k$ alone in complete-sharing scenarios. In the hard
threshold blocking case, in the most straightforward form, $J \times M$ thresholds may be needed, each
corresponding to a traffic class. Define a threshold table given in matrix form as follows:

$$L = [l_{jk}], \quad j \in \mathcal{J},\ k \in \mathcal{K},\ l_{jk} > l_{uv} \text{ if } j > u \text{ and } k > v$$

Thus, not taking the effects of SCAC into account, $a_{jk}(c)$ can be determined as:

The number of thresholds needed is significantly reduced if only group behavior levels are
of interest. That is, $J$ or $M$ thresholds may be needed instead of $J \times M$. For example, the
first-priority user class has a single threshold up to the upper bound $C_u$ of the system capacity
regardless of the required services, whereas the lowest-priority user class has a threshold at the lower
bound $C_l$ of the system capacity for any services. Access for the lowest-priority user class
would not be granted for services $k$ to $M$ if the system load state exceeded $l_{jk} \le C_l$. In the fractional
(soft) blocking case, $a_{jk}(c)$ may not take a hard value of 1 or 0, but a value in between that depends on the
system load state and the priority of the traffic class, resulting in load-based fracturing factors for
each traffic class or group behavior. As a simple example, let us say a call request from a class-$j$
user for class-$k$ service is admitted immediately into the system if the load state after admission does
not exceed the threshold; else it is accepted with a probability of 0.8 if the system load
state after admission does not exceed the lower bound $C_l$ of the system capacity; else it is accepted
with a probability of 0.2 as long as there are enough resources left for it. Generally, this
QoS differentiation paradigm on the one hand gives operators better flexibility to customize
and tune their network performance; on the other hand it allows unified analysis of a class
of guard resource schemes for QoS differentiation and CAC. The hard threshold blocking
case described previously is in fact a special case of this fractional rule.
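The hard and fractional blocking rules described above can be sketched as two small functions. This is a minimal illustration, not the operator-configurable table itself: the function names, the single scalar threshold per traffic class, and the reading of "enough resources left" as "load after admission does not exceed $C_u$" are assumptions made for the example; the probabilities 0.8 and 0.2 are the ones used in the text.

```python
def hard_admission(load, w_k, threshold):
    """Hard threshold blocking: admit only if the load after admission
    stays within the threshold of the traffic class."""
    return 1.0 if load + w_k <= threshold else 0.0

def fractional_admission(load, w_k, threshold, c_lower, c_upper):
    """Fractional (soft) blocking following the worked example in the text.

    Assumption: 'enough resources left' is taken to mean that the load
    after admission does not exceed the upper capacity bound c_upper.
    """
    after = load + w_k
    if after <= threshold:
        return 1.0   # admitted immediately
    if after <= c_lower:
        return 0.8   # soft blocking below the lower capacity bound
    if after <= c_upper:
        return 0.2   # heavier blocking between the two bounds
    return 0.0       # no resources left

# Hypothetical threshold 0.45 for a class-2 (video) request, w_2 = 0.0304.
for load in (0.40, 0.50, 0.60, 0.70):
    print(load, fractional_admission(load, 0.0304, 0.45, 0.5806, 0.7200))
```

Setting the two intermediate probabilities to 0 and the threshold to the class limit recovers the hard threshold rule, which is why the text treats hard blocking as a special case.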

Up to this point, handoff calls have been assumed to have either the highest priority
regardless of their associated traffic classes or the same priority as new calls of their
associated traffic classes. If this is not the case, prioritization of handoff calls can be handled similarly
to new calls.

QoS differentiation of the background NRT packet-switched services can be done not
only in a blocking or dropping fashion, but also by granting throughput-delay on a per-connection
basis. For instance, the transmissions of certain customers should be guaranteed a higher
data rate and lower delay. Detailed paradigms and mechanisms are discussed later.

12.4.3 Traffic model 

The traffic model is based on the following assumptions: 

(1) Sources are of the ON-OFF nature. Unless stated otherwise, the source activity factor is equal to 1 for all
$k \in \mathcal{K}$. The impacts of employing user activity detection are studied
in the example of a single-class system.

(2) TPC is sufficient to ensure that the standard deviation of the local average SIR is
always under control. The cell capacity and the resource consumption of each
class-$k$ connection, for all $k \in \mathcal{K}$, are bounded Gaussian random variables lying in
$[C_l, C_u]$ and $[w_{l,k}, w_{u,k}]$, respectively, with means and variances given in Table 12.4.

(3a) Without QoS differentiation, users and calls are served on a complete-sharing FCFS
(first come first served) basis. For all $k \in \mathcal{K}$, class-$k$ calls arrive at the corresponding
cell according to a Poisson process with rates $\lambda_{1,k}$ and $\lambda_{h,k}$ for new and handoff
calls, respectively.

(3b) With QoS differentiation, for all $j \in \mathcal{J}$ and $k \in \mathcal{K}$, let $\lambda_{1,jk}$ and $\lambda_{h,jk}$ be the Poisson
arrival rates for new and handoff calls, respectively, of traffic class-$(j, k)$.

(4) The mean call-holding time, given that there are no handoff failures, is $1/\mu_1$, and the
outgoing handoff rate per cell per call is $\mu_2$, which is commonly valid for all classes
of calls. These are exponentially distributed random variables. Therefore, the
service time in the cell, i.e. the cell-resident time, is also an exponentially distributed
random variable having a mean equal to $1/\mu = 1/(\mu_1 + \mu_2)$. In the condition of
user mobility equilibrium, i.e. the mean number of incoming mobile terminals is equal
to the mean number of outgoing mobile terminals per time-unit per cell, there are
two simple models to determine the value of $\mu_2$ [60]. Denote $v$ as the speed of
a mobile terminal, $L$ as the length of the cell perimeter and $A$ as the cell area. In a
macrocellular environment, terminals usually move with high speed along the cell
in one direction. The linear model can be used to approximate $\mu_2$ as follows:

$$\mu_2 = E[v]/L \qquad (12.10)$$

In a microcellular environment represented by a two-dimensional model with random
user movement, $\mu_2$ is given by:

$$\mu_2 = E[v]L/(\pi A) \qquad (12.11)$$

With assumption (3a), the arrival rate of handoff calls can also be approximated as
in [61] and [62]:

$$\lambda_{h,k} = \lambda_{1,k}(1 - B_k)(\mu_1/\mu_2 + F_k)^{-1} \qquad (12.12)$$

where $B_k$ is the new-call blocking probability and $F_k$ is the handoff failure probability
of class-$k$. With assumption (3b), we have:

$$\lambda_{h,jk} = \lambda_{1,jk}(1 - B_{jk})(\mu_1/\mu_2 + F_{jk})^{-1} \qquad (12.13)$$

where $B_{jk}$ is the new-call blocking probability and $F_{jk}$ is the handoff failure probability
of class-$(j, k)$.

(5) Three different CAC policies are adopted alternatively for the performance com- 
parison purposes. The first and the second one are the modeling-based MdCAC 
and the measurement-based MsCAC of static complete-sharing policies. The third 
one is the SCAC policy, which is discussed first in complete-sharing and then in 
QoS differentiation scenarios. 
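The mobility relations in assumption (4) can be illustrated with a short sketch. It follows Equations (12.10), (12.11) and (12.13) as read from the text; the function names and all numerical rates in the usage lines are hypothetical. The last lines check that the handoff arrival rate is consistent with a flow balance in which the handoff rate equals $\mu_2$ times the mean number of calls in progress, which is the assumption behind this form.

```python
import math

def handoff_rate_linear(mean_speed, cell_length):
    """Equation (12.10): mu_2 = E[v] / L (macrocellular, one-directional motion)."""
    return mean_speed / cell_length

def handoff_rate_2d(mean_speed, perimeter, area):
    """Equation (12.11): mu_2 = E[v] * L / (pi * A) (microcellular, random motion)."""
    return mean_speed * perimeter / (math.pi * area)

def handoff_arrival_rate(new_rate, blocking, failure, mu1, mu2):
    """Form of Equation (12.13): lambda_h = lambda_1 (1 - B) / (mu1/mu2 + F)."""
    return new_rate * (1.0 - blocking) / (mu1 / mu2 + failure)

# Hypothetical values: 120 s mean holding time, 1 m/s mean speed, 200 m length.
mu1 = 1.0 / 120.0
mu2 = handoff_rate_linear(1.0, 200.0)
lam_new, B, F = 0.1, 0.02, 0.01
lam_ho = handoff_arrival_rate(lam_new, B, F, mu1, mu2)

# Flow balance: accepted calls stay 1/(mu1 + mu2) on average and hand off at rate mu2.
balance = mu2 * (lam_new * (1 - B) + lam_ho * (1 - F)) / (mu1 + mu2)
print(lam_ho, balance)
```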

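For the complete-sharing SCAC, define $\pi_k(c)$ as the admission probability of a new
call for class-$k$ service when the system is in state $c$. From Equations (12.45), (12.46) and
(12.47), $\pi_k(c)$ can be given by:

$$\pi_k(c) = \begin{cases} 1, & \text{if } c + w_k \le C_l \\ 1 - Q\left(\dfrac{(1 - \eta - c - w_k) - fE[c]}{\sqrt{f\,\mathrm{Var}[c]}}\right), & \text{if } C_l < c + w_k \le C_u \\ 0, & \text{otherwise} \end{cases} \qquad (12.14)$$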
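The three-branch admission rule of Equation (12.14) can be sketched as follows. This is an illustration under stated assumptions: the Gaussian branch uses the margin $(1 - \eta - c - w_k) - fE[c]$ normalized by $\sqrt{f\,\mathrm{Var}[c]}$, the function names are hypothetical, and the mean and variance of the load state passed in the usage lines are made-up values rather than results from the text.

```python
from statistics import NormalDist

def q_function(x):
    """Standard normal tail probability Q(x) = 1 - Phi(x)."""
    return 1.0 - NormalDist().cdf(x)

def scac_admission(c, w_k, c_lower, c_upper, eta, f, mean_c, var_c):
    """Admission probability of a new class-k call at load state c.

    Below the lower limit the call is always admitted, above the upper
    limit it is always rejected, and in between the probability reflects
    how likely the other-cell load is to leave enough room.
    """
    after = c + w_k
    if after <= c_lower:
        return 1.0
    if after <= c_upper:
        margin = (1.0 - eta - after) - f * mean_c
        return 1.0 - q_function(margin / (f * var_c) ** 0.5)
    return 0.0

# Capacity limits from Table 12.4; mean_c and var_c are hypothetical.
for c in (0.30, 0.60, 0.70, 0.80):
    print(c, scac_admission(c, 0.005, 0.5806, 0.7200, 0.1, 0.4, 0.5, 0.004))
```

The probability falls smoothly from 1 towards 0 as the load moves from $C_l$ to $C_u$, which is the soft decision that distinguishes SCAC from a single hard capacity limit.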

To avoid an overestimate of $C_u$ that may increase the system outage probability, some
adjustments of the mean and variance, e.g. increasing them slightly, or other functions,
such as the incomplete Gamma function, can be used. For the latter option, $\pi_k(c)$ becomes:

$$\pi_k(c) = 1 - \Gamma[\nu, \alpha(1 - \eta - c - w_k)] \quad \text{for } C_l < c + w_k \le C_u \qquad (12.15)$$

where $\Gamma(\nu, x)$ is the incomplete Gamma distribution function. The $\alpha$ and $\nu$ parameters satisfy
the following relations:

$$fE[c] = \nu/\alpha \quad \text{and} \quad f\,\mathrm{Var}[c] = \nu/\alpha^2$$
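Solving the two moment relations for the Gamma parameters gives $\alpha = E[c]/\mathrm{Var}[c]$ and $\nu = fE^2[c]/\mathrm{Var}[c]$. The sketch below performs this moment matching; the function name and the numerical inputs are hypothetical.

```python
def gamma_parameters(mean_c, var_c, f):
    """Match nu/alpha = f*E[c] and nu/alpha**2 = f*Var[c].

    Dividing the first relation by the second gives alpha; substituting
    back into the first gives nu.
    """
    alpha = mean_c / var_c
    nu = f * mean_c ** 2 / var_c
    return nu, alpha

# Hypothetical load statistics and other-to-own interference coefficient.
nu, alpha = gamma_parameters(mean_c=0.5, var_c=0.004, f=0.4)
print(nu, alpha)
```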

In general, $E[c]$ and $\mathrm{Var}[c]$ can be calculated by using steady-state solutions and inversion
techniques. To reduce the computational complexity, the corresponding mean and variance
values of the MdCAC system can be reused. This is reliable because, with assumptions (1) and
(2) above, the loss network model can be expected to represent well the stationary behavior
of those systems. For SCAC with QoS differentiation, by invoking Equation (12.8) we have:

$$a_{jk}(c) = a_{0,jk}(c)\,\pi_k(c) \quad \text{for all } j \in \mathcal{J} \text{ and } k \in \mathcal{K} \qquad (12.16)$$

where the $a_{0,jk}(c)$ factor is the admission probability of traffic class-$(j, k)$ determined by
QoS differentiation paradigms; the $\pi_k(c)$ factor comes from the optimal interference management
of SCAC given by Equation (12.14) or (12.15). The connection-oriented RT traffic
has higher priority over the NRT traffic, which is served as background services. For clarity,
additional modeling issues and assumptions for NRT packet access and services are
discussed later.

12.4.4 Performance evaluation

The performance measures of interest, which characterize the GoS, are the probabilities of
new-call blocking and handoff failure. In an optimization framework, a linear combination of
these probabilities is usually used to define the GoS as a cost function. The weighting factors
are usually decided based on the 'importance' of calls, considering new-call or handoff, user
classes, cost of service classes, etc. For example, GoS for a single-class system in Zander and
Kim [50] has been defined as GoS = $(10F + B)$, where $F$ is the handoff failure probability
and $B$ is the new-call blocking probability. The other important performance measure is
the equilibrium QoS loss probability, i.e. the probability of losing communication quality
in the system outage state. The Erlang capacity of the system can be determined for given
percentages of these loss probabilities. In complete-sharing systems, for all $k \in \mathcal{K}$ and in
steady-state condition, denote $B_k$ as the new-call blocking probability of class-$k$ calls and
$F_k$ as the handoff failure probability of class-$k$ calls. In systems with QoS differentiation,
for all $j \in \mathcal{J}$ and $k \in \mathcal{K}$ and in steady-state condition, denote $B_{jk}$ as the new-call blocking
probability of traffic class-$(j, k)$ and $F_{jk}$ as the handoff failure probability of traffic class-$(j, k)$.

In addition, operators may be interested in group behavior. Thus, for all $j \in \mathcal{J}$ and in
steady-state condition, denote $B_j$ as the new-call blocking probability of class-$j$ users and
$F_j$ as the handoff failure probability of class-$j$ users. Finally, denote $P_{\text{loss}}$ as the equilibrium
QoS loss probability.



12.4.5 Related results 

In this subsection, the results of extensive studies on product-form loss networks, stochastic
knapsack problems and effective-bandwidth allocation [63-68], which support the
analytical methods used in this section, are summarized. A single-link, complete-sharing
loss system with no buffering is considered. Let the system capacity be $C$ and the offered
traffic intensity of class-$k$ be $a_k = \lambda_k/\mu_k$, where $\lambda_k$ is the Poisson arrival rate and $1/\mu_k$ is
the mean service time of a class-$k$ call. Define the set of all possible system states:

$$\Omega = \left\{ \mathbf{n} : \sum_{k \in \mathcal{K}} n_k w_k \le C \right\} \qquad (12.17)$$

The steady-state probability of the system being in state $\mathbf{n}$ has a product form:

$$p(\mathbf{n}) = \frac{1}{G} \prod_{k \in \mathcal{K}} \frac{a_k^{n_k}}{n_k!}, \quad \mathbf{n} \in \Omega \qquad (12.18)$$

where

$$G = \sum_{\mathbf{n} \in \Omega} \prod_{k \in \mathcal{K}} \frac{a_k^{n_k}}{n_k!} \qquad (12.19)$$

The blocking probability of class-$k$ calls can be determined theoretically by:

$$B_k = \sum_{\mathbf{n} \in \Omega:\ \mathbf{n}\mathbf{w} + w_k > C} p(\mathbf{n}) \qquad (12.20)$$

The cost of computation with the above formula can be prohibitively high for a large state
set, i.e. for large $K$ and $C/w_k$. This problem has been considered by many authors, resulting
in elegant and efficient recursion techniques for the calculation of blocking probabilities.
For the practical evaluation purposes of this section, i.e. keeping it accurate but as simple
as possible without loss of generality, the stochastic-knapsack approximation described in
References [63, 66] is chosen, as it is believed to be best suited to investigating various load-based
and cost-effective CAC policies.

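Define the set of all feasible system load states:

$$\Psi = \{ c : c = \mathbf{n}\mathbf{w},\ \mathbf{n} \in \Omega \} \qquad (12.21)$$

The steady-state probability of the load state $c$ is given by:

$$s(c) = \frac{q(c)}{\sum_{c \in \Psi} q(c)} \qquad (12.22)$$

where $q(c)$ is given in a recursive form as follows:

$$q(c) = \frac{1}{c} \sum_{k \in \mathcal{K}} w_k a_k\, q(c - w_k); \quad c \in \Psi,\ q(0) = 1 \text{ and } q(x) = 0 \text{ for } x < 0 \qquad (12.23)$$

The blocking probability of class-$k$ calls can now be determined by:

$$B_k = \sum_{c \in \Psi:\ c > C - w_k} s(c) \qquad (12.24)$$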
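The recursion of Equation (12.23), commonly known as the Kaufman-Roberts recursion, and the blocking sum of Equation (12.24) can be sketched in a few lines. One assumption is needed to make the load states enumerable: the capacity and the per-class resource consumptions are expressed as integer multiples of a basic resource unit. The function names and the traffic values in the usage lines are hypothetical.

```python
def load_distribution(capacity, weights, intensities):
    """Steady-state probabilities s(c) for c = 0..capacity.

    Implements c*q(c) = sum_k w_k * a_k * q(c - w_k) with q(0) = 1 and
    q(x) = 0 for x < 0, followed by normalization (Equation (12.22)).
    Assumption: capacity and weights are integers in a common unit.
    """
    q = [0.0] * (capacity + 1)
    q[0] = 1.0
    for c in range(1, capacity + 1):
        total = 0.0
        for w, a in zip(weights, intensities):
            if c - w >= 0:
                total += w * a * q[c - w]
        q[c] = total / c
    norm = sum(q)
    return [x / norm for x in q]

def blocking_probability(s, capacity, w):
    """Equation (12.24): probability mass of the states with c > C - w_k."""
    return sum(s[c] for c in range(capacity - w + 1, capacity + 1))

# Hypothetical two-class system: 10 resource units, classes using 1 and 3 units.
s = load_distribution(10, [1, 3], [2.0, 1.0])
print(blocking_probability(s, 10, 1), blocking_probability(s, 10, 3))
```

With a single class that consumes one unit per call, the result reduces to the Erlang B formula, which gives a convenient sanity check.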



12.4.6 Modeling-based static complete-sharing MdCAC system 

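The above results can be applied directly for analyzing this system with the following
parameters: the system capacity is $C_a$ instead of $C$, and $a_k = (\lambda_{1,k} + \lambda_{h,k})/(\mu_1 + \mu_2)$. Then, $B_k$ is
obtained by using Equation (12.24):

$$F_k = B_k \qquad (12.25)$$

$$P_{\text{loss}} = \frac{\sum_{c \in \Psi} c\, s(c)\, Q\left(\dfrac{(1 - \eta - c) - fE[c]}{\sqrt{f\,\mathrm{Var}[c]}}\right)}{\sum_{c \in \Psi} c\, s(c)} \qquad (12.26)$$

where

$$E[c] = \sum_{c \in \Psi} c\, s(c) \qquad (12.27)$$

$$\mathrm{Var}[c] = \sum_{c \in \Psi} c^2 s(c) - E^2[c] \qquad (12.28)$$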
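The moments of Equations (12.27)-(12.28) and the equilibrium QoS loss probability of Equation (12.26) can be evaluated from any load-state distribution, as in the sketch below. It assumes the load-weighted form of $P_{\text{loss}}$, in which each state contributes $c\,s(c)$ times the Gaussian tail probability that other-cell load exceeds the remaining room $1 - \eta - c$. The function names and the three-state distribution in the usage lines are hypothetical.

```python
from statistics import NormalDist

def load_moments(states, probs):
    """Mean and variance of the load state c, Equations (12.27)-(12.28)."""
    mean = sum(c * s for c, s in zip(states, probs))
    var = sum(c * c * s for c, s in zip(states, probs)) - mean ** 2
    return mean, var

def qos_loss_probability(states, probs, eta, f):
    """Load-weighted outage probability in the form of Equation (12.26).

    Other-cell load is modeled as Gaussian with mean f*E[c] and variance
    f*Var[c]; Q((x - m)/sigma) is computed as 1 - cdf(x).
    """
    mean, var = load_moments(states, probs)
    other = NormalDist(mu=f * mean, sigma=(f * var) ** 0.5)
    num = sum(c * s * (1.0 - other.cdf(1.0 - eta - c))
              for c, s in zip(states, probs))
    den = sum(c * s for c, s in zip(states, probs))
    return num / den

# Hypothetical three-state load distribution.
states, probs = [0.0, 0.3, 0.6], [0.2, 0.5, 0.3]
print(load_moments(states, probs), qos_loss_probability(states, probs, 0.1, 0.4))
```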

Given the offered traffic intensity of each service class, the performance of the system
strictly depends on $C_a$ and $\mathbf{w}$, which are needed a priori. Overestimates or underestimates
of these parameters may result in wasting or insufficiently allocating resources, thereby
reducing the GoS or lowering the perceived QoS, respectively. Furthermore, in the presence
of additional uncertainty causing changes in the traffic descriptor parameters, the system
cannot provide appropriate QoS to the users and therefore $C_a$ and $\mathbf{w}$ need to be redesigned.
However, with reliable modeling, good overall statistical multiplexing gain and performance
can be expected. From TDMA/GSM experience, static channel allocations in
some circumstances can even provide better capacity gain than dynamic ones. Note that
$C_a$ and $\mathbf{w}$ can be improved, i.e. maximizing $C_a$ while keeping $\mathbf{w}$ as low as possible for
given QoS requirements, by using techniques like sectorization, multiuser detection, smart
antennas, etc.

12.4.7 Measurement-based complete-sharing MsCAC system 

MsCAC should overcome the nonrobustness problem of MdCAC described above. The
network side attempts to learn the statistics of the traffic by online measurements. Under
assumption (2) of the traffic model, the stationary $w_k$ and $C_a$ are bounded Gaussian random
variables having means $w_k$ and $C_a$, and variances $\sigma_k^2$ and $\Sigma^2$, respectively. Here, $C_a$ and $w_k$
denote the random variables and also their mean values. The variances can be estimated
by using the corresponding boundary and mean values. An acceptable equilibrium
outage probability of the system is also specified.

The goal is to quantify and to illustrate the system behavior and its sensitivity to estimation
errors, and to lay the groundwork for studying the scaled aggregate UL load fluctuations
of the traffic with more sophisticated models. The aim is neither to study the details of any specific
measurement-based CAC algorithm nor to provide a parameter estimation framework.
Some valuable results for measurement-based CAC in single-class systems can be
found in Dziong et al. [57], whereas Sampath and Holtzman [58] investigate measurement-based
CAC algorithms by simulations. Herein, the multivariate Gaussian approximation is
used to dimension the admissible region. In the MsCAC system, stationary constraints of
the system capacity can be defined by:

(12.29)

The components of Equation (12.29) are obtained by online estimations. The additive
uncertainty due to measurement errors may have a significant impact on the system performance.
In addition, the need for reliable estimations of mean and variance values for
the various service classes and on a cell basis may result in far more complex hardware/software
implementations compared with the modeling-based policies. The performance characteristics
of the MsCAC system will be obtained by simulations of a simple memoryless and
an auto-regressive measurement-based system. The over-bounding hyperplane of the admissible
region can be determined by its intersections with the axes in M-dimensional Euclidean
space. The intersection point (0, 0, . . . , q, 0, . . . , 0) corresponds to the single-class capacity
region of class-$k$, which is given in estimation by:

(12.30)

For Gaussian estimation errors, q can be quantified by:

(12.31)

where $\varepsilon$ and $\delta$ are N(0, 1) normal random variables [57] representing the impacts of measurement
errors. It has been shown in Sampath and Holtzman [58] by simulations that measurement-based
CAC policies are best suited for serving quite bursty traffic. Their performance is
almost the same, and none of them is capable of meeting the loss targets accurately. One
can reach the same conclusion from the above analysis.



12.4.8 Complete-sharing dynamic SCAC system 

It has been shown [64, 66, 67] that for systems with larger $C/w_k$, which is the case in WCDMA cellular
systems, the stationary behavior of the system load states $c$ approaches that of a one-dimensional
Markov process. Thus, for the analysis of the SCAC system we invoke Equations (12.21)-
(12.23) with the following modifications. The system capacity is now $C_u$. The behavior
of the system when the system load state is less than $C_l$ is exactly the same as that of the
complete-sharing system. When the system load state is between $C_l$ and $C_u$, a soft decision
is used for the acceptance of a new call. The highest priority is given to a handoff call.
Equation (12.23) now becomes:

$$q(c) = \frac{1}{c} \sum_{k \in \mathcal{K}} w_k a_k(c - w_k)\, q(c - w_k); \quad c \in \Psi,\ q(0) = 1 \text{ and } q(x) = 0 \text{ for } x < 0 \qquad (12.32)$$

where

$$a_k(c) = \frac{\lambda_{h,k} + \lambda_{1,k}\,\pi_k(c)}{(\mu_1 + \mu_2)[1 + \Theta(c)]} \qquad (12.33)$$

$\pi_k(c)$ is the admission probability of a new call given by Equation (12.14) or (12.15);
$\Theta(c)$ is a function of the system load state representing the call-drop rate due to loss of the
communication quality. For instance, a linear model for $\Theta(c)$ with a constant empirical
weight factor $\xi$ can be defined as follows:

$$\Theta(c) = \begin{cases} 0, & \text{if } c \le C_l \\ \xi (c - C_l)/(C_u - C_l) \in [0, 1], & \text{if } C_l < c \le C_u \end{cases} \qquad (12.34)$$

The performance measures are obtained as follows:

$$B_k = \sum_{c \in \Psi} s(c)[1 - \pi_k(c)] \qquad (12.35)$$

$$F_k = \sum_{c \in \Psi:\ c > C_u - w_k} s(c) \qquad (12.36)$$

$$P_{\text{loss}} = \frac{\sum_{c \in \Psi} c\, s(c)\, Q\left(\dfrac{(1 - \eta - c) - fE[c]}{\sqrt{f\,\mathrm{Var}[c]}}\right)}{\sum_{c \in \Psi} c\, s(c)} \qquad (12.37)$$



With this SCAC, the GoS, including the new-call blocking probability and the handoff failure probability,
can be significantly improved over MdCAC and MsCAC. However, in complete-sharing
systems, beyond a certain load state, higher resource-consuming users can hardly gain access
to the system if the traffic intensity of lower resource-consuming users is relatively
heavy. This can be overcome by using QoS differentiation combined with SCAC to meet
potential customer demands.

12.4.9 Dynamic SCAC system with QoS differentiation

Assume that handoff calls have the highest priority regardless of their user class association.
For characterization of the steady system load states, we invoke Equations (12.17), (12.21),
(12.22), (12.32) and (12.34) and assumption (3b). Equation (12.33) is now modified as
follows:

$$a_k(c) = \sum_{j \in \mathcal{J}} \frac{\lambda_{h,jk} + \lambda_{1,jk}\, a_{jk}(c)}{(\mu_1 + \mu_2)[1 + \Theta(c)]} \qquad (12.38)$$

The performance measures of interest for this system are given as:

$$B_{jk} = \sum_{c \in \Psi} s(c)[1 - a_{jk}(c)] \qquad (12.39)$$

$$F_{jk} = \sum_{c \in \Psi:\ c > C_u - w_k} s(c) \qquad (12.40)$$

By summing up $B_{jk}$ or $F_{jk}$ over the set $\mathcal{J}$ or $\mathcal{K}$, we can obtain loss probabilities for group
behaviors. Thus:

$$B_j = \sum_{k \in \mathcal{K}} B_{jk} \qquad (12.41)$$

$$F_j = \sum_{k \in \mathcal{K}} F_{jk} \qquad (12.42)$$

The equilibrium QoS loss probability can be obtained by using Equation (12.37). By updating
the admission probability table, $A = [a_{jk}(c)]$, $j \in \mathcal{J}$, $k \in \mathcal{K}$, $0 \le a_{jk}(c) \le 1$, as defined
in Equation (12.8), operators can easily control and optimize the GoS of their serving
networks.



12.4.10 Example of a single-class system 

This section provides a simple probabilistic interpretation of the SCAC analysis presented
above based on a single-class system. In this case, the analysis as well as the implementation
can be greatly simplified and presented with numbers. However, the insight into system
behavior is still preserved and well represented. Let the call arrival rates at the corresponding
cell be $\lambda_1$ and $\lambda_h$ for new and handoff calls respectively, $r$ the bit-rate and $\gamma_{\text{target}}$ the SIR
target of the service. Let $p$ be the probability that a source is active during its conversation,
i.e. actually transmits data, independently of other sources. Denote $n$ as the number of
users occupying the cell, i.e. the number of calls being served in the system, and $j$ as the
number of active users among the $n$ users. The average capacity, i.e. the average upper limit of the
number of admissible calls in the cell, can be expressed as follows:

$$c_{\text{avrg}} = \left\lfloor \frac{(W/r)(1 - \eta)}{\gamma_{\text{target}}(1 + f)p} \right\rfloor \qquad (12.43)$$

where $\lfloor x \rfloor$ is the maximum integer not exceeding the argument. For the fixed channel
assignment-based CAC system, Erlang's B-formula with the number of servers equal to
$c_{\text{avrg}}$ can be used to derive the performance measures. For the SCAC system, the other-cell
interference impacts can be expressed by:

$$j + m \le \left\lfloor \frac{(W/r)(1 - \eta)}{\gamma_{\text{target}}} \right\rfloor = c_0 \qquad (12.44)$$

where $m$ is a nonnegative number obtained by transforming other-cell interference into an
equivalent non-integral number of active users. Since the other-cell interference is modeled
as a Gaussian random variable, we have:

$$\Pr\{m \le c_0 - j\} = 1 - Q\left(\frac{(c_0 - j) - E[m]}{\sqrt{\mathrm{Var}[m]}}\right) \qquad (12.45)$$

where $E[m]$ and $\mathrm{Var}[m]$ can be obtained as:

$$E[m] = fE[j] \qquad (12.46)$$

$$\mathrm{Var}[m] = f\,\mathrm{Var}[j] \qquad (12.47)$$

The lower and upper limits of the cell capacity are given by $c_l$ and $c_u$, respectively. The
permission probability of a new call at state $n$ is defined as follows:

$$\pi(n) = \begin{cases} 1, & \text{if } n < c_l \\ 1 - \sum_{j=0}^{n} b(j; n, p)\, Q\left(\dfrac{(c_0 - j) - fE[j]}{\sqrt{f\,\mathrm{Var}[j]}}\right), & \text{if } c_l \le n < c_u \\ 0, & \text{otherwise} \end{cases} \qquad (12.48)$$

where $b(j; n, p)$ is the binomial distribution:

$$b(j; n, p) = \binom{n}{j} p^j (1 - p)^{n - j} \qquad (12.49)$$

The handoff request is accepted right away if fewer than $c_u$ calls are being served in the
cell, otherwise it is rejected. Denote $p_n$ as the steady-state probability of the system being
in state $n$. The general steady-state solution of the birth-death process has the form:

$$p_n = p_0 \prod_{i=1}^{n} \frac{\lambda_{i-1}}{\mu_i} \quad \text{for } n \ge 1 \qquad (12.50)$$

where $\lambda_n$ and $\mu_n$ are the birth and death rates at state $n$, respectively. In our system model:

$$\lambda_n = \begin{cases} \lambda_h + \pi(n)\lambda_1, & \text{if } 0 \le n < c_u \\ 0, & \text{otherwise} \end{cases} \qquad (12.51)$$

$$\mu_n = n(\mu_1 + \mu_2) \quad \text{for } 1 \le n \le c_u \qquad (12.52)$$

Therefore, the steady-state probabilities can be determined using Equations (12.50)-(12.52)
and the following condition:

$$\sum_{n=0}^{c_u} p_n = 1 \qquad (12.53)$$

Define $s_j$ as the steady-state probability that there are exactly $j$ active users in the cell:

$$s_j = \sum_{n=j}^{c_u} b(j; n, p)\, p_n \qquad (12.54)$$

where $b(j; n, p)$ is given by Equation (12.49). $E[j]$ and $\mathrm{Var}[j]$ in Equation (12.48) are given
by:

$$E[j] = \sum_{j=0}^{c_u} j\, s_j \qquad (12.55)$$

$$\mathrm{Var}[j] = \sum_{j=0}^{c_u} j^2 s_j - E^2[j] \qquad (12.56)$$

Closed-form solutions for the performance measures are obtained as follows:

$$B = \sum_{n=c_l}^{c_u} p_n [1 - \pi(n)] \qquad (12.57)$$

$$F = p_{c_u} \qquad (12.58)$$

$$P_{\text{loss}} = \frac{\sum_{j} j\, s_j\, Q\left(\dfrac{(c_0 - j) - fE[j]}{\sqrt{f\,\mathrm{Var}[j]}}\right)}{\sum_{j} j\, s_j} \qquad (12.59)$$
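The single-class analysis can be assembled into one short numerical sketch. It is only an illustration under stated assumptions: $E[j]$ and $\mathrm{Var}[j]$ are taken from the fixed-capacity (Erlang loss) system with $c_{\text{avrg}}$ servers, following the earlier remark that the moments of the MdCAC system can be reused; the birth rate at state $n$ is taken as $\lambda_h + \pi(n)\lambda_1$; and the arrival and departure rates in the usage lines are hypothetical. The capacity values are those of Table 12.5.

```python
from math import comb, sqrt
from statistics import NormalDist

def q_function(x):
    """Standard normal tail probability."""
    return 1.0 - NormalDist().cdf(x)

def birth_death(birth, death_per_call, n_max):
    """Steady state p_n for birth rates birth(n) and death rates n*death_per_call."""
    p = [1.0]
    for n in range(1, n_max + 1):
        p.append(p[-1] * birth(n - 1) / (n * death_per_call))
    total = sum(p)
    return [x / total for x in p]

def binom(j, n, p):
    """Binomial probability b(j; n, p)."""
    return comb(n, j) * p ** j * (1 - p) ** (n - j)

def active_user_moments(p_n, activity):
    """Mean and variance of the number of active users j."""
    n_max = len(p_n) - 1
    s = [sum(binom(j, n, activity) * p_n[n] for n in range(j, n_max + 1))
         for j in range(n_max + 1)]
    mean = sum(j * sj for j, sj in enumerate(s))
    var = sum(j * j * sj for j, sj in enumerate(s)) - mean ** 2
    return mean, var

def permission(n, c0, c_l, c_u, activity, f, mean_j, var_j):
    """Permission probability of a new call at state n."""
    if n < c_l:
        return 1.0
    if n >= c_u:
        return 0.0
    sigma = sqrt(f * var_j)
    outage = sum(binom(j, n, activity)
                 * q_function(((c0 - j) - f * mean_j) / sigma)
                 for j in range(n + 1))
    return 1.0 - outage

def scac_single_class(lam_new, lam_ho, mu, c0, c_avg, c_l, c_u, activity, f):
    # Assumption: reuse the moments of the fixed-capacity Erlang loss system.
    p_fixed = birth_death(lambda n: lam_new + lam_ho, mu, c_avg)
    mean_j, var_j = active_user_moments(p_fixed, activity)
    pi = [permission(n, c0, c_l, c_u, activity, f, mean_j, var_j)
          for n in range(c_u + 1)]
    p = birth_death(lambda n: lam_ho + pi[n] * lam_new, mu, c_u)
    blocking = sum(p[n] * (1.0 - pi[n]) for n in range(c_l, c_u + 1))
    failure = p[c_u]
    return blocking, failure, pi, p

# Hypothetical rates (30 Erlang offered in total); capacities from Table 12.5.
blocking, failure, pi, p = scac_single_class(
    lam_new=0.25, lam_ho=0.05, mu=0.01,
    c0=23, c_avg=32, c_l=25, c_u=45, activity=0.4, f=0.4)
print(blocking, failure)
```

Reusing the Erlang loss moments avoids the fixed-point iteration that would otherwise be needed, since $\pi(n)$ depends on $E[j]$ and $\mathrm{Var}[j]$, which in turn depend on the steady-state probabilities.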



Figure 12.10 depicts the above characteristics for a voice-only system with voice-source
activity factor $p = 0.4$. In order to ease the computations, the system parameters given in
Table 12.5 correspond to the IS-95 system [48, 49]. The comparison with Erlang's
loss system shows that the SCAC system gains a significant improvement in GoS. Overall, the SCAC
system offers much better Erlang capacity and performance.

Figure 12.10 Benefits of SCAC in a voice-only system.

Table 12.5 Parameters for single-class voice-only CDMA system

| Parameter | Definition | Values |
|---|---|---|
| $W$ | CDMA chip rate | 1.25 Mchip/s |
| $r$ | Bit-rate of the users | 9.6 kb/s |
| $\gamma_{\text{target}}$ | SIR to meet the required QoS, i.e. BER target | 7 ± 1 dB for BER = 10 x 10^-3 |
| $f$ | Coefficient of the equilibrium other-to-own cell interference | 40 ± 15 % |
| $\eta$ | Constant coefficient of the thermal noise density and $I_{0\text{req}}$ | -10 dB |
| $p$ | Voice-source activity factor during the call | 0.4 |
| $c_0$ | Maximum number of active users in an isolated cell | 23 |
| $c_{\text{avrg}}$ | Average system capacity | 32 |
| $c_l$ | Lower limit of the system capacity | 25 |
| $c_u$ | Upper limit of the system capacity | 45 |

12.4.11 NRT packet access control

As mentioned before, NRT packet access in UL needs to be controlled so that QoS as
well as GoS of the RT traffic is not affected by optimal throughput-delay tradeoff. For the
design of such dynamic control mechanisms, there is a need for precise understanding
of the dynamics of the available UL radio resources for NRT packet transmissions while
RT traffic is being served adequately. The main purpose of this section is to provide a
reliable method, based on asymptotic and quasi-stationary analysis of the RT traffic, to
predict the free capacity and average upper limits of UL data throughput. Based on the
prediction results, a simple and effective DFIMA scheme is proposed for NRT packet access
control.

12.4.12 Assumptions 

Denote $R_p$ as the bit-rate for packet-data transmission in UL, $L_p$ as the packet length in
bits, $\gamma_p$ as the SIR target to meet the QoS requirements of packet transmission and $T_p$ as
the transmit time interval (TTI) needed for a packet at the air interface, $T_p = L_p/R_p$. The
following assumptions are made in addition to those made earlier in this section.

(1) RT services have higher priority over NRT messaging services. Resource consump- 
tion of the NRT packet-service domain should never exceed the free resources left 
by the RT traffic. 

(2) Users share a common channel for NRT packet access in UL using different
DS/CDMA code sequences. The amount of UL resources consumed by a packet
transmission can be determined similarly to the resource consumption of an RT
connection that is given in Equation (12.1). The parameters needed for packet
transmissions can be predefined or decided by the access control.

(3) The time axis is divided into $T_s$-long time-slots for packet transmissions; $T_s$
can be equal to one or more radio-frame durations of, e.g., 10 ms as defined in
3GPP standards. Packet transmissions are synchronized to start at the beginning
of a time-slot.

(4a) Without QoS differentiation, $T_p$ is set equal to $T_s$. Thus, packet transmissions
start at the beginning and finish at the end of the same time-slot.
For a given time-slot, all transmissions have the same bit-rate and SIR target, which
may be changed on a slot-by-slot basis.

(4b) For different user classes, QoS differentiation should guarantee different maximum 
delays and minimum data rates when sending messages, e.g. emails, pictures, etc. 
Therefore, packet transmissions may have different parameters (bit-rate, SIR target, 
TTI) and resource consumption, which are controlled by the access control taking 
into account QoS differentiation paradigms. 

12.4.13 Estimation of average upper-limit (UL) data throughput 

Let the free capacity of the system left by the RT traffic at time $t$ be $z(t)$. The stationary
distribution and quasi-stationary behavior of $z(t)$ over a sustained period of time are of
interest. Let $c(t)$ be the RT system load state at time $t$ and $C(t)$ the system 'soft' capacity
at time $t$. Hence:

\[ z(t) = C(t) - c(t) \]

\[ E[\lim_{t \to \infty} z(t)] = E\{\lim_{t \to \infty} [C(t) - c(t)]\} \]

\[ E[z] = C_a - E[c] \qquad (12.60) \]

where $z$ and $c$ are the free capacity and the RT system load state in equilibrium.
The processes $z$ and $c$ are not Markovian in general, but for larger $C_a/w_k$ they behave
as approximate Markov processes. The quasi-stationarity of $c$ over a time interval $\tau$ is
approximated by an exponential function [64, 69].

For instance, to obtain the equivalent one-dimensional birth-death process for $z$ and $c$ in
complete-sharing systems, the Pascal approximation can be used as in Grossglauser [67].
First, we need to scale the system capacity and the resource consumption of each RT service
class into integers. To do so we first assume $\min\{w_k, k \in K\} = w_1$. Then define the scaled
load vector, the scaled system state and the scaled average system capacity as follows:

\[ w^* = \{w^*_k, k \in K\} = \{\lfloor w_k/w_1 \rfloor, k \in K\}; \quad c^* = \lfloor c/w_1 \rfloor; \quad \text{and} \quad C^*_a = \lfloor C_a/w_1 \rfloor \]

respectively. The set of equivalent system states is then given by:

\[ \psi^* = \{c^* : c^* = 0, 1, 2, \ldots, C^*_a\} \]

Normalize the mean service-time of an RT call: $1/\mu = 1$. The equivalent birth-death process
in state $c^*$ has a death rate of $c^*$ and a birth rate of:

\[ \lambda(c^*) = v^2/\sigma^2 + c^*(1 - v/\sigma^2) \qquad (12.61) \]

where $v$ and $\sigma^2$ are given by:

\[ v = \sum_{k \in K} w^*_k \alpha_k \quad \text{and} \quad \sigma^2 = \sum_{k \in K} (w^*_k)^2 \alpha_k \qquad (12.62) \]

with $\alpha_k = (\lambda_{h,k} + \lambda_{n,k})/(\mu_1 + \mu_2)$ or its normalized value $\alpha_k = \lambda_{h,k} + \lambda_{n,k}$ for $\mu = 1$. The
steady-state probability $s(c^*)$ for $c^* \in \psi^*$ satisfies:

\[ c^* s(c^*) = \lambda(c^* - 1)\, s(c^* - 1) \ \text{ for } \mu = 1,\ c^* \ge 1 \quad \text{and} \quad \sum_{c^*=0}^{C^*_a} s(c^*) = 1 \qquad (12.63) \]
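
The recursion in Equation (12.63) is straightforward to evaluate. The sketch below is a minimal illustration, assuming the birth rate $\lambda(c^*) = v^2/\sigma^2 + c^*(1 - v/\sigma^2)$ of Equation (12.61) with $\mu = 1$; the function names and the numerical values of $v$ and $\sigma^2$ are hypothetical, and $\sigma^2 > v$ is assumed so that the birth rate stays positive.

```python
def pascal_birth_rate(c, v, var):
    """Birth rate of Equation (12.61) at scaled load state c (mu = 1)."""
    return v * v / var + c * (1.0 - v / var)


def pascal_steady_state(capacity, v, var):
    """Solve c*s(c) = lambda(c-1)*s(c-1) with sum s = 1 (Equation (12.63))."""
    weights = [1.0]                      # unnormalised s(c) / s(0)
    for c in range(1, capacity + 1):
        weights.append(weights[-1] * pascal_birth_rate(c - 1, v, var) / c)
    total = sum(weights)
    return [w / total for w in weights]


def mean_and_variance(s):
    """First two central moments of the load distribution s."""
    mean = sum(c * prob for c, prob in enumerate(s))
    second = sum(c * c * prob for c, prob in enumerate(s))
    return mean, second - mean * mean


if __name__ == "__main__":
    s = pascal_steady_state(200, v=5.0, var=10.0)   # illustrative values
    print(mean_and_variance(s))
```

When the scaled capacity is much larger than the mean load, truncation is negligible and the resulting distribution reproduces the prescribed mean $v$ and variance $\sigma^2$, which is the purpose of the Pascal approximation.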

The quasi-stationary probability of the RT traffic being in load state $c^*$ over a period of time $\tau$
can be defined by:

\[ p(c^*, \tau) = \lim_{t \to \infty} \Pr\{c^*(t + \tau) = c^* \mid c^*(t) = c^*\} \qquad (12.64) \]

Using the exponential property of quasi-stationary probability in loss networks mentioned
above, Equation (12.64) can be approximated by:

\[ p(c^*, \tau) = e^{-\tau[\lambda(c^*) + c^*]} \qquad (12.65) \]

Equation (12.65) for $\tau = T_p$ provides the equilibrium probability that there are a maximum of
$z^* = C^*_a - c^*$ units of resources available for packet transmissions. Let $S(c^*)$ be the
upper-limit number of packets that can be transmitted successfully in a given time-slot, for
a given load state $c^*$ of the RT service domain. $S(c^*)$ can be calculated by:

\[ S(c^*) = p(c^*, T_p)\, \frac{(C^*_a - c^*)\, w_1}{w_p} \qquad (12.66) \]

where $w_p$ denotes the resource consumption of one packet transmission, determined as in assumption (2).

The above equation can be used to study tradeoffs of the parameters for optimal packet
access. Let us assume that there are $N$ data users, each generating data packets according
to a Poisson process with rate $\Lambda$ per time-slot, $\Lambda < 1$. Hence, there are on average
$D = [N\Lambda]$ active data sources per slot. If each of them attempts to transmit its packet at
the beginning of a given time-slot with probability $\min[1, S(c^*)/D]$, the probability
of successful packet transmission, i.e. of having fewer than $S(c^*)$ initiated transmissions, is at
least 1/2. This is in accordance with the binomial distribution. The average upper limit of
UL packet-data throughput, denoted by $S$, is given by:

where $P_{loss}$ is the QoS loss probability. By using Little's formula, the average lower limit
of packet delay is given by $S/D$ time-slots.
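
The 'at least 1/2' statement can be checked directly from the binomial distribution. The sketch below is a hedged illustration: the function names are hypothetical, $S(c^*)$ is taken as an integer `s`, and success is counted as at most `s` initiated transmissions among `d` sources (a slightly looser event than 'fewer than', chosen because the median of a binomial distribution with an integer mean equals that mean).

```python
from math import comb


def prob_at_most(d, p, s):
    """P(X <= s) for X ~ Binomial(d, p)."""
    return sum(comb(d, k) * p**k * (1.0 - p)**(d - k) for k in range(0, s + 1))


def success_probability(d, s):
    """Probability that at most s of d sources transmit when each one uses
    the permission probability min(1, s/d)."""
    tpp = min(1.0, s / d)
    return prob_at_most(d, tpp, min(s, d))


if __name__ == "__main__":
    # 50 active sources competing for 16 packet opportunities (illustrative).
    print(success_probability(50, 16))
```

When $D \le S(c^*)$ every source transmits and the probability is 1; when $D > S(c^*)$ the permission probability thins the attempts so that their mean equals $S(c^*)$.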



12.4.14 DFIMA, dynamic feedback information-based access control 

The NRT packet access in UL needs to adapt to the quasi-stationary stochastic process of
free capacity left by the RT traffic. The DFIMA scheme is a promising candidate to optimize
UL packet transmission characteristics and RRU. The motivation behind this scheme is
as follows. Data terminals attempt to transmit their packets with a transmit permission
probability (TPP), bit-rate and TTI that change dynamically on a slot-by-slot basis according to
the feedback information from the network. Similar access control schemes have appeared
in different contexts. For instance, Rappaport [71] investigated feedback channel state
information-based carrier sensing MAC protocols for a centralized asynchronous CDMA
packet radio system. Thomas et al. [70] proposed a real-time access control scheme for
a data-only system and Viterbi [48] proposed a 1-bit feedback scheme in an integrated
voice/data system. The implementations vary from one to another, but they all agree
that the feedback schemes should be simple and effective.

Based on the results from the previous subsection, the TPP and the optimized bit-rate for
packet transmissions in the next time-slot can be predicted. In the case of QoS
differentiation, the TTI or packet length can be changed as well. These parameters can be
transmitted in the content of the feedback information. For instance, let the service
criterion be to serve as many users simultaneously as possible with optimal
throughput-delay tradeoff and no QoS differentiation. The number of active data
users in the next time-slot, $D = [N\Lambda]$, and the load state $c^*$ of the RT traffic at the end of
the current time-slot are supposed to be known. Based on Equations (12.65) and (12.66) together with
assumption (4a), a bit-rate can be chosen so that the $S(c^*)/D$ factor is as close to 1 as
possible. The TPP for users attempting to transmit their packets in the next time-slot is set equal to
$\min[1, S(c^*)/D]$. Thus, the maximum number of users can be served with optimum UL
throughput and a reasonable average packet delay of $S/D$, where $S$ is given by Equation
(12.67).

However, if a terminal has a packet to send, it needs to wait
until the next time-slot to attempt its transmission. Therefore, the access delay to the first attempt
per packet can be up to one time-slot. A packet transmission can be synchronized to the
beginning of a mini-slot (e.g. a 10 ms radio frame can be further divided into 15 mini-slots
as in 3GPP standards) to reduce the delay. This results in a spread-slotted asynchronous
multiple access system with feedback control, which is similar to the schemes reported in
References [71, 75-79] for a data-only CDMA radio system.

DFIMA can be extended to
provide flexible means of QoS differentiation. This can be done by setting a different
TPP and/or TTI and/or bit-rate for different user classes in the feedback information, depending
on the load state of the RT traffic and the offered load of the NRT traffic. DFIMA can be expected
to outperform the well-known ALOHA and CSMA in throughput-delay tradeoff to the
same extent as shown in Rappaport [71]. Also, this approach should overcome the hidden
terminal problem using feedback, and be simpler and more flexible than PRMA.

12.4.15 Performance examples 

A number of numerical and simulation results are presented in this section in order: 

• to quantify the system performance and the benefits of using the described SCAC and 
the QoS differentiation; 

• to study the effects and tradeoffs of the design parameters in the access control on the 
system behaviors and the performance characteristics through various system scenarios; 

• to demonstrate the flexibility and the accuracy of the analytical methods used to evaluate 
the teletraffic performance of WCDMA cellular systems in this section. 

The results are presented for a cellular system supporting three RT service classes: class
1 is 12.2 kbs voice, class 2 is 64 kbs video and class 3 is 144 kbs multimedia calls. The
parameters are given in Table 12.4, which are in agreement with Ariyavisitakul [53]. Owing
to limited space, we present results for offered traffic intensities of the three service classes
in the following proportions: 7:2:1, 5:4:1 and 4:3:3 of class 1, class 2 and class 3, respectively.
These proportions are called multimedia traffic intensity profiles (MTIP). In other words, let $\lambda$ be a
so-called common divisor of the three-class offered traffic; 7:2:1 MTIP, for instance, means
that the offered traffic of class 1 is $7\lambda$ Erlangs, class 2 is $2\lambda$ Erlangs and class 3 is $\lambda$ Erlangs.
Thus, the total offered traffic is 10 Erlangs if $\lambda = 1$.

Figure 12.11 MdCAC vs MsCAC.

Figure 12.12 Memory vs memoryless MsCAC systems.

Figure 12.11 presents the new-call blocking as well as the handoff failure probability of
each service class in MdCAC and MsCAC systems vs the common divisor of the offered
traffic intensities of the service classes in 4:3:3 MTIP. This figure confirms that there is
no capacity gain using MsCAC for serving CBR services. Figure 12.12 shows the need
for stable and reliable measurements in MsCAC systems. The performance characteristics
of a memoryless measurement-based and a memory (auto-regressive) measurement-based
system are presented, also for 4:3:3 MTIP. Estimations with the help of auto-regressive
filters may result in better performance, but more complex hardware/software is needed.
Figures 12.13-12.15 present the handoff failure and new-call blocking probabilities of the
SCAC system in comparison with the MdCAC system for 4:3:3, 5:4:1 and 7:2:1 MTIPs,
respectively. Figure 12.16 demonstrates that the SCAC system with the incomplete Gamma
decision function offers better average communication quality, but slightly worse
equilibrium blocking and dropping characteristics compared with the Gaussian function. The
freedom of choosing such decision functions to fulfill the performance requirements
increases the flexibility of the SCAC policy. The 4:3:3 MTIP is used in Figure 12.16. The
QoS loss probability vs the offered traffic in 7:2:1 MTIP is listed in Table 12.6 for different
CAC systems. Although the SCAC system suffers slight degradation of the communication
quality, it yields significant improvements in the handoff failure probability and in the
call blocking probability. The traffic shaping gain of the SCAC is clearly illustrated in
Figure 12.15, where the traffic intensity of the voice calls is really high. For predefined
performance requirements, e.g. less than 0.1 and 0.5 % handoff failure probability for voice
and other calls respectively, less than 1, 5 and 10 % new-call blocking probability for
class 1, class 2 and class 3, respectively, and 10 % allowable equilibrium outage probability,
the SCAC system overall offers much better Erlang capacity. Moreover, there is no need for
redesign of the capacity thresholds in the SCAC system as long as the range of allowable
uncertainty is maintained with the help of other control mechanisms such as TPC,
link-adaptation, etc. Thus, the robustness is also well improved over the MdCAC system. SCAC
has demonstrated an efficient RRU and capacity enhancement.

Figure 12.14 SCAC vs MdCAC in 5:4:1 MTIP.

Figure 12.15 SCAC vs MdCAC in 7:2:1 MTIP.

Figure 12.16 Flexibility of choosing decision function in the SCAC system.

Table 12.6 QoS loss probability in comparison

The three-class RT offered traffic, e.g. in 7:2:1 MTIP

Common divisor of the offered traffic   1        1.5      2        2.5      3
MdCAC                                   0        0        0        0.0001   0.0007
MsCAC                                   0        0.0002   0.0024   0.0114   0.0315
GaussianSCAC                            0        0        0.0001   0.0020   0.0119
IncGammaSCAC                            0        0        0.0001   0.0013   0.0072
SCAC with threshold QoSDiff             0        0        0        0.0001   0.0005
SCAC with fracturing QoSDiff            0        0        0        0.0003   0.0021

Figure 12.17 SCAC system with threshold hard blocking QoS differentiation.

With QoS differentiation, operators can customize the operation of serving networks.
Figure 12.17 presents the performance characteristics of the following simple scenario.
Users are divided into two user classes: business ($j = 1$) and economy ($j = 2$).
Requests of the business users for any RT services are served immediately as long as there
are enough resources to accommodate them. On the other hand, requests of the
economy users are served only if less than 70 % of effective resources are occupied by RT
traffic, i.e. system load state $c$ less than 0.5. Assume that demands for services of the user
classes are equal. Thus, arrival rates of new call requests from the user classes for each service
class are equal. Invoke assumption (3b) with $\lambda_{n\_1k} = \lambda_{n\_2k}$ for $k = 1, 2, 3$. The offered traffic
of e.g. 4:3:3 MTIP above can be split for each user class, resulting in 2:2:1.5:1.5:1.5:1.5
MTIP of six traffic classes. The factor $a_{0\_jk}(c)$ of admission probability in (12.56) can
be determined by (12.49), where the blocking threshold of the business class is $C_u$ and
that of the economy class is 0.5 for all $k$. This numerical example clearly demonstrates the
effects of QoS differentiation on performance characteristics. The business class not only
experiences much better GoS, but also better communication quality during the calls.

Figure 12.18 illustrates the QoS differentiation with load-based fracturing factors for soft
blocking of new calls of the economy class. This is based on the following simple scenario.
Again we assume handoff calls have the highest priority regardless of the associated user class.
The business class is served as long as resources are available. The economy class can
share the resources equally with the business class if less than 65 % of effective resources
are occupied, i.e. $c$ is less than 0.47. Otherwise, if $c$ is less than $C_l$, invoke (12.56) with
$a_{0\_21}(c) = 0.8$, $a_{0\_22}(c) = a_{0\_23}(c) = 0.6$. If $c$ is less than $C_u$, $a_{0\_21}(c) = 0.4$, $a_{0\_22}(c) =
a_{0\_23}(c) = 0.3$. Otherwise, $a_{0\_21}(c) = a_{0\_22}(c) = a_{0\_23}(c) = 0$.
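
The load-based fracturing factors of this scenario amount to a piecewise-constant function of the load state. The sketch below is a minimal illustration; the function name is hypothetical, the capacity limits are passed in as parameters because their values are not given in this passage, and treating 'equal sharing' below the 0.47 threshold as a factor of 1.0 is an assumption.

```python
def economy_fracturing_factors(c, c_l, c_u, share_limit=0.47):
    """Admission factors (class 1, class 2, class 3) for economy new calls.

    c is the RT load state; c_l and c_u are the lower and upper capacity
    limits. Below share_limit the economy class is treated like the
    business class, modelled here as a factor of 1.0 (assumption).
    """
    if c < share_limit:
        return (1.0, 1.0, 1.0)
    if c < c_l:
        return (0.8, 0.6, 0.6)
    if c < c_u:
        return (0.4, 0.3, 0.3)
    return (0.0, 0.0, 0.0)


if __name__ == "__main__":
    # Hypothetical capacity limits, chosen only to exercise every branch.
    for load in (0.30, 0.50, 0.70, 0.90):
        print(load, economy_fracturing_factors(load, c_l=0.6, c_u=0.8))
```

Threshold-based hard blocking, as in the previous scenario, is the special case in which the factors jump directly from 1 to 0.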

The offered traffic is the same as in the previous scenario. Figures 12.17 and 12.18
show that the performance characteristics of the system can easily be tuned by using either
threshold-based hard blocking or fracturing factor-based soft blocking paradigms. For NRT
packet radio access, additional parameters are given in Table 12.7.

The average upper-limit UL data throughput for packet transmissions with 64 kbs and
10 ms TTI is presented in Figure 12.19 vs different offered traffic intensities of the three
RT service classes, which are in 7:2:1, 5:4:1 and 4:3:3 MTIPs. The impacts of bit-rates
on average upper-limit throughput with a constant $T_p$ duration of 10 ms are presented in
Figure 12.20. Figure 12.21 illustrates the effects of $T_p$ with a constant bit-rate of 64 kbs.
Table 12.8 summarizes the mean values of aggregate RT traffic and quasi-stationary free
capacity over a $T_p$ time-interval of 10 ms. Figures 12.20 and 12.21 and Table 12.8 are for 7:2:1
MTIP. One can see that throughput characteristics are affected significantly by the dynamics
of RT traffic as well as the packet-transmission parameters. The results give valuable
quantitative measures for studying the design parameters and the performance tradeoffs of packet
access control schemes. This explains the motivation for using the DFIMA scheme presented
above, where the content of feedback information provides a 1:1 mapping of the optimal transport
format combination (including TPP, bit-rate, packet length or TTI) for packet transmission
in the next time-slot based on feasible free resource predictions.

For example, assume 50 % of the
cell capacity is occupied by the RT traffic of the 7:2:1 MTIP at the end of a given time slot of
10 ms. The possible bit-rates for packet transmissions are 32, 64 and 144 kbs. Consider
two cases of the NRT offered traffic: 10 or 50 active data users in the next time-slot. Using
Equations (12.65) and (12.66), one can predict the maximum numbers of successful packet
transmissions for each possible bit-rate, e.g. in our examples 16.91 packets of 32 kbs, 10.76
packets of 64 kbs and 5.53 packets of 144 kbs. Therefore, in the case of 10 active
users in the next time slot, the feedback information should tell them to transmit their packets
immediately with a bit-rate of 64 kbs. If 50 users want to transmit their packets, they have to
attempt with a TPP of 16/50 and a bit-rate of 32 kbs at the beginning of the next time slot. The
expected throughput in this case is about eight packets, since the free capacity is successfully
utilized for packet transmissions with a probability of 1/2. The performance of DFIMA
can be optimized with respect to TPP, bit-rate, TTI and QoS differentiation paradigms,
which makes it flexible and effective.

Figure 12.18 SCAC system with fracturing soft blocking QoS differentiation.

Table 12.7 Parameter summary for packet radio access

Symbol       Definition                          Values
$T_p$        Packet transmit duration            10, 20, 30 or 40 ms
$R_p$        Bit-rate for packet transmissions   32, 64, 144 or 384 kbs
$\gamma_p$   SIR target                          3, 2, 1.5 or 1 dB for the above bit-rates, respectively

Figure 12.19 Average upper-limit UL data throughput in different RT MTIPs.

Figure 12.20 Effects of the bit-rates to the throughput.

Figure 12.21 Effects of the packet transmission durations.

Table 12.8 Means of stationary RT aggregate traffic and quasi-stationary NRT resource
availability

The three-class RT offered traffic, e.g. in 7:2:1 MTIP

Common divisor of the offered traffic   1        1.5      2        2.5      3
$E[c]$                                  0.1500   0.2249   0.2988   0.3690   0.4305
$E[(z; T_p = 10 \text{ ms})]$           0.4609   0.3793   0.3032   0.2347   0.1772
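
The selection rule used in this example (pick the bit-rate whose $S(c^*)/D$ ratio is closest to 1, then set the TPP to $\min[1, S(c^*)/D]$) can be written in a few lines. The sketch below is illustrative: the function name and the dictionary layout are assumptions, while the three predicted upper limits are the values quoted in the example.

```python
def choose_feedback(upper_limits, active_users):
    """Pick the bit-rate whose S/D ratio is closest to 1 and the matching TPP.

    upper_limits maps bit-rate (kbs) -> predicted S(c*) for the next slot.
    """
    rate = min(upper_limits,
               key=lambda r: abs(upper_limits[r] / active_users - 1.0))
    tpp = min(1.0, upper_limits[rate] / active_users)
    return rate, tpp


predicted = {32: 16.91, 64: 10.76, 144: 5.53}   # values quoted in the text

if __name__ == "__main__":
    print(choose_feedback(predicted, 10))   # 64 kbs with TPP 1.0
    print(choose_feedback(predicted, 50))   # 32 kbs with TPP of about 0.34
```

For 50 users the sketch returns the unrounded ratio 16.91/50, whereas the example above rounds the number of packets down and quotes 16/50.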

12.4.16 Implementation issues 

Although the MdCAC and the SCAC policies are simple to implement without the need for any
special software and hardware, they may face a problem because the modeling parameters
are required a priori. Owing to the diverse nature of different traffic sources and their
often-complex statistics, some of the parameters may be hard to determine, and without them
the modeling-based CAC policies cannot operate. The soft-decision solutions are believed
to give more flexibility in determining the modeling parameters, and thus are well suited
to achieving good multiplexing gain and robustness. On the other hand, implementations
of the MsCAC policy require advanced hardware and software to ensure the reliability of
measurements. For this reason, it is not cost-effective. Moreover, estimation errors in some
circumstances may cause significant degradation of the system performance. However, the
advantage of MsCAC is that it seems 'insensitive' to the nature of the traffic and its operation is
robust. The network can learn and adapt to the statistics of the traffic even when the burstiness of
the traffic is considered out of control for the modeling-based systems. To balance all
design criteria, a hybrid soft-decision/measurement-based implementation is a reasonable
choice. Parameters needed for soft-decision functions, i.e. means and variances, can rely
on auto-regressive measurements. For such a solution, parameters and constraints can simply
be thresholds of the UL interference level on a cell and connection basis, an allowable outage
probability, estimates of the current total received interference level with its mean and
variance, etc. These are needed anyhow for the TPC mechanism of CDMA systems. For
implementations of the DFIMA scheme, measurements or estimations of the RT system load
state for CAC can be reused. The NRT offered traffic needs to be measured or estimated
for prediction of the optimal parameters (e.g. TPP, bit-rate, TTI) that are used as the content
of the feedback information. Look-up tables for transport format combinations of UL packet
transmission can be implemented or configured on both the mobile and access network sides in
order to minimize the size of the feedback information. Eight-bit feedback is enough to ensure
sufficient exchange of control information in DFIMA, even with QoS differentiation.
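
To illustrate why one byte can suffice, the sketch below packs a transport format combination into eight bits through small look-up tables. The bit layout (4 bits for a quantised TPP, 2 bits for the bit-rate index, 2 bits for the TTI index) and all names are hypothetical choices made for this example; only the bit-rate and duration values come from Table 12.7.

```python
TTI_MS = (10, 20, 30, 40)          # packet durations from Table 12.7
RATES_KBS = (32, 64, 144, 384)     # bit-rates from Table 12.7
TPP_LEVELS = 16                    # hypothetical 4-bit TPP quantisation


def encode_feedback(tpp, rate_kbs, tti_ms):
    """Pack (TPP, bit-rate, TTI) into one byte: 4 + 2 + 2 bits (assumed layout)."""
    level = min(TPP_LEVELS - 1, max(0, round(tpp * TPP_LEVELS) - 1))
    return (level << 4) | (RATES_KBS.index(rate_kbs) << 2) | TTI_MS.index(tti_ms)


def decode_feedback(byte):
    """Inverse of encode_feedback; the TPP is returned at quantised resolution."""
    tpp = ((byte >> 4) + 1) / TPP_LEVELS
    return tpp, RATES_KBS[(byte >> 2) & 0b11], TTI_MS[byte & 0b11]


if __name__ == "__main__":
    word = encode_feedback(0.32, 32, 10)
    print(word, decode_feedback(word))
```

Both sides need the same tables, which is why the text suggests configuring them in the mobile as well as in the access network; per-class QoS differentiation would need either additional table entries or one byte per user class.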



12.5 JOINT DATA RATE AND POWER MANAGEMENT 

As already seen from the previous discussion, the radio resource manager (RRM) contains
a number of sub-blocks such as the connection admission controller, the traffic classifier, the
radio resource scheduler and the interference and noise measurements. The main role of
the RRM is to manage the different available resources to achieve a list of target QoS.
The radio resource scheduler (RRS) is an essential part of the RRM. The RRS has two
important radio resources to control: MS transmitting power and transmitted data rate.
The RRS uses those two resources to achieve different objectives such as maximizing the
number of simultaneous users, reducing the total transmitting power, or increasing the total
throughput. The conventional way to achieve these objectives is to select one of them as
a target to optimize and use the other objectives as constraints. More sophisticated algorithms
based on multiobjective (MO) optimization and Kalman filter techniques have also been
proposed. Here we address the problem of how to combine the power and rate in an optimum
way.

Shannon's equation shows that the achievable information rate in a radio channel
is an increasing function of the signal-to-interference and noise ratio (SINR). Increasing the
information rate in data communication systems is therefore restricted by the SINR. Increasing
the SINR can be done in two ways. The first way is by reducing the total interference
and noise affecting that user. This depends on some characteristics of the noise and the
interference. For example, if the structure of the interference from other users is known at
the receiver then, by applying one of the multi-user detection methods, that interference can
be reduced. Also, if the users are spatially distributed then the interference can be reduced by
using a multi-antenna system (see Section 12.1). If the users concurrently use the channel
(as in DS-CDMA) then the interference can be reduced using power control techniques.
From previous studies we can see that some characteristics of the interference are assumed
to be known or can be controlled. There are many sources of interference and noise that
cannot be reduced in this first way, such as thermal noise, interference from other cells etc.
The second way of increasing the SINR is simply by increasing the transmitted power. In
single-user communication (point-to-point) or in broadcasting, this can be an acceptable
solution, and the main disadvantages are the cost and the nonlinearities in the power
amplifiers. However, in a multiuser communication environment, increasing the transmitted
power means more co-channel and cross-channel interference problems.

Therefore, joint control of the data rate and the transmitted power is an important
topic in modern communication systems. Modern communication systems (3G or 4G)
support multirate data communication because they are designed not only for
voice communication but also for data and multimedia communication.

An efficient combining algorithm for the power control and the rate control is required 
for these systems. The term 'efficient' here refers to optimization of the transmitted power 
and data rate to meet the required specifications. There are many proposed combining 
algorithms for power and rate control in the literature. The objectives of those algorithms 
are quite varied. Some algorithms suggest maximizing the throughput; others minimizing 
the packet delay or minimizing the total power consumption. 

The 3G/4G mobile communication systems based on WCDMA support multirate
transmission. There are mainly two methods to achieve multirate transmission:
the multicode (MC) scheme and the variable-spreading length (VSL) scheme. In the
MC-CDMA system, all the data signals over the radio channel are transmitted at a basic
rate, $R_b$. Any connection can only transmit at rates $mR_b$, referred to as $m$-rate, where $m$
is a positive integer. When a terminal needs to transmit at $m$-rate, it converts its data stream,
serial-to-parallel, into $m$ basic-rate streams. Then each stream is spread using a different
orthogonal code. In a VSL-CDMA system, the chip rate is fixed at a specified value (3.84
Mchip/s for UMTS) and the data rate can take different values. This means that the processing
gain (PG) is variable. The processing gain can be defined as the number of chips per symbol.
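
The two multirate mechanisms can be illustrated with a pair of one-line helpers. This is a sketch under stated assumptions: the function names are hypothetical, the processing gain is computed simply as chips per symbol for a fixed chip rate, and the serial-to-parallel conversion deals bits to the $m$ streams in round-robin order.

```python
def processing_gain(chip_rate, symbol_rate):
    """Chips per symbol in a VSL-CDMA system with a fixed chip rate."""
    return chip_rate / symbol_rate


def multicode_split(bits, m):
    """Serial-to-parallel conversion of a bit list into m basic-rate streams."""
    return [bits[k::m] for k in range(m)]


if __name__ == "__main__":
    # A 15 ksymbol/s stream at the UMTS chip rate (illustrative rate).
    print(processing_gain(3.84e6, 15e3))
    # A 3-rate MC-CDMA connection: each sub-stream gets its own code.
    print(multicode_split([1, 0, 1, 1, 0, 0], 3))
```

In the VSL case a higher data rate directly lowers the processing gain, whereas in the MC case every sub-stream keeps the basic-rate processing gain and the cost appears as additional parallel codes.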

12.5.1 Centralized minimum total transmitted power (CMTTP) algorithm 

The mathematical formulation of the CMTTP problem is to find the power vector $P =
[P_1, \ldots, P_Q]^T$ and the rate vector $R = [R_1, \ldots, R_Q]^T$ minimizing the cost function:
(12.68)
and



\[ P_{\min} \le P_i \le P_{\max}, \quad R_i \ge r_i, \quad \forall i = 1, \ldots, Q \qquad (12.70) \]



where $\delta_i$ is the minimum required SINR for user $i$ and $r_i$ is the minimum rate limit for user $i$.
The problem presented in Equations (12.68)-(12.70) can be reduced to a system of linear
equations. If the constraints in Equations (12.69)-(12.70) cannot be achieved, then the problem
is called infeasible. In this case either some user should be dropped from this link or some
of the constraints should be relaxed. At the optimal solution, all QoS constraints are met
with equality. Also, the optimal power vector is the one that achieves all rate constraints
with equality. So, the optimum rate vector is $R^* = [r_1, \ldots, r_Q]^T$. The corresponding power
vector can be obtained by solving the QoS equation. This is a system of linear equations in
power. From Equation (12.69) we have
where $\delta_i$ is the target SINR for user $i$. Let $\Gamma_i = \delta_i r_i/R_s$ and substitute it into Equation
(12.71), to obtain

$\Gamma = \mathrm{diag}[\Gamma_1 \ldots \Gamma_Q]$

Then the optimum power vector is

\[ P^* = [I - \Gamma H]^{-1}\, \Gamma \eta \]



(12.72) 

(12.73) 

(12.74) 
(12.75) 

(12.76) 



In order to obtain a nonnegative solution of Equation (12.77), the following condition should
hold:

\[ \rho(\Gamma H) < 1 \]

where $\rho(A)$ is the spectral radius of matrix $A$.
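
The linear system behind the optimum power vector can be solved by a simple fixed-point iteration, which converges exactly when the spectral-radius condition holds. The sketch below is a hedged illustration, not the formulation of the text: it assumes a normalized gain matrix with entries $H_{ij} = G_{ij}/G_{ii}$ for $j \ne i$ (zero on the diagonal) and normalized noise $\eta_i = N_i/G_{ii}$, neither of which is spelled out in the surviving passage, and all names and numbers are hypothetical.

```python
def cmttp_power(G, noise, gamma, tol=1e-12, max_iter=10_000):
    """Minimum-power vector meeting the targets gamma_i with equality.

    Assumed model (not spelled out in the surviving text):
      H[i][j] = G[i][j] / G[i][i] for j != i, H[i][i] = 0
      eta[i]  = noise[i] / G[i][i]
    The fixed point of P = Gamma*H*P + Gamma*eta equals
    [I - Gamma*H]^{-1} * Gamma*eta and exists when rho(Gamma*H) < 1.
    """
    q = len(G)
    P = [0.0] * q
    for _ in range(max_iter):
        new = [
            gamma[i]
            * (sum(G[i][j] * P[j] for j in range(q) if j != i) + noise[i])
            / G[i][i]
            for i in range(q)
        ]
        if max(abs(a - b) for a, b in zip(new, P)) < tol:
            return new
        P = new
    raise ValueError("no convergence: the problem is probably infeasible")


if __name__ == "__main__":
    G = [[1.0, 0.1], [0.2, 1.0]]        # hypothetical link gains
    print(cmttp_power(G, noise=[0.01, 0.01], gamma=[0.5, 0.5]))
```

If the iteration fails to converge, the targets are too demanding for the given link gains, which corresponds to the infeasible case in which a user is dropped or a constraint is relaxed.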



12.5.2 Maximum throughput power control (MTPC) 

This algorithm has been suggested in Chawla and Qiu [85]. The algorithm is based on the
maximization of the total throughput in a cellular system. There is no need to generate
all solutions in this method. Since the link gains and the interference of other users are
needed to calculate the transmitted power of each user, the MTPC algorithm is a centralized
algorithm. When M-QAM modulation is used, the throughput of user $i$ can be approximated
by

\[ T_i = \Theta + \log_2(\Gamma_i) \]

where $T_i$ is the throughput of user $i$, $\Theta$ is a constant, and $\Gamma_i$ is the CIR of user $i$, which is
given in general by

\[ \Gamma_{ki} = \frac{P_i G_{ki}}{\sum_{j \ne i} P_j G_{kj} + N_i}, \quad i = 1, \ldots, Q, \ k = 1, \ldots, M \]

where $Q$ is the number of mobile stations, $M$ is the number of base stations and $G_{kj}$ is the channel
gain between mobile station $j$ and base station $k$. The total throughput $T$ is given by

\[ T = \sum_{i=1}^{Q} T_i = Q\Theta + \log_2\left( \prod_{i=1}^{Q} \Gamma_i \right) \qquad (12.77) \]

where $Q$ is the number of users.

Now the problem can be defined as follows: given the link gains $G_{ij}$ of the users, what
is the power vector $P = [P_1, P_2, \ldots, P_Q]^T$ that maximizes the total throughput? Since
the first term in Equation (12.77) is constant and the logarithmic function is an increasing
function, maximizing the multiplicative term

\[ \prod_{i=1}^{Q} \Gamma_i \]

will maximize the total throughput $T$. The problem considered in Chawla and Qiu
[85] is

\[ \max_{P} \prod_{i=1}^{Q} \Gamma_i \quad \text{s.t. } P \in \Omega \qquad (12.77a) \]

where $\Omega = \{P \mid P_{\min} \le P_i \le P_{\max},\ i = 1, \ldots, Q\} \subset \mathbb{R}^Q$. The MTPC algorithm to solve
Equation (12.77a) is given by

\[ P_k(t + 1) = \frac{1}{\sum_{i \ne k} \dfrac{G_{ik}}{\sum_{j \ne i} P_j(t) G_{ij} + N_i}}, \quad t = 0, 1, \ldots; \ k = 1, \ldots, Q \qquad (12.78) \]

\[ P_{\min} \le P_k(t + 1) \le P_{\max} \]

where Gjj is the channel gain between user j and base station i . Without loss of generality 
user i is assumed to be assigned to base station i . Starting from any initial vector P(0)e £1, the 
iteration specified by Equation( 12.78) converges to a unique point P* e £2, which achieves 
the global maximum [85]. 
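
As an illustration, one synchronous pass of the iteration in Equation (12.78) can be sketched
in Python. This is a minimal sketch under stated assumptions, not the implementation of [85]:
the function name `mtpc_step`, the NumPy array layout (`G[i, j]` is the gain from user j to
base station i, `N[i]` is the noise power at base station i) and the simultaneous update of
all users are choices made here for illustration, and at least two users are assumed.

```python
import numpy as np

def mtpc_step(P, G, N, p_min, p_max):
    """One synchronous MTPC update for all users (hypothetical helper).

    P: current power vector, G[i, j]: gain from user j to base station i,
    N: noise power per base station. User i is assigned to base station i.
    """
    Q = len(P)
    # Interference plus noise at base station i: sum_{j != i} G[i, j] * P[j] + N[i]
    interference = G @ P - np.diag(G) * P + N
    P_new = np.empty(Q)
    for k in range(Q):
        others = np.arange(Q) != k
        # P_k(t+1) = 1 / sum_{i != k} G[i, k] / interference[i]
        P_new[k] = 1.0 / np.sum(G[others, k] / interference[others])
    # Keep the result inside the admissible power set
    return np.clip(P_new, p_min, p_max)

G = np.array([[1.0, 0.1], [0.1, 1.0]])
N = np.array([0.01, 0.01])
P = np.array([1.0, 1.0])
for _ in range(50):
    P = mtpc_step(P, G, N, p_min=0.1, p_max=2.0)
print(P)
```

In this two-user toy case the product of the CIRs grows with both powers, so the iteration
is driven to the upper bound of the admissible set.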

12.5.3 Statistically distributed multirate power control (SDMPC)

A distributed solution of the optimization problem given by Equations (12.68)-(12.70) is
proposed for the single-cell case in Morikawa et al. [80]. It is assumed that every user has
two states, active (ON) or idle (OFF). The transition probability of the ith user from the idle
to the active state in any packet slot is $\nu_i$, and from the active to the idle state it is
$\xi_i$. The durations of the active and idle periods are geometrically distributed with means
of $1/\xi_i$ and $1/\nu_i$ (in packet slots), respectively. The optimization problem of
Equations (12.68)-(12.70) is slightly modified as

(12.81)



One parameter has been added to the original optimization problem: the indicator
function $p_j(t)$, which is equal to one if the jth user is currently active and zero
otherwise. It is assumed in Morikawa et al. [80] that the random process $p_j(t)$ is
Markovian, since the geometric distribution of the active and idle durations is memoryless.
The centralized solution (if the system is feasible) is given by

where



Yi = 



(12.83) 



The main idea behind the SDMPC algorithm is to estimate the other users' information part,
that is, the term $\sum_{j=1}^{Q} p_j(t) Y_j$. The Markovian property of the random
process $p_j(t)$ has been exploited to obtain a good estimate of the other users' information
part. The SDMPC algorithm is given by

where $p(t)$ is the estimate of $\sum_{j=1}^{Q} p_j(t) Y_j$.

The estimated parameter $p(t)$ has been derived in Morikawa et al. [80] for two cases: (i)
there is no 'collision' at t; and (ii) a 'collision' occurs at t. The algorithm has at least
three drawbacks. First, in a cellular CDMA system there is a control channel that is always
active (when the mobile phone is ON). Second, the SDMPC algorithm assumes that the channel
gain and the average power of the additive noise are known. In reality they must be estimated
as well, and a good estimate of the channel gain and the noise variance is usually difficult
to obtain. In practice it is easier to estimate the CIR or SINR because they have a direct
impact on the BER. Finally, the durations of the active and idle periods are assumed to be
geometrically distributed. This assumption is oversimplified and far from reality.



12.5.4 Lagrangian multiplier power control (LRPC)

As mentioned previously, the data rates which can be achieved belong to a set of integers.
In the formulation of the optimization problem that maximizes the data rate, we assume that
the data rate is continuous. This assumption can be relaxed in the simulation by rounding
the optimum data rate down to the nearest value in the data rate set. It can be proven that
the solution of the optimization problem under the continuity assumption is not necessarily
the same as the solution of the actual discrete problem. The advantage of the LRPC algorithm
is that the optimization problem is formulated without the continuity assumption on the data
rates [81]. It is assumed that each user has a set of m transmission rates
$M = \{r_1, r_2, \dots, r_m\}$ to choose from. Let the rates be ordered in an ascending way,
i.e. $r_1 < r_2 < \dots < r_m$. To properly receive messages at transmission rate $r_k$, mobile i
is expected to attain $\Gamma_i(P) \ge \Gamma_k^T$, where $\Gamma_k^T$ is the target CIR.
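
The rounding step mentioned above, where a continuous optimum rate is mapped down to the
nearest allowed rate, can be sketched as follows. This is only an illustration: the helper
name `floor_to_rate_set` and the example rate values are hypothetical and are not taken from
[81].

```python
from bisect import bisect_right

def floor_to_rate_set(rate, rates):
    """Largest rate in the ascending list `rates` that does not exceed `rate`.

    Returns None when even the smallest rate in the set is too high.
    """
    idx = bisect_right(rates, rate)
    return rates[idx - 1] if idx > 0 else None

# Hypothetical ascending rate set r_1 < r_2 < ... < r_m (kb/s)
rate_set = [15, 30, 60, 120]
print(floor_to_rate_set(75.0, rate_set))
```

A return value of None corresponds to a user that cannot be supported even at the minimum
rate of the set.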

Define $Y = [y_i^k]$ to be a 0-1 matrix such that, for every mobile i and rate $r_k$,

$$y_i^k = \begin{cases} 1, & \text{if mobile } i \text{ is transmitting with rate } r_k \\ 0, & \text{otherwise} \end{cases}$$

The combined rate and power control is formulated as the following optimization problem 
[81] 
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where B\ is an arbitrary large number satisfying 
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' ~ p r,(P) 

The above optimization problem is solved using the Lagrangian multiplier method. The main
goal of the LRPC algorithm is to maximize the total throughput of the system. Although
LRPC improves the system throughput, its power consumption for the supported users and its
outage probability are rather high. It is therefore not recommended for systems where
fairness is an important issue.

12.5.5 Selective power control (SPC)

The SPC algorithm was suggested in Kim et al. [81]. It is a logical extension of the DCPC
algorithm [82]. The main idea of the SPC algorithm is to adapt the target CIR of each user
so as to utilize any available resources. The suggested SPC algorithm is given by

$$P_i(t+1) = \max_k \left\{ \frac{P_i(t)\, \Gamma_k^T}{\Gamma_i(P(t))}\; \chi\!\left( \frac{P_i(t)\, \Gamma_k^T}{\Gamma_i(P(t))} \le P_{\max} \right) \right\}, \quad t = 0, 1, \dots, \; i = 1, \dots, Q$$

where $\Gamma_k^T$ is the target CIR for rate $r_k$ and $\chi(E)$ is the indicator function of
the event E. Although the SPC algorithm improves the outage probability compared with the
LRPC algorithm, its outage is still high.
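
The update rule above can be read as follows: for every candidate target CIR, compute the
power a DCPC-style step would request, discard the requests that exceed the maximum power,
and keep the largest remaining one. A minimal sketch for a single user is given below; the
function name `spc_update` and the numerical values are assumptions made for illustration
and do not come from [81].

```python
def spc_update(p, cir, target_cirs, p_max):
    """One selective power control step for one user (hypothetical helper).

    p: current power, cir: current CIR, target_cirs: target CIR per allowed rate.
    Returns the largest requested power that does not exceed p_max, or 0.0 if
    no target is reachable (the indicator function is zero for every rate).
    """
    candidates = [p * target / cir for target in target_cirs]
    feasible = [c for c in candidates if c <= p_max]
    return max(feasible, default=0.0)

# Current power 1.0, current CIR 2.0, three candidate target CIRs, power cap 5.0
print(spc_update(1.0, 2.0, [1.0, 4.0, 16.0], 5.0))
```

In this example the requests are 0.5, 2.0 and 8.0; the last one exceeds the cap, so the
user selects the power that corresponds to the second target.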

Jantti suggested an improved version of the SPC algorithm, called selective power
control with active link protection (SPC-ALP) [83]. The SPC-ALP algorithm has a lower
outage probability and better performance than the SPC algorithm. The main idea
of the SPC-ALP algorithm is to admit new users into the network with at least the minimum
data rate and also, if possible, to allow old users to choose higher data rates. This is done
by defining three different modes of operation for each user:

• Standard mode, where the user updates its power using the SPC algorithm. In this mode the
rate cannot be increased but it can be decreased if needed. If there are more resources to
be utilized by increasing the rate, the user mode is changed to the transition mode.

• Transition mode, where the user updates its power using the ALP algorithm. The rate is
also adapted to the maximum rate that can be supported.

• Passive mode, where the user stops its transmission.

More details about the SPC-ALP algorithm can be found in Jantti and Kim [83].

12.5.6 RRM in multiobjective (MO) framework 

The QoS can be defined for a set of factors. In this section we consider only the BER
and the user data rate in the uplink. The objectives of the RRS could be defined as

(1) Minimize the total transmitting power. 

(2) Achieve the target SINR in order to achieve a certain BER level (depends on the 
application). 

(3) Maximize the fairness between the users. In our definition, the system is fair as 
long as each user is supported by at least its minimum required QoS. In this sense, 
minimizing the outage probability leads to maximizing the fairness. 

(4) Maximize the total transmitted data rate or at least achieve the minimum required 
data rate. 

It is clear that objective (1) is totally conflicting with objective (4) and partially
conflicting with objective (2). Objective (3) is totally incompatible with objective (4).
Objective (2) is partially contradictory with objective (4).

So far, the RRM problem has been formulated as a single-objective (SO) optimization problem
in which the other parameters are treated as constraints. Solving objectives (1)-(4)
simultaneously with an MO optimization technique leads to a more general solution than
the conventional methods. In this section we discuss an MO optimization method to solve
the RRM problem. In subsequent subsections we will discuss some radio resource scheduler
algorithms based on MO optimization. The field is very wide, and many different
algorithms and methods can be derived based on MO optimization. One formulation of the
RRS optimization problem can be defined as:

where $O_P$ is the outage probability. The outage probability is defined as the probability
that a user cannot achieve at least the minimum required QoS. We can see that $O_P$
reflects the fairness situation in the communication system. The minus sign associated with
the sum of the rate function in Equation (12.90) refers to the maximization of the
total utility functions.

Defining the objectives and the constraints is the first step. Selecting the proper MO 
optimization method to solve the problem is the second step. Then the (weakly) Pareto 
optimal set of solutions is generated, where every solution is optimal in a different sense. 
Finally, the decision maker selects the optimum solution from the optimal set which best 
achieves the required specifications. In this section we discuss a framework to use the MO 
optimization techniques in RRM. 
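
To make the notion of a Pareto optimal set concrete, the sketch below filters a list of
candidate operating points and keeps only those that are not dominated by any other point.
It is an illustrative helper, not part of any RRM algorithm in this chapter: the function
name `pareto_front`, the convention that every objective is minimized, and the sample points
(total power, outage probability) are assumptions made here.

```python
def pareto_front(points):
    """Return the points that no other point dominates (all objectives minimized).

    A point q dominates p if q is no worse in every objective and strictly
    better in at least one.
    """
    front = []
    for p in points:
        dominated = any(
            all(qi <= pi for qi, pi in zip(q, p)) and any(qi < pi for qi, pi in zip(q, p))
            for q in points
        )
        if not dominated:
            front.append(p)
    return front

# Hypothetical candidates: (total transmitted power, outage probability)
candidates = [(1.0, 0.5), (2.0, 0.2), (3.0, 0.3), (5.0, 0.1)]
print(pareto_front(candidates))
```

Every point that survives the filter is optimal in a different sense, and the final choice
among them is left to the decision maker, as described above.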



12.5.7 Multiobjective distributed power and rate control (MODPRC) 

The algorithm is based on minimizing a multiobjective definition of an error function. In
this algorithm we define three objectives: (1) minimize the transmitted power; (2) achieve
at least the minimum CIR, which is defined at the minimum data rate; and (3) achieve
the maximum CIR, which is defined at the maximum data rate. An optimized solution can
be obtained using MO optimization.

The derivations of the algorithms are based on a VSL-CDMA communication system.
After the despreading process at the receiver, the SINR is

$$\delta_i(t) = \frac{R_s}{R_i(t)} \Gamma_i(t), \quad t = 0, 1, \dots \tag{12.92}$$

where $\delta_i(t)$ is the SINR of user i at t, $R_s$ is the fixed chip rate (3.84 Mchip/s for
UMTS), $R_i(t)$ is the data rate of user i at t, and $\Gamma_i(t)$ is the CIR of user i at t.
In wireless and digital communication, it is well known that the BER is a decreasing function
of the SINR. For coherent binary PSK, when the interference is assumed Gaussian, the BER can
be approximated by

$$\mathrm{BER}_{\mathrm{PSK}} = \frac{1}{2} \operatorname{erfc}\left( \sqrt{\delta} \right) \tag{12.93}$$

For example, if the BER should not be more than $10^{-4}$, then the target SINR obtained from
Equation (12.93) is approximately $\delta^T \ge 8.4$ dB. In the case of fixed data rate power
control there is one target CIR that corresponds to the target SINR, because there is only one
spreading factor value. In the case of multirate services there are different target CIR
values corresponding to the target SINR. From Equation (12.92) it is clear that, for a
constant target SINR, maximizing the CIR leads to maximizing the data rate as follows:

$$R_i(t) = \frac{R_s}{\delta_i^T} \Gamma_i(t), \quad t = 0, 1, \dots \tag{12.94}$$

Trying to achieve the maximum CIR for all users will end up in a high outage probability. If
there is a reasonable dropping algorithm, then only one or a few users will be supported [84].
To reduce the outage probability, we define the target CIR at the minimum transmitted
rate as

$$\Gamma_{i,\min} = \frac{R_{i,\min}}{R_s} \delta_i^T \tag{12.95}$$

We also define the maximum CIR, which is defined at the maximum transmitted rate, as

$$\Gamma_{i,\max} = \frac{R_{\max}}{R_s} \delta_i^T \tag{12.96}$$

The target SINR, the minimum/maximum CIR, and the minimum/maximum data rate are
time-dependent, but we dropped the time symbol (t) for simplicity. In UMTS specifications
the power is updated on a slot-by-slot basis. The data rate is updated on a frame-by-frame
basis. To generalize the analysis, we use the same time symbol for power and rate.
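
The chain from a BER requirement to a target SINR and then to a target CIR can be checked
numerically. The sketch below inverts Equation (12.93) by bisection and applies the
relation of Equation (12.92); the function names and the bisection bounds are assumptions
made for illustration. Solving numerically for a BER of $10^{-4}$ gives a target SINR of
about 8.4 dB, close to the value quoted in the text.

```python
import math

def ber_bpsk(sinr):
    """BER of coherent binary PSK for a linear-scale SINR, as in Equation (12.93)."""
    return 0.5 * math.erfc(math.sqrt(sinr))

def target_sinr(ber_max, lo=0.0, hi=100.0, iters=100):
    """Smallest linear SINR with ber_bpsk(sinr) <= ber_max, found by bisection."""
    for _ in range(iters):
        mid = 0.5 * (lo + hi)
        if ber_bpsk(mid) > ber_max:
            lo = mid  # BER still too high: need more SINR
        else:
            hi = mid
    return hi

def target_cir(rate, chip_rate, sinr):
    """CIR needed to reach `sinr` at data rate `rate`, from Equation (12.92)."""
    return rate / chip_rate * sinr

delta_t = target_sinr(1e-4)
print(10 * math.log10(delta_t))            # target SINR in dB
print(target_cir(15e3, 3.84e6, delta_t))   # target CIR at 15 kb/s, linear scale
```

Because the ratio of data rate to chip rate is small at low rates, the CIR required at
15 kb/s is far below the target SINR, which is the processing gain at work.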

To increase the fairness, the users should achieve at least the minimum target CIR,
which corresponds to the minimum transmitted rate (e.g. 15 kb/s in UMTS). The multirate
power control problem is defined as follows: given the target SINR vector
$\delta^T = [\delta_1^T, \delta_2^T, \dots, \delta_Q^T]'$, the minimum requested data rate vector
$R_{\min} = [R_{1,\min}, R_{2,\min}, \dots, R_{Q,\min}]'$ and, without loss of generality, assuming
the maximum allowed data rate $R_{\max}$ to be the same for all users, find the optimum power
vector $P = [P_1, P_2, \dots, P_Q]'$ and the optimum rate vector $R = [R_1, R_2, \dots, R_Q]'$
that minimize the following cost function

, t = 1, ..., N, (12.97)

$$P_{\min} \le P_i \le P_{\max}, \quad i = 1, \dots, Q \tag{12.98}$$

N is the optimization time window and $\gamma$ is a real-valued constant adaptation factor.
The notation ( )' is used for the transpose. The error $e_i(t)$ is defined according to the
weighted metrics method

$$e_i(t) = \lambda_{i,1} |P_i(t) - P_{\min}| + \lambda_{i,2} |\Gamma_i(t) - \Gamma_{i,\min}| + \lambda_{i,3} |\Gamma_i(t) - \Gamma_{i,\max}| \tag{12.99}$$

where $0 \le \lambda_{i,1}, \lambda_{i,2}, \lambda_{i,3} \le 1$ are real-valued, constant tradeoff
factors with $\sum_{k=1}^{3} \lambda_{i,k} = 1$. The advantages of joining the weighted metrics
method with the least squares formula of Equation (12.97) are:

• The least squares method is well known and its derivation is straightforward. 

• A general solution is obtained using Equation (12.97), minimizing over all users and for 
time window N. 

The error function, Equation (12.99), is the mathematical interpretation of the RRM
objectives (1)-(4). The first term of Equation (12.99) keeps the transmitted
power $P_i(t)$ as close as possible to $P_{\min}$, so we try to achieve objective (1).
Objectives (2) and (3) are addressed by the second term of the error function, where the
transmitted power is selected to obtain a CIR very close to the minimum required CIR.
Achieving the minimum required QoS for every user maximizes the fairness in the cell. The
third term in Equation (12.99) represents objective (4), where the users try to obtain the
maximum allowed QoS if possible.

By solving Equations (12.97) and (12.99) (using the same procedure as for the MODPC
algorithm) for the one-dimensional (N = 1) case we obtain, for i = 1, ..., Q:

$$P_i(t+1) = \frac{\lambda_{i,1} P_{\min} + \lambda_{i,2} \Gamma_{i,\min} + \lambda_{i,3} \Gamma_{i,\max}}{\lambda_{i,1} P_i(t) + (\lambda_{i,2} + \lambda_{i,3}) \Gamma_i(t)} P_i(t) \tag{12.100}$$

and as before

$$R_i(t+1) = \frac{R_s}{\delta_i^T} \Gamma_i(t) \tag{12.101}$$

$$P_{\min} \le P_i(t) \le P_{\max}; \quad R_{i,\min} \le R_i(t) \le R_{\max} \tag{12.102}$$



If the minimum solution places demands on some users that cannot be achieved,
then dropping or the handoff process should be applied [84]. The multirate power control
algorithm given by Equations (12.100)-(12.102) has some interesting characteristics. By
changing the values of the tradeoff factors $\lambda_{i,k}$, different solutions with different
meanings are obtained. For example, when $\lambda_{i,1} = 1$, $\lambda_{i,2} = 0$ and
$\lambda_{i,3} = 0$, it is clear that Equation (12.100) is reduced to fixed-level (no) power
control and user i will send at minimum power. For $\lambda_{i,1} = 0$, $\lambda_{i,2} = 1$ and
$\lambda_{i,3} = 0$, Equation (12.100) becomes the distributed power control (DPC) algorithm of
Grandhi and Zander [82]. In this case, the fairness is maximized. When $\lambda_{i,1} = 0$,
$\lambda_{i,2} = 0$ and $\lambda_{i,3} = 1$, Equation (12.100) will maximize the average
transmitted rate (using a reasonable dropping algorithm for nonsupported users).
In this case only one or a few users will be supported, so the outage probability will be high.
Starting from these extreme cases, one can make a tradeoff between the objectives to get
the best performance according to the required specifications. The selection of the tradeoff
values should be based on the communication link condition as well as the network and
the user requirements. A wide range of different solutions can be obtained by changing the
values of the tradeoff factors. The selection of one solution is a job for the decision maker.
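
The effect of the tradeoff factors can be seen in a small sketch of one update for a single
user, following Equations (12.100)-(12.102). The function name `modprc_step`, the argument
order and the numerical values used in the example are assumptions made for illustration
only.

```python
def modprc_step(p, cir, lam, p_min, p_max, cir_min, cir_max,
                chip_rate, sinr_target, r_min, r_max):
    """One MODPRC update for a single user; returns (new power, new rate).

    lam = (lam1, lam2, lam3) are the tradeoff factors, assumed to sum to one.
    """
    lam1, lam2, lam3 = lam
    # Equation (12.100): weighted targets over weighted current state, times current power
    numerator = lam1 * p_min + lam2 * cir_min + lam3 * cir_max
    denominator = lam1 * p + (lam2 + lam3) * cir
    p_new = min(max(p * numerator / denominator, p_min), p_max)
    # Equation (12.101): rate supported by the current CIR at the target SINR,
    # limited to the allowed range of Equation (12.102)
    rate = min(max(chip_rate / sinr_target * cir, r_min), r_max)
    return p_new, rate

# Hypothetical operating point: power 0.5, CIR 0.02, target SINR 7.0 (linear)
args = dict(p_min=0.01, p_max=1.0, cir_min=0.03, cir_max=0.5,
            chip_rate=3.84e6, sinr_target=7.0, r_min=15e3, r_max=960e3)
print(modprc_step(0.5, 0.02, (1.0, 0.0, 0.0), **args))  # fixed-level case
print(modprc_step(0.5, 0.02, (0.0, 1.0, 0.0), **args))  # DPC-like case
```

With all the weight on the first factor the user falls back to the minimum power, while
with all the weight on the second factor the power is scaled by the ratio of the minimum
target CIR to the current CIR, which is the DPC behaviour described above.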



12.5.8 Multiobjective totally distributed power and rate control (MOTDPRC)

In this section, we discuss a slight modification of the MODPRC algorithm, the totally
distributed algorithm. The MODPRC algorithm, Equations (12.100)-(12.102), assumes
the availability of the actual CIR value. In existing and near-future cellular systems,
only an up-down power command is available at the MS. The MOTDPRC algorithm therefore
uses an estimated CIR. The CIR (in dB) could be estimated as

$$\hat{\Gamma}_i(t)_{\mathrm{dB}} = \Gamma_i^T(t)_{\mathrm{dB}} - e_i(t), \quad t = 0, 1, \dots \tag{12.103}$$

where $e_i(t)$ is the power control error, $\Gamma_i^T(t)$ is the target CIR, and
$\hat{\Gamma}_i(t)$ is the estimated CIR. Using the estimated CIR in the MODPRC algorithm we
obtain

$$P_i(t) = \frac{\lambda_{i,1} P_{\min} + \lambda_{i,2} \Gamma_{i,\min} + \lambda_{i,3} \Gamma_{i,\max}}{\lambda_{i,1} P_i(t-1) + (\lambda_{i,2} + \lambda_{i,3}) \hat{\Gamma}_i(t)} P_i(t-1) \tag{12.104}$$

$$R_i(t) = \frac{R_s}{\delta_i^T} \hat{\Gamma}_i(t) \tag{12.105}$$

$$P_{\min} \le P_i(t) \le P_{\max}; \quad R_{i,\min} \le R_i(t) \le R_{\max} \tag{12.106}$$



12.5.9 Throughput maximization/power minimization (MTMPC) 

Another application of the MO optimization in the RRM can be achieved by modifying the 
maximum throughput power control algorithm. The algorithm was based on maximizing 
the throughput and ignoring the transmitted power levels. In practice, reducing the trans- 
mitted power is very desirable. In this section we will formulate the cost function with two 
objectives. The first objective is the maximization of the total throughput as in Chawla and 
Qiu [85]. The second objective is to minimize the total transmitted power. The approach is 
treating the total throughput maximization and the total power minimization simultaneously 
using multiobjective optimization techniques. 

The problem is defined as follows: given the link gains of the users, find the power
vector which increases the total throughput (as much as possible) and at the same time
reduces the total transmitted power (as much as possible). The problem can be represented
mathematically as

$$\max \{ O_1(P), -O_2(P) \} \quad \text{s.t.} \quad P \in \Omega \tag{12.107}$$

where $P = [P_1, P_2, \dots, P_Q]'$ is the power vector, the objective functions are

$$O_1(P) = \prod_{i=1}^{Q} \Gamma_i \quad \text{and} \quad O_2(P) = \sum_{i=1}^{Q} P_i$$

and the admissible power set is $\Omega = \{P \mid P_{\min} \le P_i \le P_{\max},\ i = 1, \dots, Q\}$.
The minus sign is used to minimize the second objective. We will use the weighting method to
solve the multiobjective optimization problem. The idea of the weighting method is to
associate each objective function with a tradeoff factor (weighting coefficient) and to
maximize (or minimize) the weighted sum of the objectives [86]. Applying the weighting method
to our problem we obtain

$$\max_{P} \{ O(P) \} \quad \text{s.t.} \quad P \in \Omega \tag{12.108}$$

where

$$O(P) = \lambda_1 \prod_{i=1}^{Q} \Gamma_i(P) - \lambda_2 \mathbf{1}'P \tag{12.109}$$

is the multiobjective function, $\mathbf{1} = [1, 1, \dots, 1]'$, and the tradeoff factors are
real numbers, $0 \le \lambda_1 \le 1$ and $\lambda_2 = 1 - \lambda_1$.

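Necessary conditions for solving the problem of Equation (12.108) are

$$\nabla O(P) = 0 \tag{12.110}$$

where $\nabla O(P) = [\partial O/\partial P_1, \partial O/\partial P_2, \dots, \partial O/\partial P_Q]$
is the gradient of O. Substituting the CIR expression into Equation (12.109) we obtain

$$O[P(t)] = \lambda_1 \prod_{i=1}^{Q} \frac{P_i(t) G_{ii}}{\sum_{j \ne i} P_j(t) G_{ij} + N_i} - \lambda_2 \sum_{i=1}^{Q} P_i(t) \tag{12.111}$$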



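To maximize the reward function, Equation (12.111), we find the power vector P which
satisfies Equation (12.110). Since the obtained equations are nonlinear, it is very
complicated to get an analytical solution. An iterative solution for k = 1, ..., Q will be
formulated (we will drop the iteration argument t for simplicity)

$$\frac{\partial O}{\partial P_k} = \lambda_1 \frac{\partial}{\partial P_k} \prod_{i=1}^{Q} \frac{P_i G_{ii}}{\sum_{j \ne i} P_j G_{ij} + N_i} - \lambda_2 = 0 \tag{12.112}$$

After simplification,

$$\lambda_1 \left( \prod_{i=1}^{Q} \Gamma_i \right) \left[ \frac{1}{P_k} - \sum_{i \ne k} \frac{G_{ik}}{\sum_{j \ne i} G_{ij} P_j + N_i} \right] - \lambda_2 = 0 \tag{12.113}$$

which can be rewritten as

$$\frac{1}{P_k} - \sum_{i \ne k} \frac{G_{ik}}{\sum_{j \ne i} G_{ij} P_j + N_i} = \frac{\lambda_2}{\lambda_1} \left( \prod_{i=1}^{Q} \Gamma_i \right)^{-1} \tag{12.114}$$

or

$$\frac{1}{P_k} = \frac{\lambda_2}{\lambda_1} \left( \prod_{i=1}^{Q} \Gamma_i \right)^{-1} + \sum_{i \ne k} \frac{G_{ik}}{\sum_{j \ne i} G_{ij} P_j + N_i} \tag{12.115}$$

Solving for $P_k$ leads to

$$P_k = \left[ \frac{\lambda_2}{\lambda_1} \left( \prod_{i=1}^{Q} \Gamma_i \right)^{-1} + \sum_{i \ne k} \frac{G_{ik}}{\sum_{j \ne i} G_{ij} P_j + N_i} \right]^{-1} \tag{12.116}$$

and further to

$$P_k(t+1) = \frac{1}{\dfrac{\lambda_2}{\lambda_1} \left( \prod_{i=1}^{Q} \Gamma_i(t) \right)^{-1} + \sum_{i \ne k} \dfrac{G_{ik}}{\sum_{j \ne i} G_{ij} P_j(t) + N_i}} \tag{12.117}$$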

For $\lambda_2 = 0$, the problem is reduced to maximizing the throughput and Equation (12.117)
reduces to Equation (12.78). Without power constraints Equation (12.78) is rewritten as

$$\bar{P}_k(t+1) = \frac{1}{\sum_{i \ne k} \dfrac{G_{ik}}{\sum_{j \ne i} G_{ij} P_j(t) + N_i}} \tag{12.118}$$

where $\bar{P}_k$ denotes the power of the maximum throughput algorithm, and Equation (12.117)
can be rewritten in a more compact form as

$$P_k(t+1) = \frac{\bar{P}_k(t+1)}{1 + \dfrac{\lambda_2}{\lambda_1} \dfrac{\bar{P}_k(t+1)}{\prod_{i=1}^{Q} \Gamma_i(t)}}, \quad t = 0, 1, \dots \tag{12.119}$$

where $P_k \in [P_{\min}, P_{\max}]$, k = 1, ..., Q. From Equation (12.119), the new transmitted
power is a scaled value of the transmitted power in the case of the maximum throughput
algorithm. To compare the two algorithms, the numerical example from References
[85, 87] is used. Consider the system with Q = 5 users and the path gain matrix, G, shown
below.



G(dB) = 



The tradeoff factors have been set to $\{\lambda_1, \lambda_2\} = \{0.9999, 0.0001\}$. In this
case we penalize power usage. From Table 12.9, we can see that the sum of the SINR (dB) of the
users [which is related to the total throughput as in Equation (12.77)] differs very little
between the two schemes (only 0.04 %), but the power has been reduced by more than 98 % in
the case of the MTMPC method. Additional information on the topic can be found in References
[88-90].

Table 12.9

          MTMPC algorithm                                MTPC algorithm
          $\lambda_1 = 0.0001$ and $\lambda_2 = 0.9999$          $\lambda_1 = 1$ and $\lambda_2 = 0$

User      P (dBW)        SINR (dB)                       P (dBW)        SINR (dB)
1         -13.9789       16.8345                         -0.5580        16.9295
2         -16.8918       -0.8548                         -6.6072        -0.9234
3         -4.6187        36.0956                         13.4264        36.8300
4         -12.8725       8.6383                          1.2111         8.5561
5         -14.3460       -5.9278                         -1.5289        -6.5922

          Average power  Sum[SINR (dB)]                  Average power  Sum[SINR (dB)]
          (W) = 0.1      = 54.78                         (W) = 5        = 54.80
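
The scaling interpretation of the MTMPC update, in which the maximum throughput power is
divided by a factor that grows with the weight on the power objective, can be sketched as
follows. This is a minimal sketch under stated assumptions and not the code used to produce
Table 12.9: the function names, the array layout (`G[i, j]` is the gain from user j to base
station i), the synchronous update and the two-user example values are chosen here for
illustration, and the weight on the throughput objective is assumed to be nonzero.

```python
import numpy as np

def cir(P, G, N):
    """CIR of every user: own received power over interference plus noise."""
    interference = G @ P - np.diag(G) * P + N
    return np.diag(G) * P / interference

def mtmpc_step(P, G, N, lam1, lam2, p_min, p_max):
    """One synchronous MTMPC update (hypothetical helper, requires lam1 > 0)."""
    interference = G @ P - np.diag(G) * P + N
    # ratios[i, k] = G[i, k] / interference[i]; drop the i == k term of each column sum
    ratios = G / interference[:, None]
    p_mtpc = 1.0 / (ratios.sum(axis=0) - np.diag(ratios))
    # Scale the maximum throughput power by the power-penalty term
    scale = 1.0 + (lam2 / lam1) * p_mtpc / np.prod(cir(P, G, N))
    return np.clip(p_mtpc / scale, p_min, p_max)

G = np.array([[1.0, 0.1], [0.1, 1.0]])
N = np.array([0.01, 0.01])
P0 = np.array([1.0, 1.0])
print(mtmpc_step(P0, G, N, lam1=1.0, lam2=0.0, p_min=0.1, p_max=2.0))  # pure throughput
print(mtmpc_step(P0, G, N, lam1=0.5, lam2=0.5, p_min=0.1, p_max=2.0))  # power penalized
```

With the second weight set to zero the step coincides with the maximum throughput update,
and any positive weight on the power objective yields a smaller power for the same starting
point.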



12.6 DYNAMIC SPECTRA SHARING IN WIRELESS NETWORKS 

In this section we present schemes for interference suppression in UWB-based WPAN systems
that share the same band with other communications networks. The scheme can be
used to significantly improve the performance of UWB systems in the presence of interference
from mobile communication systems such as GSM and WCDMA. It is also effective
in the presence of WLAN systems, which are nowadays based on OFDM technology, or in
military communications where the interference is generated by intentional jamming. The
section also discusses the effectiveness of the scheme in suppressing MC CDMA, which is a
candidate technology for 4G mobile communications. In order to demonstrate the relevance
of these results we first provide a systematic review of the existing work in this field and
then present a specific scheme. The results show that a significant suppression gain of up to
40 dB can be achieved in the presence of OFDM, WCDMA and MC CDMA, enabling coexistence
of different networks in the same frequency band. The effectiveness decreases if the
number of subcarriers is increased.

The online source [91] gives a historical perspective on UWB technologies. It lists
the early UWB references and patents from the 1960s and 1970s. In [92] a comprehensive
overview of UWB wireless systems is given. It discusses the FCC allocation of the 7.5 GHz
(3.1-10.6 GHz) unlicensed band for UWB devices. Potential UWB modulation schemes,
multiple access issues, single vs multiband implementation and link budgets are also
discussed. Paper [93] is a very frequently referenced one that gives a brief introduction to
the basics of impulse radio systems. It describes the characteristics of impulse radio and
gives analytical estimates of the multiaccess capability under idealistic channel conditions.

12.6.1 Channel capacity 

Some new channel capacity results for M-ary pulse position modulation (M-PPM) time
hopping UWB systems are presented in [94]. It is demonstrated that the previous results
based on the 'pure PPM model' have overestimated the real UWB capacity. The proposed
model is extended with the correlator and soft decision decoding. The capacity is evaluated
in the single-user case and with asynchronous multiple user interference (MUI) when the
inputs are equiprobable. It is found that larger M leads to increased capacity only in the
high bit SNR region. Furthermore, optimal time offset values for each M are independent of
the bit SNR. The MUI influence is detrimental to the capacity, especially at high bit SNRs.

12.6.2 Channel models 

Paper [95] focuses on the UWB indoor channel modeling issues. The measurement data 
is collected from an extensive campaign in a typical modern office building with a 2 ns 
delay resolution. The model is formulated as a stochastic tapped delay line (STDL). The 
energy statistics due to small-scale effects seem to follow a Gamma distribution for all 
bins. Large-scale parameters can be modeled as stochastic parameters that can change, e.g., 
from room to room. UWB propagation channels are also discussed in [96]. Based on the 
modified CLEAN algorithm, estimates of time-of-arrival, angle-of-arrival, and waveform 
shape are derived. Key parameters of the model are intercluster decay rate, intracluster 
decay rate, cluster arrival rate, ray arrival rate and standard deviation of the relative azimuth 
arrival angles. Intercluster signal decay rate is generally determined by the architecture 
of the building. Intracluster decay rate, on the other hand, depends on the objects close 
to the receive antenna (e.g. furniture). Relative azimuth arrival angles were best fit to a 
Laplacian probability density function. Saleh and Valenzuela [97] present a model that has 
become a frequently referenced and adopted source in indoor multipath propagation channel 
modeling. They propose a statistical indoor radio channel model that (1) is flexible enough
to fit the measured data reasonably well, (2) is simple enough to be used in simulation and
analysis, and (3) can be extended by adjusting parameters to represent various buildings. In
the developed statistical model the rays of the received signal arrive in clusters.

The ray amplitudes are independent Rayleigh random variables with exponentially
decaying variances with respect to the cluster delay and the ray delay. The clusters and the
rays within the cluster form Poisson arrival processes with different, fixed rates. Paper [98]
characterizes measurement-based UWB wireless indoor channels from the communications
theoretic viewpoint. The bandwidth of the signal used in the measurement is over 1 GHz,
resulting in a time resolution of less than 1 ns. Robustness of the UWB signal to multipath
fading is quantitatively evaluated through histograms and cumulative distributions. Two
rake structures are introduced: the all rake serves as the best-case (benchmark) receiver
and the maximum-energy-capture selective rake is a realistic suboptimal approach.
Multipath components of the measured waveforms are detected using a maximum-likelihood
estimator based on a separable specular multipath channel model.
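
The cluster structure of the Saleh and Valenzuela model [97] described above (Poisson
cluster and ray arrivals, Rayleigh ray amplitudes with exponentially decaying mean power)
can be sketched as a small generator of one channel realization. This is only an illustrative
sketch: the function name, the parameter names and the numerical values in the example are
assumptions made here and are not fitted to any measurement in the cited papers.

```python
import numpy as np

def saleh_valenzuela(cluster_rate, ray_rate, cluster_decay, ray_decay, t_max, rng):
    """Draw ray delays and amplitudes for one realization (times in ns).

    cluster_rate, ray_rate: Poisson arrival rates of clusters and of rays in a cluster.
    cluster_decay, ray_decay: time constants of the exponential power decay.
    """
    delays, gains = [], []
    cluster_start = 0.0  # the first cluster is placed at delay zero
    while cluster_start < t_max:
        ray_offset = 0.0  # the first ray of a cluster arrives with the cluster
        while cluster_start + ray_offset < t_max:
            mean_power = np.exp(-cluster_start / cluster_decay) * np.exp(-ray_offset / ray_decay)
            # A Rayleigh variable with scale s has mean square 2 * s**2
            gains.append(rng.rayleigh(scale=np.sqrt(mean_power / 2.0)))
            delays.append(cluster_start + ray_offset)
            ray_offset += rng.exponential(1.0 / ray_rate)
        cluster_start += rng.exponential(1.0 / cluster_rate)
    order = np.argsort(delays)
    return np.array(delays)[order], np.array(gains)[order]

rng = np.random.default_rng(0)
delays, gains = saleh_valenzuela(cluster_rate=1 / 300, ray_rate=1 / 5,
                                 cluster_decay=60.0, ray_decay=20.0,
                                 t_max=200.0, rng=rng)
print(len(delays), delays[:3], gains[:3])
```

Exponential inter-arrival times are used because they are the gaps of a Poisson process,
which matches the arrival model stated in the text.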

12.6.3 Diversity reception 

Performance of PPM and on-off keying (OOK) binary block-coded modulation formats
using a maximal ratio combining rake receiver is studied analytically in [99]. The trade-off
between receiver complexity and performance is examined. Several suboptimal receivers
in indoor multipath AWGN channels have been employed. Results indicate that robust
performance may require an increase in rake complexity. This implies allocation of more
rake fingers and tracking of the strongest multipaths to help in the selection combining. Rake
performance for a pulse-based high data rate UWB system in an Intel Labs indoor channel
model is addressed in [100]. It is noted that, at low input SNR values (0-10 dB) and with a
small number of rake fingers, it is more beneficial to add rake taps for energy capture rather
than for intersymbol interference (ISI) mitigation. In the presence of channel estimation
errors, equal gain combining can be more robust than maximal ratio combining, and therefore
yield better performance. In order to quantify the trade-off between rake receiver energy
capture and diversity order, [101] presents a partly quasi-analytical and partly experimental
analysis suited to dense multipath propagation environments. Numerical results show that
a diversity level of less than 50 is adequate in typical indoor office conditions.

12.6.4 Performance evaluation 

In [102] a method to evaluate the BER performance of time hopping (TH) PPM in the
presence of multiuser interference and an AWGN channel is proposed. Gaussian quadrature
rules are used in this approach. Paper [103] concentrates on the signal design for binary
UWB communications in dense multipath channels. The aim is to find signals with good
distance properties leading to good BER performance, both of which depend on the time shift
parameter $\tau$. Performance of UWB correlation receivers for equal mean power Gaussian
monocycles is studied in [104]. Channel conditions vary among ideal single user AWGN,
nonideal synchronous, multipath fading and multiple access interference. It is shown that 
the pulse shape has a notable impact on the correlation receiver performance. The effects 
can be seen in the autocorrelation function, especially in the mainlobe. The autocorrelation 
is highly related to the SNR gain of the output and to interference resistance properties. 
Special characteristics of the Gaussian monocycles include: (1) higher order derivatives 
have higher SNR gain in single user and asynchronous multiple access channel but are less 
robust to interference than lower order derivatives; (2) narrower pulses have higher SNR 
gain in asynchronous multiple access channel at the cost of inferior interference resistance 
ability. Exact bit error rate performance of TH-PPM UWB systems in the presence of 
multiple access interference (MAI) is analyzed and simulated in [105]. Furthermore, it is 
shown that, with a moderate number of MAI sources, the standard Gaussian approximation 
becomes inaccurate at high SNRs. 

12.6.5 Multiple access techniques and user capacity 

The main principles for multiaccess in UWB systems are discussed in [106]. A functional
medium access, radio link and radio resource control architecture is proposed and open
issues for future activities are addressed. Numerical throughput and delay performance
results in radio resource sharing are shown. Uncoded and coded performance analysis for
TH-UWB systems is covered in [107]. A practical low-rate error correcting coding scheme
is presented that requires no bandwidth expansion. The Gaussian assumption for multiuser
interference is shown to be invalid for high uncoded data rates. The user capacity is shown
to increase radically with the proposed coding scheme. M-ary signals ranging from M = 2
up to M = 256 are used in [108] together with PPM signal formats in the UWB multiple
access capacity analysis. Performance is analyzed in free-space propagation conditions.
The number of supported users is dependent on the given bit error rate, SNR, transmission
rate and modulation alphabet size. The trade-off between performance and receiver
complexity is discussed. Upper bounds are derived for the combinations of user capacity and
total transmission rate. According to the numerical examples it is possible to achieve a
system with high capacity and data rate, low bit error probability, and yet only moderate
receiver complexity. Reference [109] is one of the first public and widely cited papers
outlining the potential of time hopping impulse radio multiple access communications. It
describes the basic building blocks of the impulse transmitter and receiver and their
mathematical formulations. It also shows an example for the bit error vs user capacity
estimate at variable data rates. Finally, some drawbacks of the high time-resolution impulse
radio systems are mentioned: (1) the need for up to thousands of rake fingers in the multipath
receiver; and (2) complex initial clock acquisition. In [110] a comprehensive overall
description of the time-hopping UWB system physical layer issues is given. Achievable
transmission rates and multiple access capacities are estimated for analog and digital
modulation formats. Numerical results indicate that the digital implementation has the
potential for nearly one order of magnitude higher user densities than the analog one.

12.6.6 Multiuser detection 

Reference [111] is focused on the multiuser detection (MUD) possibilities for direct 
sequence UWB systems. It is demonstrated that the adaptive minimum mean squared er- 
ror (MMSE) MUD receiver outperforms the rake receiver both in energy capture and in 
interference rejection. The interference sources studied are narrowband IEEE 802.11a 
interference and wideband multiuser UWB interference. Ideally, the MMSE receiver can achieve 
the AWGN bit error rate within a 1-2 dB margin even in dense multipath channels. In heavily 
loaded conditions a penalty of 6 dB is experienced, but at the same time the rake 
receivers suffer from unbearable error floors. Iterative partial parallel multiuser interfer- 
ence cancellation (PIC) is applied to the UWB multiuser system in [112]. Matched filter, 
maximum-likelihood, and linear minimum mean squared error receivers are also used in 
the performance comparison. In this paper, multiuser detection is combined with error con- 
trol coding. The UWB system includes only one pulse per symbol and AWGN channel is 
assumed. Numerical results show that it is possible to attain the coded single user BER 
bound for eight to 15 users in a heavily loaded system without any processing gain. As 
the number of users increases and the bandwidth to pulse repetition frequency decreases, 
MAI is expected to adversely affect system capacity and performance. As a consequence, a 
framework for the design of multiuser detectors for UWB multiple access communications 
systems is presented in [113]. An optimum multiuser detector is also proposed. 

12.6.7 Interference and coexistence 

Coexistence of a UWB system with some other radio systems is studied in [114]. This means 
the evaluation of interference caused by the UWB system to the other radio systems and vice 
versa. The coexisting radio concepts are GSM900, UMTS/WCDMA and GPS. Several short 
Gaussian-based UWB pulses are employed. According to the numerical results, convenient 
selection of pulse waveform and width leads to interference resistance up to a certain 
limit. The pulse shape is in interaction with the data rate. High-pass filtered waveforms are 
preferred in the case of short UWB pulses, whereas generic Gaussian ones are favorable if 
long pulses are utilized. Interference caused by narrowband systems is the most detrimental 
to UWB if it is located at the UWB system's nominal center frequency. In the GPS band 
the DS based UWB system interfered less than the time hopping system. 

12.6.8 Channel estimation/imperfections 

Channel estimation for time hopping UWB communications is dealt with in [115]. 
Multipath propagation and MAI are taken into consideration. Maximum-likelihood esti- 
mation is applied in data-aided and nondata-aided scenarios. Numerical results show that 
the performance is reasonable if the number of simultaneous users is below 20. The impact 
of the timing jitter and tracking to the impulse radio system performance is investigated in 
[116]. Binary and 4-ary modulations are used. According to these studies both modulations 
suffer from jitter, although 4-ary performs better. Timing jitter is also discussed in [117]. Static 
and Rayleigh fading channels are assumed. Orthogonal PPM, optimum PPM, OOK and 
BPSK modulations are compared in performance evaluation. Similar performance degra- 
dation is noted for BPSK and PPM schemes, while OOK is more susceptible to large jitter. 
The probability density function of timing jitter due to rake finger estimation is simulated. 
The results depend on the pulse shape and SNR. Worst case distribution is shown to provide 
an upper bound for BER performance. The above survey of issues in UWB communications 
indicates a need for special attention to interference avoidance or interference suppression 
due to extremely wide signal bandwidth and the possibility of interference with other sys- 
tems operating in the same bandwidth. One way to deal with the problem is to design 
the pulse shape in such a way that the signal has no significant spectral component in the 
occupied frequency bands. Pulse shapes respecting the FCC spectral mask were proposed 
in References [118-120]. The drawback of such a solution is the need for over sampling 
and lack of flexibility in the case that the interfering signal is not stationary like in military 
applications. 

Another approach is to use adaptive interference suppression like the schemes sum- 
marized in the previous sub-section entitled interference and co-existence. The solution 
discussed in this section belongs to the latter category. We will demonstrate the advantages 
of this approach with a number of numerical results. The solution is adaptive and can be 
implemented with no over-sampling, unlike other schemes. 

12.6.9 Signal and interference model 

In general the signal transmitted by the desired user is modeled as: 

$$s(t) = \sum_i b\left[t - iNT_f - (1 - a_i)\Delta\right]\cos\omega_c t \tag{12.120}$$

where 

$$b(t) = \sum_{n=0}^{N-1} g\left[t - nT_f - h(n)T_c\right] \tag{12.121}$$

The signal can also be transmitted in the baseband with no carrier. In Equation (12.121), 
g(t) represents the basic pulse shape (monocycle pulse) and T_f represents the frame duration, 
during which there is only one pulse, T_c seconds wide. The sequence h(n) is the user's 
time-hopping code and its elements are integers taking values in the range 0 ≤ h(n) ≤ N − 1. 
The parameter T_c is the duration of an addressable time bin within a frame. In other words, 
the right-hand side of Equation (12.121) consists of a block of N time-hopped monocycles. 
a_i represents the information bits (0, 1). Equation (12.120) says that, if the a_i were all zero, 
the signal would be a repetition of b(t)-shaped blocks with period NT_f. Δ may be viewed as 
the time shift impressed by a unit data symbol on the monocycles of a block. It is clear 
that the choice of Δ affects the detection process and can be exploited to optimize system 
performance. In summary, the transmitted signal consists of a sequence of b(t)-shaped 
position-modulated blocks. 
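
The timing implied by Equations (12.120) and (12.121) can be made concrete with a short sketch. It only computes where each monocycle starts; the pulse shape g(t) and the carrier are left out. The function name and the numerical values are hypothetical and chosen for illustration only.

```python
def pulse_times(bits, hop_code, T_f, T_c, delta):
    """Return the start time of every monocycle for a sequence of data bits.

    Block i starts at i*N*T_f + (1 - a_i)*delta, and pulse n of a block sits
    at n*T_f + h(n)*T_c inside that block (N = length of the hopping code).
    """
    N = len(hop_code)
    times = []
    for i, a_i in enumerate(bits):
        block_start = i * N * T_f + (1 - a_i) * delta
        for n, h_n in enumerate(hop_code):
            times.append(block_start + n * T_f + h_n * T_c)
    return times


# Hypothetical parameters (all times in ns): two bits, three frames per bit.
times = pulse_times(bits=[1, 0], hop_code=[0, 2, 1], T_f=10.0, T_c=2.0, delta=0.5)
print(times)
```

With these values the block carrying a_i = 0 is delayed by Δ relative to the block carrying a_i = 1, which is the position modulation described above.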

The code sequence restarts at every data symbol. This 'short-code' assumption is made 
here for the sake of simplicity and is in keeping with some trends in the design of third- 
generation CDMA cellular systems. Longer codes are conceivable and perhaps more at- 
tractive but lead to more complex channel estimation schemes. The OFDM interference, 
generated for example by a WLAN user, is modeled as 



$$j(t) = \sum_{i=0}^{N-1} d_i J_i \cos\left[2\pi\left(f_c + \Delta f_c + \frac{i}{T_j}\right)t + \varphi_i\right] \tag{12.122}$$

where N is the number of channels, d_i are the OFDM interference information bits, f_c + Δf_c 
is the first channel carrier frequency, J_i are the OFDM interference amplitudes, φ_i is the 
channel phase at the receiver input and T_j is the bit interval. 
The MC CDMA interference can be modeled as 

(12.123) 

where N is the number of channels, d_k(t) is the kth user interference bit, c_k(i) is the PN 
sequence of the kth user and the ith channel, K is the number of users, f_c + Δf_c is the first 
channel carrier frequency, J_ki is the interference amplitude, φ_ki is the channel phase at the 
receiver input and T_j is the bit interval. 



12.6.10 Receiver structure 

The receiver block diagram is shown in Figure 12.22. There are two interference rejection 
circuits, A and B. The first one (A) processes the signal if a logic one is sent, and the other 
(B) processes the signal for a logic zero. When several time-hopping signals are simultaneously 
received over a channel with L_c paths, the composite waveform at the output of the receiver 
antenna may be written as: 



$$r(t) = \sum_{l=1}^{L_c}\left[\gamma_{Il}\,s(t-\tau_l)\cos\omega_c t + \gamma_{Ql}\,s(t-\tau_l)\sin\omega_c t\right] + n(t) + j(t) \tag{12.124}$$

where n(t) is noise, j(t) is the total interference, γ_l = γ_Il + jγ_Ql is the complex 
attenuation and τ_l is the delay in the lth path. If we consider the signal sampled at the chip 
interval T_c we have: 

$$r(k) = rI(k) + j\,rQ(k) \tag{12.125}$$

where 

$$rI(t) = r(t)\cos\omega_c t \tag{12.126a}$$
$$rQ(t) = r(t)\sin\omega_c t \tag{12.126b}$$
$$k = \frac{t}{T_c} \tag{12.127}$$

Figure 12.22 Receiver block diagram. 

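The detection variable in the lth rake receiver finger is: 

$$DI^l(i) = DAI^l(i) - DBI^l(i) \tag{12.128}$$
$$DQ^l(i) = DAQ^l(i) - DBQ^l(i) \tag{12.129}$$
$$DAI^l(i) = \sum_{k=iNT_f/T_c}^{(i+1)NT_f/T_c-1} rI(k)\,b\left(k - iN\frac{T_f}{T_c} - \frac{\tau_l}{T_c}\right) \tag{12.130}$$
$$DAQ^l(i) = \sum_{k=iNT_f/T_c}^{(i+1)NT_f/T_c-1} rQ(k)\,b\left(k - iN\frac{T_f}{T_c} - \frac{\tau_l}{T_c}\right) \tag{12.131}$$
$$DBI^l(i) = \sum_{k=iNT_f/T_c}^{(i+1)NT_f/T_c-1} rI(k)\,b\left(k - iN\frac{T_f}{T_c} - \frac{\tau_l}{T_c} - \frac{\Delta}{T_c}\right) \tag{12.132}$$
$$DBQ^l(i) = \sum_{k=iNT_f/T_c}^{(i+1)NT_f/T_c-1} rQ(k)\,b\left(k - iN\frac{T_f}{T_c} - \frac{\tau_l}{T_c} - \frac{\Delta}{T_c}\right) \tag{12.133}$$

where i represents the bit index. Now, the signal at the output of the RAKE combiner is: 

$$d(i) = \sum_{l=1}^{L_c}\left[DI^l(i)\,TI^l(i) + DQ^l(i)\,TQ^l(i)\right] \tag{12.134}$$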
The weight of the I branch in the lth finger is: 

$$TI^l(i) = DAI^l(i) + DBI^l(i) \tag{12.135}$$

and the weight of the Q branch in the lth finger is: 

$$TQ^l(i) = DAQ^l(i) + DBQ^l(i) \tag{12.136}$$
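
As a hedged illustration of Equations (12.128)-(12.136), the sketch below combines per-finger correlator outputs into the combiner output d(i). The function names, the tuple layout and the sign-based decision rule are assumptions made for the example; the text does not state a decision threshold.

```python
def rake_combine(fingers):
    """Combine rake fingers following Equations (12.128)-(12.136).

    `fingers` is a list of (DAI, DBI, DAQ, DBQ) tuples, one per finger, holding
    the outputs of circuits A and B on the I and Q branches for one bit.
    """
    d = 0.0
    for DAI, DBI, DAQ, DBQ in fingers:
        DI, DQ = DAI - DBI, DAQ - DBQ   # detection variables (12.128), (12.129)
        TI, TQ = DAI + DBI, DAQ + DBQ   # branch weights (12.135), (12.136)
        d += DI * TI + DQ * TQ          # combiner sum (12.134)
    return d


def decide(d):
    # Assumed rule, not taken from the text: positive d means circuit A
    # (logic one) collected more energy than circuit B (logic zero).
    return 1 if d > 0 else 0


# Hypothetical correlator outputs for two fingers.
d = rake_combine([(3.0, 1.0, 2.0, 0.5), (1.0, 0.5, 0.5, 0.25)])
print(d, decide(d))
```

Each term D·T equals DA² − DB², so with these weights the combiner compares the energy collected by the two circuits across all fingers.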

12.6.11 Interference rejection circuit model 

Interference rejection in a UWB radio system may be performed by a transversal filter 
employing the LMS algorithm. Basically, in the first step, the interfering signal is estimated in 
the presence of the UWB signal, which at that stage is considered as additional noise. 
The estimated interference ĵ is subtracted from the overall input signal r, creating the input 
signal r' = r − ĵ = s + n + j − ĵ = s + n + Δj to the standard UWB receiver. In order 
to predict the interference signal, sampling is performed at frame rate, and the adaptation 
of the filter weights using the LMS algorithm is performed at bit rate. It is already known that 
changes of the symbol in the interfering signal will disrupt the estimation process. Curve 3 
in Figure 12.23 shows the detection variable at the output of the transversal filter when there 
is PSK interference (with an interference-to-signal ratio of 40 dB) at the input of the receiver 
together with the useful UWB signal and Gaussian noise. The presence of an impulse 
interference may be seen in the figure. Its appearance is very similar to that in a DSSS 
system using a transversal filter for interference rejection [121]. A 'U' structure that 
successfully rejects this impulse interference was proposed in that reference. Similarly, in 
this section we discuss a structure, shown in Figure 12.24, for interference rejection, which 
is based on the 'U' structure and has been modified to match the UWB radio environment. 

Figure 12.23 Detection variable: 1, no interference; 2, interference with J:S = 40 dB, 
with interference rejection circuit; 3, interference with J:S = 40 dB, with 
transversal filter using classical LMS algorithm; PSK interference, M = 4, 
Δ = 5 ns, v_bJ = 100 Msymbol/s, v_bTH = 5 Mbit/s, T_frame = 10 ns, Δf_c = 
800 MHz. 

Figure 12.24 Interference rejection circuit block diagram. TH_j(t) = time hopping sequence 
[TH_j(t) = b(t − iNT_f − jΔ), j = 1 for filter A, j = 0 for filter B]; SH = 
sample and hold circuit; BPF (F), BPF (B) = backward prediction filter; FPF (F), 
FPF (B) = forward prediction filter; SEL. LOG. = selection logic; A_i = 
amplifier, i = 1, 2; and C_i = comparator, i = 1, 2. 

Curve 2 in Figure 12.23 shows the signal at the output of the interference rejection 
structure in the case when there is PSK interference with interference-to-signal ratio of 
40 dB at the input of the receiver. It can be noted that the interference is rejected and the 
detection variable is very similar to the one when there is no interference (curve 1). 
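
For orientation, the classical LMS transversal predictor that the 'U' structure builds on can be sketched as follows. This is a minimal real-valued one-step predictor, not the circuit of Figure 12.24; the filter order, step size and test signal are assumptions chosen only for illustration.

```python
import math


def lms_predict(x, order=4, mu=0.05):
    """One-step LMS linear predictor; returns the prediction error sequence.

    For a strongly correlated (narrowband) interferer the prediction tracks
    the interference, so the error is the input with that interference
    largely removed.
    """
    w = [0.0] * order
    errors = []
    for k in range(order, len(x)):
        past = x[k - order:k][::-1]                        # most recent sample first
        y = sum(wi * pi for wi, pi in zip(w, past))        # interference estimate
        e = x[k] - y                                       # residual after subtraction
        w = [wi + mu * e * pi for wi, pi in zip(w, past)]  # LMS weight update
        errors.append(e)
    return errors


# Hypothetical input: a unit-amplitude sinusoidal interferer and nothing else.
x = [math.cos(0.3 * k) for k in range(2000)]
err = lms_predict(x)
power_in = sum(v * v for v in x[-200:]) / 200
power_out = sum(v * v for v in err[-200:]) / 200
print(power_in, power_out)
```

Once the weights have converged, the residual power is a small fraction of the input power. A symbol transition in the interferer would break the correlation the predictor relies on, which is the disruption that the modified structure is designed to handle.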

More details on the operating principle of the interference rejection circuit are shown in 
Figures 12.24 and 12.25. We consider the case of a bit interval having 20 frames with five 
samples per frame (M = 2). Central samples, within each frame, carry the same information 
about the useful signal and the interference symbols, and, since each sample belongs to a 
different frame, all those samples originate from different instants of time. Similarly, 
samples from different frames equally distant from the central sample carry the correlated 
interference signal. Therefore, an equivalent signal may be formed in the following way: 
the mth equivalent signal sample is the sum of the mth samples from each frame. Adaptation 
of the filter weights using the LMS algorithm and interference prediction are performed using 
the equivalent signal samples. 

Figure 12.25 Signal processing. 
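
The formation of the equivalent signal described above amounts to a column-wise sum over the frames of one bit interval. A minimal sketch, with a hypothetical function name and made-up sample values:

```python
def equivalent_signal(frames):
    """Sum the m-th sample of every frame to form the m-th equivalent sample.

    `frames` is a list of equal-length sample lists, one per frame of the bit
    interval (2M + 1 samples per frame, centred on the useful pulse).
    """
    n_samples = len(frames[0])
    if any(len(f) != n_samples for f in frames):
        raise ValueError("all frames must hold the same number of samples")
    return [sum(f[m] for f in frames) for m in range(n_samples)]


# Hypothetical values: 3 frames with 5 samples each (M = 2).
frames = [
    [1, 2, 9, 2, 1],
    [0, 3, 8, 3, 0],
    [2, 1, 7, 1, 2],
]
eq = equivalent_signal(frames)
print(eq)
```

The LMS adaptation then runs once per bit on these 2M + 1 summed samples rather than on every frame.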

As already mentioned, changes of the symbol in the interfering signal disrupt the 
estimation process, so forward and backward prediction are used simultaneously. 
When a symbol change occurs, the filter with less disruption (smaller error at its output) 
is used to deliver the estimates. This process is described in more detail in the following. 
For additional insight into the problem the reader is also referred to Cox and Reudink [31]. 
The possible instants of the transition are shown in Figure 12.25 and are denoted 
by 1°, 2°, 3° and 4°. At frame rate, the BPF (F) (backward prediction filter) and 
FPF (F) (forward prediction filter) filters operate with weights forwarded from 
the LMS algorithm adapted by the equivalent signal. The BPF (F) and FPF (F) filters have the 
same weights during the useful signal bit interval, i.e. within all the frames belonging to the 
same bit interval, so the interference is predicted using the same weights computed from 
the equivalent signal. If, in cases 1° and 2°, we discard the samples belonging to the BPF (F) 
filter, we also discard the influence of the interference transition on the prediction. For cases 3° 
and 4°, the samples that belong to the FPF (F) filter should be discarded. This sample discarding 
is performed in the selector S at the outputs of the BPF (F) and FPF (F) filters, based on the error 
signal. 
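
The selection idea, keeping the prediction that is less disturbed by a symbol transition, can be illustrated with a deliberately simplified sketch. It compares absolute errors sample by sample; the actual selector uses the threshold variables C1 and C2 introduced below, which are not modelled here. All names and values are hypothetical.

```python
def select_estimate(backward, forward, actual):
    """Pick, sample by sample, the prediction with the smaller absolute error.

    `backward` and `forward` are the two interference predictions and `actual`
    is the observed sample they try to predict (lists of equal length).
    """
    chosen = []
    for b, f, a in zip(backward, forward, actual):
        chosen.append(b if abs(a - b) <= abs(a - f) else f)
    return chosen


# Hypothetical case: the backward predictor is disrupted from the second
# sample onwards, so the forward predictor takes over there.
chosen = select_estimate(
    backward=[1.0, 1.0, -0.2],
    forward=[1.1, 0.2, -1.0],
    actual=[1.0, 0.1, -0.9],
)
print(chosen)
```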

Therefore, if there is no interference signal transition during the sampling within one 
frame, the equivalent signal will be formed using all the samples from that frame. Also, 
the same equivalent central sample (SX0 (B) = SX0 (F) ) is passed to both BPF (B) and FPF (B) 
LMS algorithms. On the other hand, if there is interference signal transition during the 
sampling within one frame, the equivalent signal will be formed using samples from FPF (F) 
(cases 1° and 2°) or BPF (F) (cases 3° and 4°), and there will be two different equivalent 
central samples. 

For the described interference rejection circuit, the first part processes data at frame 
level, and at each frame the following input signal processing is performed. At filter A, 
the signal is sampled very close in time to the useful signal. For −M ≤ m ≤ M we have: 

$$A_m^l(n) = AI_m^l(n) + j\,AQ_m^l(n) \tag{12.137}$$

For filter B we have: 

$$B_m^l(n) = BI_m^l(n) + j\,BQ_m^l(n) \tag{12.140}$$
After that, the variables C1 and C2, on which the operation of one side of filter X (X = A or B) 
is based, are determined: 



CXl'(n) : 



0. ( E X' m (n)W m (i)]) >aAy,K, 

K m=—M / \m=\ 
CX2'(n) = 

(12.144) 



where A_i (i = 1, 2) are constants (for A_i → ∞, the selector selects all the samples and the 
structure operates as a traditional LMS algorithm). These gains are introduced to reduce 
the influence of noise on irregular selections. 

The second part of the interference rejection circuit operates at bit interval level 
T_b = NT_f, and for each ith bit we have the following signals: 

The error signal used for adaptation of the W coefficients is: 

$$EX1^l(i) = SX1^l(i) - \sum_{m=-M}^{-1} SX_m^l(i)\,W_m(i) \tag{12.149}$$

$$EX2^l(i) = SX2^l(i) - \sum_{m=1}^{M} SX_m^l(i)\,W_m(i) \tag{12.150}$$



The filter output signal, which is fed to the RAKE combiner, is: 

$$DX^l(i) = EX1^l(i) + EX2^l(i) \tag{12.151}$$

The adaptation algorithm is defined as: 

nEXl l (i)\SX l (i)]* 
W m (i + 1) = W m (i) + l - -M<m<-1 (12.152) 
Finally, the interference rejection circuit is made symmetrical in the following way: 

\W m (i + 1)+ W_ m (i + if] 
W m (i + 1) = ^-^ — 1 < m < M (12.154) 

12.6.12 Performance analysis 

The error probability per bit is: 
N_i is the bit ensemble size (measured in number of information bits) used for averaging the 
result and N_a is the number of ensemble members. 

Estimated signal-to-noise ratio per bit is: 

where d_j(i) is the jth ensemble member. 



12.6.13 Performance examples 

Figure 12.26 presents the results for BER as a function of the signal-to-noise ratio (SNR), 
in the presence of different PSK/QAM-type interfering signals. Additional parameters of 
the signals are: filter length M = 4, Δ = 5 ns, v_bJ = 100 Msymbol/s, v_bTH = 5 Mbit/s, 
Δf_c = 800 MHz and T_frame = 10 ns. One can see: (1) fair agreement of simulation and 
numerical results; (2) the performance results are close to the no-interference case, although 
interference with a level of 40 dB above the UWB signal is present; (3) there is also a slight 
degradation of the performance when the interfering signal constellation size is increased. 

Figure 12.26 Error probability as a function of signal-to-noise ratio. Error probability based 
on Monte-Carlo simulation: a, no interference, without interference rejection 
filter; b, no interference, with interference rejection filter; c, PSK interference, 
J:S = 40 dB, with interference rejection filter; d, QPSK interference, J:S = 
40 dB, with interference rejection filter; e, 16QAM interference, J:S = 40 
dB, with interference rejection filter; f, 64QAM interference, J:S = 40 dB, 
with interference rejection filter; g, 256QAM interference, J:S = 40 dB, with 
interference rejection filter. Error probability based on estimated detection 
variable signal-to-noise ratio: b1, the same parameters as b; d1, the same 
parameters as d; f1, the same parameters as f; Δf_c = 800 MHz, M = 4, Δ = 
5 ns, v_bJ = 100 Msymbol/s, v_bTH = 5 Mbit/s, T_frame = 10 ns. 

Figure 12.27 presents the results for BER as a function of the interference-to-signal ratio J:S, 
in the presence of different PSK/QAM-type interfering signals. Additional parameters of 
the signals are: filter length M = 4, SNR = 7 dB, Δ = 5 ns, v_bJ = 100 Msymbol/s, v_bTH = 5 
Mbit/s, Δf_c = 800 MHz and T_frame = 10 ns. One can see that, when J:S becomes larger than 
zero (5 dB), the BER increases rapidly if there is no interference suppression (curves A). 
The performance is very similar if a standard LMS algorithm is used (curves C). On the other 
hand, the U-type filter performs significantly better (curves B). There is again a slight 
degradation of the performance when the interfering signal constellation size is increased. 

Figure 12.27 Error probability as a function of interference-to-signal ratio. A, without 
interference rejection; B, with interference rejection circuit; C, with classical 
LMS interference rejection filter; a, PSK interference; b, QPSK interference; 
c, 16QAM interference; d, 64QAM interference; e, 256QAM interference; 
Δf_c = 800 MHz, M = 4, SNR = 7 dB, Δ = 5 ns, v_bJ = 100 Msymbol/s, 
v_bTH = 5 Mbit/s, T_frame = 10 ns. 

Figure 12.28 presents the results for BER as a function of interference symbol duration T_j/T_c 
in the presence of different PSK/QAM-type interfering signals. Additional parameters of the 
signals are J:S = 30 dB, SNR = 7 dB, M = 4, Δ = 5 ns, v_bTH = 5 Mbit/s, Δf_c = 800 MHz 
and T_frame = 10 ns. One can see that BER decreases when T_j/T_c increases. There is again 
a slight degradation of the performance when the interfering signal constellation size is 
increased. 

Figure 12.29 presents the results for BER as a function of interference symbol 
duration T_j/T_c and the number of subcarriers N in the presence of OFDM-type interfering 
signals. Additional parameters of the signals are J:S = 30 dB, SNR = 7 dB, M = 4, Δ = 
5 ns, v_bTH = 5 Mbit/s, T_frame = 10 ns, Δf_c = 800 MHz and 16QAM per subcarrier. One 
can see that BER decreases when T_j/T_c increases up to T_j/T_c ≈ 200. Beyond that point 
there is no significant reduction in BER if T_j/T_c is further increased. There is a significant 
degradation of the performance when the number of subcarriers in the OFDM signal is 
increased. 




Figure 12.28 Error probability as a function of interference bit duration. a, PSK 
interference; b, QPSK interference; c, 16QAM interference; d, 64QAM 
interference; e, 256QAM interference; Δf_c = 800 MHz; J:S = 30 dB; SNR = 
7 dB; M = 4; Δ = 5 ns; v_bTH = 5 Mbit/s; T_frame = 10 ns. 

Figure 12.29 Error probability as a function of OFDM interference bit duration and the 
number of subcarriers. OFDM/16QAM interference; Δf_c = 800 MHz; J:S = 
30 dB; SNR = 7 dB; M = 4; Δ = 5 ns; v_bTH = 5 Mbit/s; T_frame = 10 ns. 

Figure 12.30 Error probability as a function of OFDM interference bit duration. A, 
interference rejection circuit; B, classical LMS interference rejection filter; a, 
OFDM/PSK interference; b, OFDM/QPSK interference; c, OFDM/16QAM 
interference; d, OFDM/64QAM interference; e, OFDM/256QAM 
interference; N = 16, Δf_c = 800 MHz, J:S = 30 dB, SNR = 7 dB, M = 4, Δ = 5 
ns, v_bTH = 5 Mbit/s, T_frame = 10 ns. 



Figure 12.30 presents the results for BER as a function of interference symbol duration 
T_j/T_c in the presence of OFDM-type interfering signals. Additional parameters of the 
signals are J:S = 30 dB, SNR = 7 dB, M = 4, Δ = 5 ns, v_bTH = 5 Mbit/s, T_frame 
= 10 ns, Δf_c = 800 MHz and N = 16. One can see again that BER decreases when T_j/T_c 
increases. There is again a slight degradation of the performance when the interfering 
signal constellation size is increased. Once again, the U-type filter performs much better 
than the classical LMS algorithm. Figure 12.31 presents the results for BER as a function of 
interference symbol duration T_j/T_c in the presence of MC CDMA-type interfering signals 
for different numbers of subcarriers N and numbers of users K. Additional parameters of the 
signals are J:S = 30 dB, SNR = 7 dB, M = 4, Δ = 5 ns, v_bTH = 5 Mbit/s, T_frame = 10 
ns and Δf_c = 800 MHz. One can see again that BER decreases when T_j/T_c increases. The 
performance is improved if the number of subcarriers is decreased. 

Figure 12.31 Error probability as a function of MC CDMA interference bit duration. a, 
N = 64, K = 10; b, N = 32, K = 5; c, N = 32, K = 10; d, N = 32, K = 
20; e, N = 16, K = 10. MC CDMA interference; J:S = 30 dB, SNR = 7 
dB, M = 4, Δ = 5 ns, v_bTH = 5 Mbit/s, T_frame = 10 ns, Δf_c = 800 MHz. 



The performance is improved if the number of users is increased for the same overall 
power of the interfering signal. This can be explained by the fact that a sum of MC CDMA 
signals will create an equivalent multicarrier signal with fewer dominant subcarriers which 
are suppressed more effectively by the filter because the LMS algorithm better adjusts the 
filter weights. This is demonstrated in Figures 12.29 and 12.30 for OFDM signal. 

In this section we presented a U-type estimation filter based scheme for interference 
suppression in UWB systems and discussed its performance. It was shown that the scheme 
can be used to significantly improve the performance of UWB systems in the presence of 
interference from mobile communication systems such as GSM and WCDMA. It is also 
effective in the presence of WLAN systems which are based on OFDMA technology or in 
military communications where the interference is generated by intentional jamming. The 
section also discusses the effectiveness of the scheme to suppress MC CDMA, which is 
a candidate technology for 4G mobile communications. The results show that significant 
suppression gain up to 40 dB can be achieved in the presence of OFDM, WCDMA and MC 
CDMA. The effectiveness decreases if the number of subcarriers is increased. 

13 



Ad Hoc Networks 



13.1 ROUTING PROTOCOLS 

Self-organizing wireless networks composed of mobile nodes and requiring no fixed in- 
frastructure will be referred to as mobile ad hoc networks (MANET). These networks 
are characterized by dynamic topology. In other words, nodes can join or leave the network 
and can change the range of their transmissions. Each node acts 
as an independent router. Because of the wireless mode of communication, links will be 
bandwidth-constrained and of variable capacity. In addition, there will be limited transmit- 
ter range, energy-limitations and limited physical security. MAC and network protocols are 
of a distributed nature. Also, complex routing protocols with large transmission overheads 
and large processing loads on each node will be used. 

Dynamic topology is illustrated in Figure 13.1. Node mobility creates a dynamic topology, 
i.e. changes in the connectivity between the nodes as a direct function of the distance between 
each other, the characteristics of the area where they are deployed and, of course, the power 
of their transmitters. Node mobility and architecture reconfigurability in ad hoc networks 
have had a major effect on the design of routing protocols. MANETs are used for military 
applications and rescue and recovery scenarios, in local areas like offices, building WLANs, 
home networks, robot networks, sensor networks and personal area networking, and also for 
interconnection of wireless devices like games, and for habitat, wildlife and microclimate 
monitoring. 

Based on the above characteristics of MANET the routing protocols should be distributed, 
based on both uni- and bi-directional links, energy efficient and secure. Performance metrics 
include: 

(1) data throughput, i.e. how well the network delivers packets from the source to 
destination; 

(2) delay, e.g. end-to-end delay of packet delivery; 

(3) overhead costs, i.e. average number of control packets produced per node; 

(4) power consumption, i.e. average power used by each node. 

Figure 13.1 Illustration of dynamic topology. 

Figure 13.2 Proactive/timer: route updates, routes and data. 

Two classes of routing protocols are considered for these applications: proactive and reactive 
routing. Proactive routing, illustrated in Figure 13.2, maintains routes to every other node 
in the network. This is a table-driven protocol where regular routing updates impose a large 
overhead. On the other hand, there is no latency in route discovery, i.e. data can be sent 
immediately. The drawback is that most routing information might never be used. These 
protocols are suitable for high-traffic networks and most often are based on Bellman-Ford 
type algorithms. Reactive routing, illustrated in Figure 13.3, maintains routes only to those 
nodes which are needed. Finding routes is expensive since flooding is involved. 
Owing to the nature of the protocol, there might be a delay before transmitting data. The 
protocol may not be appropriate for real-time applications but is good for low/medium 
traffic networks. 

Figure 13.3 Reactive/on demand: route REQ, route REP and data flow. 

Figure 13.4 (a) Proactive/timer protocol operation when the topology is changed. 
(b) Reactive/on demand protocol operation when the topology is changed. 

When the network topology changes, the two protocols behave differently, as can 
be seen in Figure 13.4(a, b). A proactive/timer-based protocol must wait for the updates to be 
transmitted and processed and for new routing tables to be built. For a reactive/on demand 
protocol a new route request is sent and a new route is found independently. 

Figure 13.5 Classification of ad hoc routing protocols [destination sequenced distance 
vector (DSDV), clusterhead gateway switch routing (CGSR), wireless routing 
protocol (WRP), ad hoc on-demand distance vector (AODV), dynamic source 
routing (DSR), temporally ordered routing algorithm (TORA), lightweight 
mobile routing (LMR), signal stability routing (SSR) and associativity based 
routing (ABR)]. 
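
The on-demand behaviour can be illustrated with a small sketch in which a route request is flooded through the current topology and the first path reaching the destination is returned. This is a generic breadth-first flood, not the request/reply exchange of any particular protocol in Figure 13.5; the function name and the five-node topology are hypothetical.

```python
from collections import deque


def discover_route(links, source, destination):
    """Flood a route request from `source`; return the first path that reaches
    `destination`, or None. `links` maps a node to the set of its neighbours."""
    parent = {source: None}
    queue = deque([source])
    while queue:
        node = queue.popleft()
        if node == destination:
            path = []
            while node is not None:
                path.append(node)
                node = parent[node]
            return path[::-1]
        for neighbour in sorted(links.get(node, ())):
            if neighbour not in parent:   # each node forwards a request once
                parent[neighbour] = node
                queue.append(neighbour)
    return None


# Hypothetical topology before and after the link B-D breaks.
links = {"A": {"B", "C"}, "B": {"A", "D"}, "C": {"A", "D"},
         "D": {"B", "C", "E"}, "E": {"D"}}
route_before = discover_route(links, "A", "E")

links["B"].discard("D")
links["D"].discard("B")
route_after = discover_route(links, "A", "E")
print(route_before, route_after)
```

After the topology change the source simply issues a new request and obtains a route over the surviving links, without waiting for periodic table updates.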

13.1.1 Routing protocols 

Classification of ad hoc routing protocols, discussed in References [1-24] is given in 
Figure 13.5. Table-driven routing protocols attempt to maintain consistent, up-to-date rout- 
ing information from each node to every other node in the network. These protocols require 
each node to maintain one or more tables to store routing information, and they respond 
to changes in network topology by propagating updates throughout the network in order 
to maintain a consistent network view. The areas in which they differ are the number of 
necessary routing-related tables and the methods by which changes in network structure 
are broadcast. We start this section by discussing some of the existing table-driven ad hoc 
routing protocols. 

13.1.1.1 Destination sequenced distance vector, DSDV 

Traditional distance vector algorithms are based on the classical distributed Bellman-Ford 
(DBF) algorithm discussed in Chapter 7. This algorithm has the drawback that routing loops 
can occur. To eliminate or minimize the formation of loops, the nodes are required to 
coordinate and communicate among themselves frequently. The problem is caused by frequent 
topological changes. The routing information protocol (RIP) is based on this type of 
algorithm. The application of RIP to ad hoc networks is limited since it was not designed for 
such an environment. The objective of DSDV protocols is to preserve the simplicity of RIP 
and avoid the looping problem in a mobile wireless environment. The main features of DSDV are: 

• routing tables have entries with number of hops to each network destination; 

• each route table entry is tagged with a sequence number originated by the destination; 

• nodes periodically communicate their routing table to their neighbors, and also whenever 
significant new information is available; 

• routing information is normally transmitted using a broadcasting or multicasting mode. 

A route table entry consists of: 

(1) destination address; 

(2) number of hops required to reach the destination; 

(3) sequence number of the information received regarding that destination, as origi- 
nally stamped by the destination. 

Within the headers of the packet, the transmitter route tables usually carry the node hardware 
address, the network address and the route transmission sequence number. Receiving a 
transmission from a neighbor does not indicate immediately the existence of a bi-directional 
link with that neighbor. A node does not allow routing through a neighbor until that neighbor 
shows that it also has the node as a neighbor. This means that DSDV algorithms use only 
bi-directional links. 

An important parameter value is the time between broadcasting routing information
packets. However, when any new and substantial route information is received by a mobile
node, the updated route information is transmitted as soon as possible.

Broken links are a normal occurrence in ad hoc networks. They are detected by the MAC
layer or inferred if no transmissions are received from a neighbor for some time. A broken
link is given a metric of ∞ and an updated sequence number. A broken link translates into a
substantial route change, and thus this new routing information is immediately broadcast
to all neighbors. To propagate new routing information, particularly that generated by
broken links, and to avoid large transmission overheads, two types of routing packets are
used: (1) full dump - carries all the available information; and (2) incremental - carries
only information changed after the last dump. When a node receives a new routing packet,
the information is compared with that already available at the node. Routes with a
more recent sequence number are always used. Newly computed routes are scheduled for
immediate advertisement. The route updating process is illustrated in Figure 13.6.
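
The comparison of a received entry with the stored one can be sketched as follows. This is
a minimal illustration rather than a reference implementation: the names Route and
consider_advert are hypothetical, sequence numbers are modeled as plain integers, and the
tie-break on metric for equal sequence numbers is taken from the DSDV design and is not
stated in the text above.

```python
from dataclasses import dataclass

INFINITY = float("inf")  # metric assigned to a broken link


@dataclass
class Route:
    next_hop: str
    metric: float  # number of hops to the destination
    seq: int       # sequence number stamped by the destination


def consider_advert(table, neighbor, dest, adv_metric, adv_seq):
    """Apply one entry advertised by `neighbor`; return True if the table changed."""
    candidate = Route(neighbor, adv_metric + 1, adv_seq)  # one extra hop via neighbor
    current = table.get(dest)
    # A more recent sequence number always wins.
    newer = current is None or candidate.seq > current.seq
    # Assumption (DSDV design, not in the text): equal sequence number, shorter metric.
    better = (current is not None and candidate.seq == current.seq
              and candidate.metric < current.metric)
    if newer or better:
        table[dest] = candidate
        return True  # the caller would schedule this route for advertisement
    return False
```

A broken link fits the same rule: it arrives as an entry with metric INFINITY and a newer
sequence number, so it replaces the stored route.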

When A moves and it is detected as a routing neighbor by G and H, it causes these nodes 
to advertise their updated routing information (incremental update). Upon reception of this 
update, F updates its own routing tables and broadcasts the new information. D receives this 
update and carries out an update of its routing table. The steps are illustrated in Figure 13.7. 

Figure 13.6 Illustration of route updating process. Metric = number of hops to reach the
destination; sequence number = the freshness of the received route, used to avoid
routing loops; install time = when a received route was installed, used for
damping route fluctuations.

13.1.1.2 Clusterhead gateway switch routing

The clusterhead gateway switch routing (CGSR) protocol differs from the previous protocol
in the type of addressing and network organization scheme employed. Instead of a 'flat'
network, CGSR is a clustered multihop mobile wireless network with several heuristic
routing schemes [4]. In the next section we will see that, by having a cluster head
controlling a group of ad hoc nodes, a framework for code separation (among clusters),
channel access, routing and bandwidth allocation can be achieved. A distributed cluster
head selection algorithm is used within the cluster to elect a node as the cluster head.
The disadvantage of a cluster head scheme is that frequent cluster head changes can
adversely affect routing protocol performance, since nodes are busy with cluster head
selection rather than packet relaying. Hence, instead of invoking cluster head reselection
every time the cluster membership changes, a least cluster change (LCC) clustering
algorithm is introduced. Using LCC, cluster heads only change when two cluster heads come
into contact, or when a node moves out of contact of all other cluster heads.

CGSR uses DSDV as the underlying routing scheme, and hence has much of the same 
overhead as DSDV. However, it modifies DSDV by using a hierarchical cluster-head-to-gateway
routing approach to route traffic from source to destination. Gateway nodes are nodes
that are within a communication range of two or more cluster heads. A packet sent by a 
node is first routed to its cluster head, and then the packet is routed from the cluster head to 
a gateway to another cluster head, and so on until the cluster head of the destination node is 
reached. The packet is then transmitted to the destination. Using this method, each node must 
keep a "cluster member table" where it stores the destination cluster head for each mobile 
node in the network. These cluster member tables are broadcast by each node periodically 
using the DSDV algorithm. Nodes update their cluster member tables on reception of such 
a table from a neighbor. 






D's updated routing table

Destination  Next hop     Metric  Sequence no.  Install time
A            F            3       S412_A        T509_D
B            B            1       S137_B        T001_D
C            (illegible)  2       S201_C        (illegible)
D            D            0       S312_D        T001_D
E            F            2       S400_E        T005_D
F            F            1       S150_F        T001_D
G            F            2       S111_G        T007_D
H            F            3       S233_H        T007_D
I            I            1       S150_I        T001_D
J            J            1       S130_J        T001_D

D's updated advertised routing table

Destination  Metric  Sequence no.
A            3       S412_A
B            1       S137_B
C            2       S201_C
E            2       S400_E
F            1       S150_F
G            2       S111_G
H            3       S233_H
I            1       S150_I
J            1       S130_J

Figure 13.7 Illustration of route updating process after A moves.



In addition to the cluster member table, each node must also maintain a routing table 
which is used to determine the next hop in order to reach the destination. On receiving 
a packet, a node will consult its cluster member table and routing table to determine the 
nearest cluster head along the route to the destination. Next, the node will check its routing 
table to determine the next hop used to reach the selected cluster head. It then transmits the 
packet to this node. 
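
The two-table lookup can be sketched as follows. This is a simplified illustration: the
function and table names are hypothetical, and the cluster head selected for forwarding is
assumed to be the destination's own cluster head rather than the nearest cluster head along
the route.

```python
def cgsr_next_hop(dest, cluster_member_table, routing_table):
    """Return the neighbor to which a packet for `dest` should be forwarded.

    cluster_member_table: destination node -> its cluster head
    routing_table:        cluster head -> next hop toward that cluster head
    """
    dest_head = cluster_member_table[dest]  # step 1: cluster head serving the destination
    return routing_table[dest_head]         # step 2: next hop toward that cluster head
```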



13.1.1.3 The wireless routing protocol

The wireless routing protocol (WRP) described in Murthy and Garcia-Luna-Aceves [5] is a 
table-based protocol with the goal of maintaining routing information among all nodes 
in the network. Each node in the network is responsible for maintaining four tables: 
(1) distance table; (2) routing table; (3) link-cost table; and (4) message retransmission
list (MRL) table. 

Each entry of the MRL contains the sequence number of the update message, a
retransmission counter, an acknowledgment-required flag vector with one entry per neighbor,
and a list of updates sent in the update message. The MRL records which updates in an 
update message need to be retransmitted and which neighbors should acknowledge the 
retransmission [5]. 

Mobiles inform each other of link changes through the use of update messages. An 
update message is sent only between neighboring nodes and contains a list of updates (the 
destination, the distance to the destination and the predecessor of the destination), as well as 
a list of responses indicating which mobiles should acknowledge (ACK) the update. Mobiles 
send update messages after processing updates from neighbors or detecting a change in a 
link to a neighbor. In the event of the loss of a link between two nodes, the nodes send 
update messages to their neighbors. The neighbors then modify their distance table entries 
and check for new possible paths through other nodes. Any new paths are relayed back 
to the original nodes so that they can update their tables accordingly. Nodes learn of the 
existence of their neighbors from the receipt of acknowledgments and other messages. If a
node is not sending messages, it must send a hello message within a specified time period
to ensure connectivity. Otherwise, the lack of messages from the node is taken to indicate
the failure of that link, which may be a false alarm. When a mobile receives a hello message
from a new node, that new node is added to the mobile's routing table, and the mobile sends
the new node a copy of its routing table information.

Part of the novelty of WRP stems from the way in which it achieves loop freedom. In WRP,
routing nodes communicate the distance and second-to-last hop information for each
destination in the wireless network. WRP belongs to the class of path-finding algorithms
with an important exception. It avoids the 'count-to-infinity' problem [6], also discussed
in Chapter 7, by forcing each node to perform consistency checks of predecessor information
reported by all its neighbors. This ultimately (although not instantaneously) eliminates
looping situations and provides faster route convergence when a link failure event occurs.
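
The idea behind using predecessor (second-to-last hop) information can be sketched as
follows: walking the predecessor entries back from the destination rebuilds the implied
path, and a repeated node reveals inconsistent information. This is only an illustration of
the principle, not the consistency check specified in [5]; the function name and the
dictionary layout are assumptions.

```python
def path_via_predecessors(predecessor, source, dest):
    """Rebuild the source -> dest path by walking predecessor entries back from dest.

    predecessor[x] is the node immediately before x on the path from `source`.
    Returns None if the walk revisits a node (a loop) or hits a missing entry.
    """
    path, seen, node = [dest], {dest}, dest
    while node != source:
        node = predecessor.get(node)
        if node is None or node in seen:
            return None  # inconsistent predecessor information
        seen.add(node)
        path.append(node)
    return path[::-1]
```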

13.1.2 Reactive protocols 

13.1.2.1 Dynamic source routing (DSR)

In this case every packet carries the routing sequence. Intermediate nodes may learn routes 
from 'heard' traffic (RREQ, RREP, DATA). No periodic sending of routing packets occurs. 
The system may piggyback route requests on route replies and must use link layer feedback 
to find broken links. To send a packet the sender constructs a source route in the packet's 
header. 

The source route has the address of every host through which the packet should be 
forwarded to reach its destination. Each host in the ad hoc network maintains a route 
cache in which it stores source routes it has learned. Each entry in the route cache has an 
expiration period, after which it will be deleted. If the sender does not have a route to a 
destination it then attempts to find out by using a routing discovery process. This process 
is illustrated in Figures 13.8-13.10. While waiting for the routing discovery to complete, 
the sender continues sending and receiving packets with other hosts. Each host uses a route 
maintenance procedure to monitor the correct operation of a route. 
Usually the data link layer has a mechanism to detect a link failure. When a link failure
is detected, the host sends a route error packet to the original sender of the packet. The
route error packet has the address of the host that detected the failure and of the host to
which it was attempting to transmit the packet. When a route error packet is received by a
host, the hop in error is removed from the host's route cache, and all routes which contain
this hop are truncated at that point.
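
The truncation step can be sketched as follows, assuming (for illustration only) that the
route cache is a list of source routes, each a list of host addresses starting at the
caching host. The function name is hypothetical, and only the reported direction of the hop
is matched.

```python
def apply_route_error(route_cache, from_host, to_host):
    """Truncate every cached source route at the broken hop from_host -> to_host."""
    updated = []
    for route in route_cache:
        for k in range(len(route) - 1):
            if (route[k], route[k + 1]) == (from_host, to_host):
                route = route[: k + 1]  # keep the part up to the host that saw the failure
                break
        # A route with fewer than two hosts no longer leads anywhere; skip duplicates.
        if len(route) > 1 and route not in updated:
            updated.append(route)
    return updated
```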

To return the route error packet the host uses a route in its route cache to the sender of 
the original packet. If the host does not have a route it can reverse the route information 
carried in the packet that could not be forwarded because of the link error. The latter assumes
that only bidirectional links are being used for routing. Another option to return route error 
packets is to perform a route discovery process to find a route to the original sender and use 
that route. Several optimizations are possible to reduce the amount of overhead traffic. 

Figure 13.8 DSR - route discovery.

Figure 13.9 DSR - route discovery decision process at source A.

Figure 13.10 DSR - route discovery decision process at an intermediate node.

13.1.2.2 Route cache

During the process of route discovery and maintenance a host receives, directly or
indirectly, information about routes to other hosts, thus minimizing the need to search for
that information in the future. For example, in the ad hoc network shown in Figure 13.11,
let us assume that node A1 performs a route discovery to A5.

Figure 13.11 DSR - optimizations (the route request travels from A1 toward A5; the route
reply carries the route A1-A2-A3-A4-A5).

Since hosts A2, A3 and A4 are on the route to A5, host A1 also learns the routes to A2,
A3 and A4. Likewise, these 'intermediate hosts' learn about routes to each other by looking
into the content of the route reply packet.
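
What a host can learn from a single route reply can be sketched as follows. The function
name is hypothetical, and the routes obtained by reversing part of the reply are usable
only under the assumption that the links are bidirectional.

```python
def routes_learned(reply_route, me):
    """Routes that host `me` can cache from one route reply listing the full path."""
    i = reply_route.index(me)
    # Routes toward every host further along the path.
    forward = [reply_route[i : j + 1] for j in range(i + 1, len(reply_route))]
    # Routes back toward earlier hosts (assumes bidirectional links).
    backward = [reply_route[j : i + 1][::-1] for j in range(i)]
    return forward + backward
```

For the reply A1-A2-A3-A4-A5, host A1 obtains routes to A2, A3, A4 and A5, while the
intermediate host A3 obtains routes to A4 and A5 and, by reversal, to A2 and A1.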

13.1.2.3 Piggybacking on route discoveries

To minimize the delay in delivering a data packet when there is no route to the destination and 
a route discovery process is needed, one can piggyback the data on the route request packets. 

13.1.2.4 Learning by 'listening'

If the hosts operate in promiscuous receiving mode, i.e. they receive and process every
transmission in their range, then they can obtain substantial information about routing, e.g.
in the network in Figure 13.12. Nodes A2, A3 and A5 listen to the route error packet from
A4 to A1. Since the route error packet identifies precisely the hop where the failure was
detected, these hosts can update their route caches with this information.

Figure 13.12 Learning about the routes by 'listening'.

In summary, DSR is an on-demand protocol, with potentially zero control message
overhead if the topology does not change frequently. Packet delays/jitters are associated
with on-demand routing. It can work with unidirectional links as well as bidirectional
links. Route caching is used to minimize the route discovery overhead. Promiscuous mode
operations can translate into excessive use of power. DSR is not easily scalable to large
networks since the protocol design assumes a small network diameter. The need to place
the entire route in the route replies and data packets translates into large on-the-air
packet overheads. The protocol allows for the possibility of keeping multiple routes to a
destination in the route cache. CPU and memory demands on each host are high, since the
routes have to be continuously maintained and updated.

13.1.2.5 Ad hoc on-demand distance vector (AODV)

The protocol uses 'traditional' routing tables. Hello messages are sent periodically to
identify neighbors. Sequence numbers guarantee freshness. Route replies are sent in the
reverse direction, i.e. they only use bi-directional links. The system may use link layer
feedback to find broken links. The protocol is based on the destination-sequenced
distance-vector (DSDV) algorithm. It provides on-demand route acquisition. Nodes maintain
the route cache and use a destination sequence number for each route entry. The protocol
does nothing when the connection between end points is still valid. The route discovery
mechanism, illustrated in Figure 13.13, is initiated by broadcasting a route request packet
(RREQ) when a route to a new destination is needed.

The neighbors forward the request to their neighbors until either the destination or an
intermediate node with a 'fresh enough' route to the destination is located. Route reply
packets are transmitted upstream of the path taken by the route request packet to inform the
original sender (and intermediate nodes) of the route finding. Route error packets (RERR)
are used to erase broken links.
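
The decision taken by a node that receives a route request can be sketched as follows.
Here 'fresh enough' is interpreted, following the AODV specification, as a stored
destination sequence number at least as large as the one carried in the request; the
function name and the table layout are assumptions made for illustration.

```python
def handle_rreq(node, routes, rreq_dest, rreq_dest_seq):
    """Decide what `node` does with a route request for `rreq_dest`.

    routes maps destination -> (next_hop, hop_count, dest_seq).
    """
    if node == rreq_dest:
        return "reply"    # the destination itself answers
    entry = routes.get(rreq_dest)
    if entry is not None and entry[2] >= rreq_dest_seq:
        return "reply"    # intermediate node with a fresh enough route
    return "forward"      # otherwise rebroadcast the request to the neighbors
```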

13.1.2.6 AODV - path maintenance

Periodic hello messages can be used to ensure symmetric links and to detect link failures.
Hello messages include a list of nodes that the host has heard of. Once the next hop becomes
unavailable, the host upstream of the link failure propagates an unsolicited RREP with a hop
count of ∞ to all active upstream neighbors. Upon receiving notification of a broken link,
a source node can restart the discovery process if it still needs a route to the destination.

13.1.2.7 Temporally ordered routing algorithm (TORA)

TORA is also a distributed protocol. It provides loop-free routes and multiple routes. It is 
source-initiated and creates routes to destination only when desired. 

The protocol minimizes reaction to topological changes, by localizing reaction to a very 
small group of nodes near the change and provides fast recovery upon route failures. Also, it 
detects a network partition when there is one and erases all invalid routes within a finite time. 

The protocol has three basic functions: route creation, route maintenance and route 
erasure. During the route creation, nodes use a 'height' metric to build a directed acyclic 
graph (DAG) rooted at the destination. Links are assigned a direction (upstream
or downstream) based on the relative height metric of neighboring nodes, as illustrated in 
Figure 13.14. 

Figure 13.14 TORA.

From Chapter 7, $G(V, E)$ represents the network, where $V$ is the set of nodes and $E$ is
the set of links. Each node $i \in V$ is assumed to have a unique identifier (ID), and each
link $(i, j) \in E$ is assumed to be bidirectional, i.e. it allows two-way communications.
The height metric associated with each node is of the form
$H_i = (\tau_i, oid_i, r_i, \delta_i, i)$ where:

• $\tau_i$ is the logical time of a link failure, and defines a new reference level;

• $oid_i$ is the unique ID of the node that defined the reference level;

• $r_i$ denotes two unique sub-levels of a reference level;

• $\delta_i$ is used to order the nodes with respect to a common reference level; and

• $i$ is the unique ID of the node itself.

Each node $i$ (other than the destination) maintains its height $H_i$. Initially the height
is set to NULL, $H_i = [-, -, -, -, i]$. The height of the destination is always ZERO,
$H_{dID} = [0, 0, 0, 0, dID]$. At each node there is a height array with an entry $NH_{i,j}$
for each neighbor $j \in V$. Initially the height of each neighbor is set to NULL; if a
neighbor is a destination its corresponding entry in the height array is set to ZERO.
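
The way heights induce link directions can be sketched as follows. Heights are modeled as
5-tuples compared lexicographically, which Python does natively for tuples. As a
simplifying assumption of this sketch, a link is left unassigned whenever either height is
still NULL; the function name and the use of integer node IDs are also assumptions.

```python
NULL = None  # height not yet assigned


def link_direction(h_i, h_j):
    """Direction of link (i, j) as seen from node i.

    Heights are tuples (tau, oid, r, delta, node_id), ordered lexicographically.
    Two heights are never equal because node_id is unique.
    """
    if h_i is NULL or h_j is NULL:
        return "unassigned"  # simplification: no direction until both heights are set
    return "downstream" if h_j < h_i else "upstream"
```

With the destination at height (0, 0, 0, 0, 0), a neighbor at (0, 0, 0, 1, 1) sees its link
to the destination as downstream, so data flows from higher to lower nodes toward the
destination.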

The operation of TORA protocols is illustrated in Figures 13.15-13.18. The protocols
are well suited for networks with large dense populations of nodes. They possess the ability
to detect network partitions and erase all invalid routes within a finite time.

The protocol quickly creates and maintains routes for a destination for which routing is
desired. It minimizes the number of nodes reacting to topological changes. It needs a
synchronization mechanism to achieve a temporal order of events.

13.1.2.8 Associativity-based routing

The associativity-based routing (ABR) protocol [12] is free from loops, deadlock and packet
duplicates, and defines a different routing metric for ad hoc mobile networks. This metric 
is known as the degree of association stability. In ABR, a route is selected based on the 
degree of association stability of mobile nodes. Each node periodically generates a beacon 
to signify its existence. When received by neighboring nodes, this beacon causes their 
associativity tables to be updated. For each beacon received, the associativity tick of the 
current node with respect to the beaconing node is incremented. Association stability is 
defined by connection stability of one node with respect to another node over time and 
space. A high degree of association stability may indicate a low state of node mobility, 
while a low degree may indicate a high state of node mobility. Associativity ticks are reset 
when the neighbors of a node or the node itself move out of proximity. A fundamental 
objective of ABR is to derive longer-lived routes for ad hoc mobile networks. 

The three phases of ABR are: (1) route discovery; (2) route reconstruction (RRC); 
and (3) route deletion. The route discovery phase is accomplished by a broadcast query 
and await-reply (BQ-REPLY) cycle. A node desiring a route broadcasts a BQ message in 
search of mobiles that have a route to the destination. All nodes receiving the query (that are 
not the destination) append their addresses and their associativity ticks with their neighbors 
along with QoS information to the query packet. A successor node erases its upstream node 
neighbors' associativity tick entries and retains only the entry concerned with itself and its 
upstream node. In this way, each resultant packet arriving at the destination will contain the 
associativity ticks of the nodes along the route to the destination. The destination is then 
able to select the best route by examining the associativity ticks along each of the paths. 
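
The selection made at the destination can be sketched as follows. The text does not say
how the ticks along a path are combined into an overall degree of association stability, so
this sketch assumes the sum of the per-hop ticks; the tie-break on the number of hops is the
rule stated in the text that follows. The function name and data layout are hypothetical.

```python
def select_route(candidates):
    """Pick a path from a list of (path, ticks) pairs.

    path  : list of node IDs from source to destination
    ticks : associativity ticks recorded for each hop of the path
    """
    def key(item):
        path, ticks = item
        # Higher stability first (assumed: sum of ticks), then fewer hops.
        return (-sum(ticks), len(path) - 1)

    return min(candidates, key=key)[0]
```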
Figure 13.15 TORA - route creation.

When multiple paths have the same overall degree of association stability, the route with
the minimum number of hops is selected. The destination then sends a REPLY packet back
to the source along this path. Nodes propagating the REPLY mark their routes as valid.
All other routes remain inactive, and the possibility of duplicate packets arriving at the
destination is avoided.

RRC may consist of partial route discovery, invalid route erasure, valid route updates and
new route discovery, depending on which node(s) along the route move. Movement by the source
results in a new BQ-REPLY process. The RN message is a route notification used to erase the
route entries associated with downstream nodes. When the destination moves, the immediate
upstream node erases its route and determines if the node is still reachable by a localized
query (LQ[H]) process, where H refers to the hop count from the upstream node to the
destination. If the destination receives the LQ packet, it REPLYs with the best partial
route; otherwise, the initiating node times out and the process backtracks to the next
upstream node. Here an RN message is sent to the next upstream node to erase the invalid
route and inform this node that it should invoke the LQ process. If this process results in
backtracking more than halfway to the source, the LQ process is discontinued and a new BQ
process is initiated at the source.

When a discovered route is no longer desired, the source node initiates a route delete (RD)
broadcast so that all nodes along the route update their routing tables. The RD message is
propagated by a full broadcast, as opposed to a directed broadcast, because the source node
may not be aware of any route node changes that occurred during RRCs.



13.1.2.9 Signal stability routing

Another on-demand protocol is the signal stability-based adaptive routing protocol (SSR)
presented in Dube et al. [13]. Unlike the algorithms described so far, SSR selects routes
based on the signal strength between nodes and a node's location stability. These route
selection criteria have the effect of choosing routes that have 'stronger' connectivity. SSR
can be divided into two cooperative protocols: the dynamic routing protocol (DRP) and the
static routing protocol (SRP). The DRP is responsible for the maintenance of the signal
stability table (SST) and routing table (RT). The SST records the signal strength of
neighboring nodes, which is obtained by periodic beacons from the link layer of each
neighboring node. Signal strength may be recorded as either a strong or weak channel. All
transmissions are received by, and processed in, the DRP. After updating all appropriate
table entries, the DRP passes a received packet to the SRP.

Figure 13.16 TORA - route creation (visualization).

Figure 13.17 TORA - maintaining routes: link failure with no reaction.




Figure 13.18 TORA - re-establishing routes after link failure of last downstream link. 






The SRP processes packets by passing the packet up the stack if it is the intended receiver 
or looking up the destination in the RT and then forwarding the packet if it is not. If no 
entry is found in the RT for the destination, a route-search process is initiated to find a route. 
Route requests are propagated throughout the network, but are only forwarded to the next 
hop if they are received over strong channels and have not been previously processed (to 
prevent looping). The destination chooses the first arriving route-search packet to send back 
because it is most probable that the packet arrived over the shortest and/or least congested 
path. The DRP then reverses the selected route and sends a route-reply message back to the 
initiator. The DRP of the nodes along the path update their RTs accordingly. 

Route-search packets arriving at the destination have necessarily chosen the path of 
strongest signal stability, since the packets are dropped at a node if they have arrived 
over a weak channel. If there is no route-reply message received at the source within a
specific timeout period, the source changes the PREF field in the header to indicate that 
weak channels are acceptable, since these may be the only links over which the packet can 
be propagated. When a failed link is detected within the network, the intermediate nodes 
send an error message to the source indicating which channel has failed. The source then 
initiates another route-search process to find a new path to the destination. The source also 
sends an erase message to notify all nodes of the broken link. 
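
The forwarding rule for route-search packets can be sketched as follows. The PREF field is
modeled here as a boolean flag and the set of already processed packets as a Python set;
both, like the function name, are assumptions made for illustration.

```python
def should_forward(packet_id, channel, strong_only, seen):
    """Return True if a node should rebroadcast a route-search packet.

    channel     : "strong" or "weak", the channel over which the packet arrived
    strong_only : True while the header requires strong channels (PREF, assumed boolean)
    seen        : set of packet IDs this node has already processed
    """
    if strong_only and channel != "strong":
        return False  # dropped: arrived over a weak channel
    if packet_id in seen:
        return False  # dropped: already processed, prevents looping
    seen.add(packet_id)
    return True
```

After a timeout at the source, the same search is repeated with strong_only set to False,
so that weak channels become acceptable.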



13.2 HYBRID ROUTING PROTOCOL 

The zone routing protocol (ZRP) is a hybrid routing protocol that proactively maintains 
routes within a local region of the network (referred to as the routing zone). Knowledge of 
this routing zone topology is leveraged by the ZRP to improve the efficiency of a reactive 
route query/reply mechanism. The ZRP can be configured for a particular network through 
adjustment of a single parameter, the routing zone radius. A routing zone of radius r is 
defined for each node and includes the nodes whose minimum distance in hops from a 
given node is at most r hops. An example of a routing zone (for node S) of radius two 
hops is shown in Figure 13.19. Nodes within the circle are said to be within the routing
zone of the central node S. Nodes outside the circle are said to be outside S's routing zone. 
Peripheral nodes are nodes whose minimum distance to S is exactly equal to the zone radius. 
The remaining nodes are categorized as interior nodes. 
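
The definition of a routing zone translates directly into a breadth-first search that stops
at the zone radius. The sketch below assumes the topology is given as an adjacency
dictionary (a hypothetical format) and does not count the central node among the interior
nodes.

```python
from collections import deque


def routing_zone(adj, center, radius):
    """Return (interior, peripheral) node sets of the routing zone of `center`.

    adj maps each node to an iterable of its neighbors.
    """
    dist = {center: 0}
    queue = deque([center])
    while queue:
        u = queue.popleft()
        if dist[u] == radius:
            continue  # do not expand beyond the zone radius
        for v in adj[u]:
            if v not in dist:
                dist[v] = dist[u] + 1  # BFS yields the minimum hop distance
                queue.append(v)
    peripheral = {n for n, d in dist.items() if d == radius}
    interior = {n for n, d in dist.items() if 0 < d < radius}
    return interior, peripheral
```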



Figure 13.19 A routing zone of radius r = 2 hops (figure labels: routing zone of radius
2 hops, neighboring node, peripheral node).
For a routing zone of radius r, the number of routing zone nodes can be regulated through
adjustments in each node's transmitter power. Subject to the local propagation conditions 
and receiver sensitivity, the transmission power determines the set of neighbor nodes, i.e. 
those nodes that are in direct communication with a node. To provide adequate network 
reachability, it is important that a node be connected to a sufficient number of neighbors. 
However, more is not necessarily better. As the transmitters' coverage areas grow larger, so 
does the membership of the routing zones. This can result in an excessive amount of route 
update traffic. 

Each node is assumed to continuously (proactively) maintain routing information only 
to those nodes that are within its routing zone. Because the updates are only propagated 
locally, the amount of update traffic required to maintain a routing zone does not depend 
on the total number of network nodes (which can be quite large). This is referred to as the 
intrazone routing protocol (IARP). The interzone routing protocol (IERP) is responsible 
for reactively discovering routes to destinations located beyond a node's routing zone. The 
IERP operates as follows: the source node first checks whether the destination is within its 
zone. If so, the path to the destination is known, and no further route discovery processing 
is required. If the destination is not within the source's routing zone, the source broadcasts 
a route request (referred to as request) to all its peripheral nodes. Now, in turn, all the 
peripheral nodes execute the same algorithm: they check whether the destination is within 
their zone. If so, a route reply (referred to as reply) is sent back to the source indicating 
the route to the destination. If not, the peripheral node forwards the query to its peripheral 
nodes, which in turn execute the same procedure. 

An example of this route discovery procedure is demonstrated in Figure 13.20. The
source node S needs to send a packet to the destination D. To find a route within the
network, S first checks whether D is within its routing zone. If so, S knows the route to D.
Otherwise, S broadcasts a query to its peripheral nodes; that is, S sends a query to nodes
H, G and C. Now, in turn, after verifying that D is not in its routing zone, each one of these
nodes forwards the query by broadcasting the query to its peripheral nodes. In particular,
H sends the query to B, which recognizes D as being in its routing zone and responds
to the query, indicating the forwarding path: S-H-B-D.

Figure 13.20 Illustration of IERP operation.

As indicated by this example, a route can be specified by a sequence of nodes that have
received the successful IERP query thread. The manner in which this information is collected
and distributed is specified by a route accumulation procedure. In the basic route
accumulation, a node appends its ID to a received query packet. When a node finds the
destination in its zone, the accumulated sequence of IDs specifies a route between querying
source and destination. By reversing the accumulated route, a path is provided back to the
query source. This information can be used to return the route reply through source routing.
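
The query procedure with basic route accumulation can be sketched as follows. This is a
centralized simulation for illustration, not a distributed implementation: the function
names and the adjacency-dictionary topology are assumptions, and each node is queried at
most once, which is a simple stand-in for query control that guarantees the loop ends.

```python
from collections import deque


def zone_of(adj, center, radius):
    """Hop distances of all nodes within `radius` hops of `center` (BFS)."""
    dist = {center: 0}
    queue = deque([center])
    while queue:
        u = queue.popleft()
        if dist[u] < radius:
            for v in adj[u]:
                if v not in dist:
                    dist[v] = dist[u] + 1
                    queue.append(v)
    return dist


def ierp_query(adj, source, dest, radius):
    """Return the accumulated sequence of querying nodes ending at dest, or None."""
    queue = deque([[source]])
    queried = {source}  # nodes that have already processed this query
    while queue:
        route = queue.popleft()
        zone = zone_of(adj, route[-1], radius)
        if dest in zone:
            return route + [dest]  # the reply carries the accumulated route
        for node, d in zone.items():
            if d == radius and node not in queried:
                queried.add(node)
                queue.append(route + [node])  # the peripheral node appends its ID
    return None
```

On a chain S-a-H-b-B-c-D with radius 2, the accumulated route is S, H, B, D, which has the
same form as the path S-H-B-D in the example above.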

The intuition behind the ZRP is that querying can be performed more efficiently by 
broadcasting queries to the periphery of a routing zone rather than flooding the queries over 
the same area. However, problems can arise once the query leaves the initial routing zone. 
Because the routing zones heavily overlap, a node can be a member of many routing zones. 
It is very possible that the query will be forwarded to all the network nodes, effectively 
flooding the network. 

An even more disappointing result is that the IERP can generate much more traffic than
flooding itself, because broadcasting to the periphery involves sending the query along a
path of length equal to the zone radius. Excess route query traffic can be regarded as a
result of overlapping
query threads (i.e. overlapping queried routing zones). Thus, the design objective of query 
control mechanisms should be to reduce the amount of route query traffic by steering 
threads outward from the source's routing zone and away from each other, as indicated 
in Figure 13.21. This problem is addressed primarily through appropriate mechanisms of 
query detection and query termination. 

13.2.1 Loop-back termination 

The query is terminated when the accumulated route (excluding the previous node) contains
a host that lies in the routing zone of the current node, e.g. for route = {S → A → B → C}
in Figure 13.22, C terminates the query, because S is in C's routing zone.

Figure 13.21 Guiding the search in desirable directions.

Figure 13.22 Loop-back termination.

Figure 13.23 Early termination.
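
The loop-back termination test can be sketched as follows. The function name is
hypothetical, and the accumulated route is assumed to be passed as it arrives at the
current node, i.e. before that node appends its own ID.

```python
def loop_back_terminate(accumulated_route, my_zone):
    """True if the query should be terminated at the current node.

    accumulated_route : IDs appended so far; the last one is the previous node
    my_zone           : set of nodes inside the current node's routing zone
    """
    earlier = accumulated_route[:-1]  # exclude the previous node
    return any(node in my_zone for node in earlier)
```

In the example of Figure 13.22, node C receives the route S, A, B; since S lies in C's
routing zone, the function returns True and the query is terminated.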



13.2.2 Early termination 

When the ability to terminate route query threads is limited to peripheral nodes, threads are 
allowed to penetrate into previously covered areas, which generates unnecessary control 
traffic. This excess traffic can be eliminated by extending the thread termination capability to 
the intermediate nodes that relay the thread. This approach is referred to as early termination 
(ET). Figure 13.23 illustrates the operation of the ET mechanism. Node S broadcasts a route 
query with node C as one of the intended recipients. Intermediate node A passes along 
the query to B. Instead of delivering the query to node C, node B terminates the thread 
because a different thread of this query was previously detected. Intermediate nodes may 
terminate existing queries but are restricted from issuing new queries. Otherwise, the ZRP 
would degenerate into a flooding protocol. The ability to terminate an overlapping query 
thread depends on the ability of nodes to detect that a routing zone they belong to has 
been previously queried. Clearly, the central node in the routing zone (which processed the 
query) is aware that its zone has been queried. In order to notify the remaining routing zone 
nodes without introducing additional control traffic, some form of 'eavesdropping' needs 
to be implemented. The first level of query detection (QD1) allows the intermediate nodes, 
which transport queries to the edge of the routing zone, to detect these queries. In 
single-channel networks, it may be possible for queries to be detected by any node within the 
range of a query-transmitting node. This extended query detection capability (QD2) can 
be implemented by using IP broadcasts to send route queries. Figure 13.24 illustrates both 
levels of advanced query detection. In this example, node S broadcasts to two peripheral 
nodes, B and D. The intermediate nodes A and C are able to detect passing threads using 
QD1. If QD2 is implemented, node E will be able to 'eavesdrop' on A's transmissions and 
record the query as well. 

Figure 13.24 Query detection (QD1/QD2). 
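
The early termination rule can be sketched in a few lines of Python. This is only an illustration under stated assumptions: the class ZoneNode, the function propagate and the (source, sequence number) query identifier are hypothetical names, not part of the ZRP specification, and the set of detected queries stands in for whatever QD1/QD2 bookkeeping a real node would keep.

```python
class ZoneNode:
    """Hypothetical node that remembers which route queries it has detected."""

    def __init__(self, name):
        self.name = name
        self.detected = set()  # query IDs already seen (assumed bookkeeping)

    def overhear(self, query_id):
        # QD1/QD2: record a query seen while relaying it or by eavesdropping.
        self.detected.add(query_id)

    def relay(self, query_id):
        # Early termination: refuse to relay a query that was already detected.
        if query_id in self.detected:
            return False
        self.detected.add(query_id)
        return True


def propagate(thread_path, query_id):
    """Walk one query thread hop by hop and return the nodes that relayed it."""
    relayed = []
    for node in thread_path:
        if not node.relay(query_id):
            break  # the thread is terminated here and goes no further
        relayed.append(node.name)
    return relayed


# Situation of Figure 13.23: B has already detected another thread of the query.
a, b, c = ZoneNode("A"), ZoneNode("B"), ZoneNode("C")
query = ("S", 1)  # (source, sequence number): an assumed identifier format
b.overhear(query)
relayed_by = propagate([a, b, c], query)  # A relays, B terminates the thread
```

Note that the nodes only ever drop a thread; none of them issues a new query, in line with the restriction that keeps ZRP from degenerating into flooding.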

The techniques just discussed improve the efficiency of the IERP by significantly reduc- 
ing the cost of propagating a single query. Further improvements in IERP performance can 
be achieved by reducing the frequency of route queries, initiating a global route discovery 
procedure only when there is a substantial change in the network topology. More specif- 
ically, active routes are cached by nodes: the communicating end nodes and intermediate 
nodes. Upon a change in the network topology, such that a link within an active path is 
broken, a local path repair procedure is initiated. The path repair procedure substitutes a 
broken link by a minipath between the ends of the broken link. A path update is then gen- 
erated and sent to the end points of the path. Path repair procedures tend to reduce the path 
optimality (e.g. increase the length for shortest path routing). Thus, after some number of 
repairs, the path endpoints may initiate a new route discovery procedure to replace the path 
with a new optimal one. 
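
The local repair step can be illustrated as a simple list operation. The sketch below is an assumption-laden example rather than the protocol's actual procedure: repair_path, needs_rediscovery and the threshold MAX_REPAIRS are hypothetical, and the text itself only says that rediscovery follows "some number of repairs".

```python
MAX_REPAIRS = 3  # assumed threshold; the text does not give a value


def repair_path(path, broken_link, minipath):
    """Substitute the broken link (u, v) in `path` by `minipath`, which runs from u to v."""
    u, v = broken_link
    i = path.index(u)
    if i + 1 >= len(path) or path[i + 1] != v:
        raise ValueError("broken link is not part of the path")
    if minipath[0] != u or minipath[-1] != v:
        raise ValueError("minipath must connect the ends of the broken link")
    return path[:i] + minipath + path[i + 2:]


def needs_rediscovery(repair_count):
    """True once the endpoints should replace the path with a freshly discovered one."""
    return repair_count >= MAX_REPAIRS


route = ["S", "A", "B", "D"]
repaired = repair_path(route, ("A", "B"), ["A", "X", "B"])  # detour through X
```

Each repair lengthens the path by the extra hops of the minipath, which is why the number of repairs is a natural trigger for a new global route discovery.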

13.2.3 Selective broadcasting (SBC) 

Rather than broadcast queries to all peripheral nodes, the same coverage can be provided by 
broadcasting to a chosen subset of peripheral nodes. This requires IARP to provide network 
topology information for an extended zone that is twice the radius of the routing zone. 

A node will first determine the subset of other peripheral nodes covered by its assigned 
inner peripheral nodes. The node will then broadcast to this subset of assigned inner pe- 
ripheral nodes which forms the minimal partitioning set of the outer peripheral nodes. 

This is illustrated in Figure 13.25. S's inner peripheral nodes are A, B and C. Its outer 
peripheral nodes are F, G, H, X, Y and Z. Two inner peripheral nodes of B (H and X) are 
also inner peripheral nodes of A and C. S can then choose to eliminate B from its list of 
broadcast recipients since A can provide coverage to H and C can cover X. 
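
Choosing the broadcast recipients amounts to covering the outer peripheral nodes with as few inner peripheral nodes as possible. The following Python sketch uses a greedy set-cover heuristic, which is a swapped-in technique chosen for illustration and not necessarily the selection rule of the protocol. Only the facts that B covers H and X, that A covers H and that C covers X come from the example above; the remaining coverage sets (F and G under A, Y and Z under C) are assumed.

```python
def select_recipients(coverage):
    """Greedy set cover: pick inner peripheral nodes until all outer nodes are covered.

    coverage maps each inner peripheral node to the set of outer peripheral
    nodes it can reach (known from the extended-zone topology).
    """
    uncovered = set().union(*coverage.values())
    chosen = []
    while uncovered:
        # Take the node that covers the most still-uncovered outer nodes;
        # ties are broken by name because the candidates are sorted.
        best = max(sorted(coverage), key=lambda n: len(coverage[n] & uncovered))
        chosen.append(best)
        uncovered -= coverage[best]
    return chosen


coverage = {
    "A": {"F", "G", "H"},  # F and G are assumed; H is stated in the text
    "B": {"H", "X"},       # stated in the text
    "C": {"X", "Y", "Z"},  # Y and Z are assumed; X is stated in the text
}
recipients = select_recipients(coverage)  # B turns out to be redundant
```

Greedy selection does not always yield the true minimum set, but it is cheap to compute from the extended-zone topology that IARP already maintains.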

The position of the routing functions in the protocol stack is illustrated in Figure 13.26. 
Route updates are triggered by the MAC-level neighbor discovery protocol (NDP). IARP is 
notified when a link to a neighbor is established or broken. IERP reactively acquires routes 
to nodes beyond the routing zone. IERP forwards queries to its peripheral nodes (BRP), 
keeping track of the peripheral nodes through the routing topology information provided 
by IARP. 

Figure 13.25 Selective broadcasting. 

Figure 13.26 The position of the routing functions in the protocol stack. 

Table 13.1 Simulation parameters 

Parameter                           Symbol   Values     Default 
Zone radius [hops]                  P        1-8        - 
Node density [neighbors/node]       S        3-9        6 
Relative node speed [neighbors/s]   V        0.1-2.0    1.0 
Number of nodes [nodes]             N        200-1000   500 

Pearlman and Haas [38] present the performance evaluation of the hybrid protocol 
described above for the set of simulation parameters given in Table 13.1. From Figure 13.27 
one can see that the IARP control traffic per node increases as the radius of the zone is 
increased. 

At the same time, the IERP traffic generated per zone is reduced. The total ZRP 
traffic therefore has a minimum value for some zone diameter r, as demonstrated in 
Figure 13.28. 

Figure 13.28 ZRP traffic per node (N = 1000 nodes, v = 0.5 neighbors/s). 



13.3 SCALABLE ROUTING STRATEGIES 



13.3.1 Hierarchical routing protocols 

A hierarchical approach to routing, often referred to as hierarchical state routing (HSR), 
has been a traditional option when the network has a large number of nodes. The approach 
has a lot in common with routing with aggregation presented in Chapter 7. Common 
table-driven protocols and on-demand protocols are designed for flat topologies and thus 
have a scalability problem when the network is large. For table-driven protocols there is a 
high volume of overhead transmissions. On the other hand, for on-demand protocols there 
is a large discovery latency. 

Figure 13.29 The network hierarchical structure. 

Figure 13.30 Three layers network hierarchical structure. 

The experience gained in wired networks suggests the use of a hierarchical structure to 
address the scalability problem. The use of a hierarchical routing protocol in ad hoc networks 
reduces overhead traffic and discovery latency, but it has drawbacks, such as suboptimal 
routes and complex management of the network hierarchical structure due to its dynamic 
nature. The basic idea is to divide the network into clusters or domains, as illustrated in 
Figures 13.29 and 13.30. 

The mobile nodes are grouped into regions, regions are grouped into super-regions, and 
so on, as shown in Figure 13.30. A specific mobile host is chosen as the clusterhead for each 
region. In hierarchical routing, mobile nodes know how to route packets to a destination 
within their own region, but do not know the route outside of their own region. Clusterheads 
know how to reach other regions. 

Figure 13.31 Physical multilevel clustering. [Figure legend: cluster head, gateway node, 
internal node and virtual node; physical radio link and virtual link; <x,y,z> hierarchical ID; 
levels 0 (physical level), 1 and 2; data path from node 5 to node 10, whose hierarchical ID 
is <3,3,10>.] 

Figure 13.31 shows an example of physical clustering in more detail. At level l = 0, we 
have four physical level clusters, C0-1, C0-2, C0-3 and C0-4. Level 1 and level 2 clusters 
are generated by recursively selecting cluster heads. Different clustering algorithms can be 
used for the dynamic creation of clusters and the election of cluster heads [39, 40]. At level 
0 clustering, spread-spectrum radios and CDMA can be introduced for spatial reuse across 
clusters. Within a level 0 cluster, the MAC layer can be implemented by using a variety of 
different schemes (polling, MACA, CSMA, TDMA, etc.). 

Generally, as in ZRP, there are three kinds of nodes in a cluster, namely, the cluster head 
node (e.g. nodes 1-4), gateway nodes (e.g. nodes 6-8 and 11) and internal nodes (e.g. nodes 
5, 9 and 10). The cluster head node acts as a local coordinator of transmissions within the 
cluster. The node IDs shown in Figure 13.31 (at level l = 0) are physical (e.g. MAC layer) 
addresses. They are hardwired and are unique to each node. 

Within a physical cluster, each node monitors the state of the link to each neighbor (i.e. 
up/down state and possibly QoS parameters such as bandwidth) and broadcasts it within 
the cluster. The cluster head summarizes LS information within its cluster and propagates 
it to the neighbor cluster heads (via the gateways). The knowledge of connectivity between 
neighbor cluster heads leads to the formation of level 1 clusters. For example, as shown in 
Figure 13.31, neighbor cluster heads 1 and 2 become members of the level 1 cluster C1-1. 
To carry out LS routing at level 1, an LS parameter of the 'virtual' link in C1-1 between 
nodes 1 and 2 (which are neighbor cluster heads) is calculated from the LS parameters of the 
physical path from cluster head 1 to next cluster head 2 through gateway 6. More precisely, 
gateway 6 passes the LS update for link (6-2) to cluster head 1. Cluster head 1 estimates the 
parameters for the path (1-6-2) using its local estimate for (1-6) and the estimate for (6-2) 
it just received from gateway 6. The result becomes the LS parameter of the 'virtual link' 
between nodes 1 and 2 in C1-1. This is equivalent to the aggregation process discussed in 
Chapter 7. The virtual link can be viewed as a 'tunnel' implemented through lower level 
nodes. 
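
The computation of a virtual link parameter from its physical hops can be sketched as follows. The text does not say which LS parameters are carried or how they are combined, so the choices here are assumptions made for illustration: the LinkState record, the use of delay and bandwidth as the parameters, the additive rule for delay and the bottleneck (minimum) rule for bandwidth, and the numerical values.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class LinkState:
    """Hypothetical LS record for one physical or virtual link."""
    delay_ms: float
    bandwidth_mbps: float


def aggregate(*hops):
    """Combine the LS parameters of consecutive hops into one virtual link.

    Assumed rules: delays add up along the path, while the usable
    bandwidth is limited by the slowest hop.
    """
    return LinkState(
        delay_ms=sum(h.delay_ms for h in hops),
        bandwidth_mbps=min(h.bandwidth_mbps for h in hops),
    )


link_1_6 = LinkState(delay_ms=2.0, bandwidth_mbps=2.0)  # local estimate at cluster head 1
link_6_2 = LinkState(delay_ms=3.0, bandwidth_mbps=1.5)  # reported by gateway 6
virtual_1_2 = aggregate(link_1_6, link_6_2)             # virtual link in C1-1
```

The same function can be applied again at the next level, where the hops being combined are themselves virtual links.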

Applying the aforementioned clustering procedure (aggregation) recursively, new cluster 
heads are elected at each level and become members of the higher level cluster (e.g. node 
1 is elected as a cluster head at level 1 and becomes a member of level 2 cluster C2-1). 

Nodes within a cluster exchange virtual LS information as well as summarized lower-level 
cluster information. After obtaining the LS information at this level, each virtual node 
floods it down to nodes within the lower level cluster. As a result, each physical node has 
'hierarchical' topology information, as opposed to a full topology view as in flat LS 
schemes. The hierarchy so developed requires a new address for each node, the hierarchical 
address. There are many possible solutions for the choice of the hierarchical address scheme. 
In hierarchical state routing (HSR), we define the hierarchical ID (HID) of a node as the 
sequence of the MAC addresses of the nodes on the path from the top hierarchy to the node 
itself. For example, in Figure 13.31 the hierarchical address of node 6 [called HID(6)] is 
<3,2,6>. In this example, node 3 is a member of the top hierarchical cluster (level 2). It is 
also the cluster head of C1-3. Node 2 is a member of C1-3 and is the cluster head of C0-2. 
Node 6 is a member of C0-2 and can be reached directly from node 2. The advantage of 
this hierarchical address scheme is that each node can dynamically and locally update its 
own HID upon receiving the routing updates from the nodes higher up in the hierarchy. The 
hierarchical address is sufficient to deliver a packet to its destination from anywhere in the 
network using HSR tables. 

Referring to Figure 13.31, consider for example the delivery of a packet from node 5 to 
node 10. Note that HID(5) = <1,1,5> and HID(10) = <3,3,10>. The packet is forwarded 
upwards (to node 1) to the top hierarchy by node 5. Node 1 delivers the packet to node 3, 
which is the top hierarchy node for destination 10. Node 1 has a 'virtual link', i.e. a tunnel, 
to node 3, namely, the path (1, 6, 2, 8, 3). It thus delivers the packet to node 3 along this 
path. Finally, node 3 delivers the packet to node 10 along the downwards hierarchical path, 
which in this case reduces to a single hop. 
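
The way a packet climbs and descends the hierarchy can be derived from the two HIDs alone. The sketch below is a simplified illustration, assuming HIDs are given as tuples ordered from the top hierarchy down to the node; the function hierarchical_route is hypothetical, and it returns only the sequence of hierarchy nodes visited, not the physical hops of each virtual link (such as the tunnel 1, 6, 2, 8, 3).

```python
def hierarchical_route(src_hid, dst_hid):
    """Return the hierarchy nodes visited between two nodes, given their HIDs.

    The packet goes up from the source until the two HIDs share a prefix,
    crosses to the destination's branch, and then goes down to the destination.
    """
    # Length of the common prefix, never consuming the node's own ID.
    k = 0
    while k < min(len(src_hid), len(dst_hid)) - 1 and src_hid[k] == dst_hid[k]:
        k += 1
    up = list(reversed(src_hid[k:]))   # source, then its cluster heads
    down = list(dst_hid[k:])           # destination's cluster heads, then destination
    route = []
    for node in up + down:
        if not route or route[-1] != node:  # a head may serve several levels
            route.append(node)
    return route


route_5_to_10 = hierarchical_route((1, 1, 5), (3, 3, 10))
route_6_to_10 = hierarchical_route((3, 2, 6), (3, 3, 10))
```

For the example in the text the result is 5, 1, 3, 10: node 1 appears once although it is the cluster head at two levels, and the final step from node 3 to node 10 is the single hop mentioned above.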

Gateway nodes can communicate with multiple cluster heads and thus can be reached 
from the top hierarchy via multiple paths. Consequently, a gateway has multiple hierarchical 
addresses, similar to a router in the wired Internet (see Chapter 1), equipped with multiple 
subnet addresses. 



13.3.2 Performance examples 

Performance analysis of the system described above can be found in Iwata et al. [41]. 
In most experiments, the network consists of 100 mobile hosts roaming randomly in all 
directions at a predefined average speed in a 1000 x 1000 m area (i.e. no group mobility 
models are used). A reflecting boundary is assumed. The radio transmission range is 120 m. 
A free space propagation channel model is assumed. The data rate is 2 Mb/s. The packet 
length is 10 kb for data, 2 kb for a cluster head neighbor list broadcast, and 500 b for MAC 
control packets. Transmission time is 5 ms for a data packet, 1 ms for a neighboring list, 
and 0.25 ms for a control packet. The buffer size at each node is 15 packets. Figures 13.32 
and 13.33 illustrate the tradeoffs between throughput and control overhead (O/H) in HSR 
when the route refresh rate is varied. 

Figure 13.32 System performance vs the routing information refresh rate with v = 
90 km/h. 
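
The transmission times quoted for the simulation follow directly from the packet lengths and the 2 Mb/s data rate. The short check below assumes decimal prefixes (1 kb = 1000 bits); the function name tx_time_ms is introduced only for this example.

```python
RATE_BPS = 2_000_000  # 2 Mb/s data rate from the simulation setup


def tx_time_ms(packet_bits):
    """Transmission time in milliseconds for a packet of the given length."""
    return packet_bits * 1000 / RATE_BPS


data_ms = tx_time_ms(10_000)      # 10 kb data packet
neighbor_ms = tx_time_ms(2_000)   # 2 kb cluster head neighbor list
control_ms = tx_time_ms(500)      # 500 b MAC control packet
```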



In Figure 13.32 (at 90 km/h), we note that the O/H increases linearly with refresh rate 
until the network becomes saturated with control packets and starts dropping them. The 
data throughput first increases rapidly with the refresh rate, owing to more accurate routes 
and lower packet drops due to the lack of a route. Eventually, throughput peaks and then 
starts decreasing as the network becomes saturated, and data packets are dropped because of 
buffer overflow. Figure 13.33 reports the 'optimal' HSR refresh rate as a function of speed. 

Figure 13.33 Optimum routing information refresh rate vs mobility. 



13.3.3 FSR (fisheye routing) protocol 

This protocol represents a different way to reduce (aggregate) the amount of information 
used for routing purposes. In Kleinrock and Stevens [42], a 'fisheye' technique was proposed 
to reduce the size of information required to represent graphical data. The eye of a fish 
captures with high detail the pixels near the focal point. The detail decreases as the distance 
from the focal point increases. In routing, the fisheye approach translates to maintaining 
accurate distance and path quality information about the immediate neighborhood of a node, 
with progressively less detail as the distance increases. 

The FSR scheme presented in Iwata et al. [41] is built on top of another routing scheme 
called 'global state routing' (GSR) [43]. GSR is functionally similar to LS routing in that 
it maintains a topology map at each node. The key is the way in which routing information 
is disseminated. In LS, LS packets are generated and flooded into the network whenever 
a node detects a topology change. In GSR, LS packets are not flooded. Instead, nodes 
maintain an LS table based on the up-to-date information received from neighboring nodes 
and periodically exchange it with their local neighbors only (no flooding). 

Through this exchange process, the table entries with larger sequence numbers replace 
the ones with smaller sequence numbers. The GSR periodic table exchange resembles the 
DSDV, discussed earlier in this chapter, where the distances are updated according to the time 
stamp or sequence number assigned by the node originating the update. In GSR (like in 
LS), LSs are propagated, a full topology map is kept at each node, and shortest paths are 
computed using this map. 
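
The table exchange rule, in which entries with larger sequence numbers replace older ones, can be sketched as a dictionary merge. The table layout used here (destination mapped to a pair of sequence number and neighbor set) and the function name merge_tables are assumptions for illustration, not the actual GSR message format.

```python
def merge_tables(local, received):
    """Merge a neighbor's LS table into the local one.

    Both tables map a destination to (sequence_number, neighbor_set).
    For each destination the entry with the larger sequence number wins.
    """
    merged = dict(local)
    for dest, (seq, neighbors) in received.items():
        if dest not in merged or seq > merged[dest][0]:
            merged[dest] = (seq, neighbors)
    return merged


local = {"A": (3, {"B"}), "B": (5, {"A", "C"})}
received = {"A": (4, {"B", "C"}), "B": (2, {"A"}), "C": (1, {"A", "B"})}
merged = merge_tables(local, received)
```

In this example the newer entry for A is adopted, the stale entry for B is ignored, and the previously unknown destination C is added, so the topology map converges through purely local exchanges.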

In a wireless environment, a radio link between mobile nodes may experience frequent 
disconnects and reconnects. The LS protocol releases an LS update for each such change, 
which floods the network and causes excessive overhead. GSR avoids this problem by 
using periodic exchange of the entire topology map, greatly reducing the control message 
overhead [43]. The drawbacks of GSR are the large size update message that consumes a 
considerable amount of bandwidth and the latency of the LS change propagation, which 
depends on the update period. This is where the fisheye technique comes to help, by reducing 
the size of update messages without seriously affecting routing accuracy. 

Figure 13.34 illustrates the application of fisheye in a mobile wireless network. The 
circles with different shades of gray define the fisheye scopes with respect to the centre 
node (node 11). The scope is defined as the set of nodes that can be reached within a 
given number of hops. In our case, three scopes are shown for one, two and three hops, 
respectively. Nodes are color-coded as black, gray and white, accordingly. The reduction 
of update message size is obtained by using different exchange periods for different entries 
in the table. More precisely, entries corresponding to nodes within the smaller scope are 
propagated to the neighbors with the highest frequency. Referring to Figure 13.35, entries in 
bold are exchanged most frequently. The rest of the entries are sent out at a lower frequency. 
As a result, a considerable fraction of LS entries are suppressed, thus reducing the message 
size. This strategy produces timely updates from near stations, but creates large latencies 
from stations that are far away. However, the imprecise knowledge of the best path to a 
distant destination is compensated for by the fact that the route becomes progressively 
more accurate as the packet gets closer to its destination. 
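
The scope-dependent exchange periods can be illustrated with a small selection function. Everything numerical here is assumed: the text defines scopes of one, two and three hops but gives no periods, so the values in scope_periods, the destination names and the function entries_to_send are hypothetical.

```python
def entries_to_send(hop_distance, tick, scope_periods):
    """Select the table entries to include in the update sent at time `tick`.

    hop_distance maps each destination to its distance in hops.
    scope_periods is a list of (max_hops, period) pairs in increasing order
    of max_hops; the last pair must cover all remaining destinations.
    """
    selected = []
    for dest, hops in sorted(hop_distance.items()):
        # Exchange period of the innermost scope that contains this destination.
        period = next(p for max_hops, p in scope_periods if hops <= max_hops)
        if tick % period == 0:
            selected.append(dest)
    return selected


scope_periods = [(1, 1), (2, 3), (float("inf"), 6)]  # assumed periods per scope
hop_distance = {"near": 1, "mid": 2, "far": 3}        # assumed distances from the centre node
update_1 = entries_to_send(hop_distance, 1, scope_periods)
update_3 = entries_to_send(hop_distance, 3, scope_periods)
update_6 = entries_to_send(hop_distance, 6, scope_periods)
```

Only every sixth update carries the full table in this configuration, which is the source of the message size reduction described above.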

In summary, FSR scales well to large networks, by keeping LS exchange overhead (O/H) 
low without compromising route computation accuracy when the destination is near. By 
retaining a routing entry for each destination, FSR avoids the extra work of 'finding' the 
destination (as in on-demand routing) and thus maintains low single-packet transmission 
latency. As mobility increases, routes to remote destinations become less accurate. However, 
when a packet approaches its destination, it finds increasingly accurate routing instructions 
as it enters sectors with a higher refresh rate. 

Figure 13.34 Illustration of a fisheye. 

Figure 13.36 shows the increase in the control O/H as a function of number of nodes. 
Geographical node density is kept the same for all runs, as shown in Table 13.2 [41]. One can 
see that as network size grows larger, the fisheye technique aggressively reduces the O/H. 



13.4 MULTIPATH ROUTING 

Multipath routing refers to a routing scheme that uses multiple paths simultaneously, 
splitting the information among them so as to increase the probability that the essential 
portion of the information is received at the destination without incurring excessive delay. 
Such a scheme is needed to mitigate the instability of the topology 
(e.g. failure of links) in an ad hoc network due to nodal mobility and changes in wireless 
propagation conditions. The scheme works by adding an overhead to each packet, which 
is calculated as a linear function of the original packet bits. The process has its analogy in 
coding theory. The resulting packet (information and overhead) is fragmented into smaller 
blocks and distributed over the available paths. The probability of reconstructing the original 
information at the destination is increased as the number of used paths is increased. 

Figure 13.35 Message reduction using fisheye. 

Table 13.2 Node density (nodes vs area) 

Number of nodes   Simulation area 
25                500 x 500 
49                700 x 700 
100               1000 x 1000 
225               1500 x 1500 
324               1800 x 1800 
400               2000 x 2000 

A lot of research has been done in the area of multipath routing in wired networks. 
One of the initial approaches to this problem was dispersity routing [44]. In order 
to achieve self-healing and fault tolerance in digital communication networks, diversity 
coding is suggested in Ayanoglu et al. [45]. In Krishnan and Silvester [46], a per-packet 
allocation granularity for multipath source routing schemes was shown to perform better 
than a per-connection allocation. 

Figure 13.36 Control O/H vs number of nodes. 

An exhaustive simulation of the various tradeoffs associated with dispersity routing is 
presented in Banerjea [47]. The inherent capability of this routing method to provide a 
large variety of services was pointed out. Owing to this fact, numerous schemes employing 
multipath routing have been proposed for wired networks in order to perform QoS routing 
[48-55]. All these protocols are based on proactive routing, since they maintain tables that 
reflect the state of the entire network. For this reason, owing to the unreliability of the 
wireless infrastructure and the nodal mobility, which can trigger an excessive amount of 
updates in the state tables, they cannot be successfully applied to mobile networks. 

The application of multipath techniques in mobile ad hoc networks seems natural, as mul- 
tipath routing allows reduction of the effect of unreliable wireless links and the constantly 
changing topology. The on-demand multipath routing scheme is presented in Nasipuri and 
Das [56] as a multipath extension of dynamic source routing (DSR) [57], described in 
Section 13.1. The alternate routes are maintained, so that they can be utilized when the 
primary one fails. TORA [58], routing on demand acyclic multipath (ROAM) [59] and 
ad hoc on-demand distance vector-backup routing (AODV-BR) [60], which is based on the 
ad hoc on-demand distance vector (AODV) protocol [61], are also examples of schemes 
that maintain multiple routes and utilize them only when the primary route fails. However, 
these protocols do not distribute the traffic into the multiple available paths. 

Another extension of DSR, multiple source routing (MSR) [62], proposes a weighted 
round-robin heuristic-based scheduling strategy among multiple paths in order to distribute 
load, but provides no analytical modeling of its performance. The split multipath routing 
(SMR), proposed in Lee and Gerla [63], focuses on building and maintaining maximally dis- 
joint paths; however, the load is distributed only in two routes per session. In Papadimitratos 
et al. [64], the authors propose a novel and nearly linear heuristic for constructing a highly 
reliable path set. In Pearlman et al. [65], the effect of alternate path routing (APR) on load 
balancing and end-to-end delay in mobile ad hoc networks has been explored. It was argued, 
however, that the network topology and channel characteristics (e.g. route coupling) can 
severely limit the gain offered by APR strategies. In an interesting application [66], multipath 
transport (MPT) is combined with multiple description coding (MDC) in order 
to send video and image information in a multihop mobile radio network. In this section, 
we discuss a multipath scheme for mobile ad hoc networks based on diversity coding [45]. 
Data load is distributed over multiple paths in order to minimize the packet drop rate and 
achieve load balancing in a constantly changing environment. 

Suppose that $n_{\max}$ paths are available for the transmission of data packets from a source to 
a destination. Any of the multipath schemes mentioned in the introduction can be employed 
in order to acquire these paths. No paths have nodes in common (they are mutually disjoint). 

Each path, indexed as $i$, $i = 1, \ldots, n_{\max}$, is either down at the time that the source 
attempts to transmit, with probability of failure $p_i$, or the information is received correctly, 
with probability $1 - p_i$. Since there are no common nodes among the paths, they are 
considered independent in the sense that success or failure of one path cannot imply success 
or failure of another. It should be noted here that, in wireless ad hoc networks, nodes are 
sharing a single channel for transmission, so node disjointness does not guarantee the 
independence of the paths. Taking this into account, the paths are ideally considered independent 
as an approximation of a realistic ad hoc wireless network. For a more realistic modeling 
of the paths in a wireless network, one may refer to Tsirigos and Haas [67], where path 
correlation is included in the analysis. The failure probabilities of the available paths are 
organized in the probability vector $p = \{p_i\}$, in such a way that $p_i \le p_{i+1}$. The vector of 
success probabilities is defined as $q = \{q_i\} = 1 - p = \{1 - p_i\}$. 

Let us now suppose that we have to send a packet of $D$ data bits utilizing the set of 
available independent paths in such a way as to maximize the probability that these bits 
are successfully communicated to the destination. This probability is denoted as $P$. In 
order to achieve this goal, we employ a coding scheme in which $C$ extra bits are added as 
overhead. The resulting $B = D + C$ bits are treated as one network-layer packet. The extra 
bits are calculated as a function of the information bits in such a way that, when splitting the 
$B$-bit packet into multiple equal-size nonoverlapping blocks, the initial $D$-bit packet can be 
reconstructed, given any subset of these blocks with a total size of $D$ or more bits. First, 
we define the overhead factor $r = B/D = b/d$, where $b$ and $d$ take integer values and the 
fraction $b/d$ cannot be further simplified. One should note that $1/r$ would be equivalent to 
the code rate in channel coding theory. 

Next we define the vector $v = \{v_i\}$, where $v_i$ is the number of equal-size blocks that is 
allocated to path $i$. Some of the paths may demonstrate such a poor performance that there 
is no point in using them at all. This means that we might require using only some of the 
available paths. If $n$ is the number of the paths we have to use in order to maximize $P$, it 
would be preferable to define the block allocation vector $v = \{v_i\}$ as a vector of a variable 
size $n$, instead of fixing its size to the number of available paths $n_{\max}$. 

Given the fact that the probability failure vector is ordered from the best path to the 
worst one, a decision to use $n$ paths implies that these paths will be the first $n$ ones. 
Based on these observations, the allocation vector $v = \{v_i\}$ has the following form: 
$v = \{v_1, v_2, \ldots, v_n\}$, $n \le n_{\max}$. 

If the block size is $w$, then $w \sum_{i=1}^{n} v_i = B = rD$. Therefore, the total number of 
blocks that the $B$-bit packet is fragmented into is $a = \sum_{i=1}^{n} v_i = rD/w$. From $p_i \le p_{i+1}$ 
it follows that $v_i \ge v_{i+1}$, because a path with higher failure probability cannot be 
assigned more blocks than a path with a lower failure probability. The original $D$-bit packet 
is fragmented into $N$ $w$-size blocks, $d_1, d_2, d_3, \ldots, d_N$, and the $C$-bit overhead packet 
into $M$ $w$-size blocks, $c_1, c_2, c_3, \ldots, c_M$. Based on this we have $N = D/w = a/r$ and 
$M = C/w = (r - 1)N = (r - 1)a/r$. Path 1 will be assigned the first $v_1$ blocks of the $B$-bit 
sequence, path 2 will receive the next $v_2$ blocks, and so on. Thus, path $i$ will be assigned 
$v_i$ blocks, each block of size $w$. Like parity check bits in error correcting $(N + M, M)$ block 
coding, the overhead symbols are generated as linear combinations of the original packets 
as 

$$c_j = \sum_{i=1}^{N} \beta_{ij} d_i, \quad 1 \le j \le M \qquad (13.1)$$

where multiplication and summation are performed in the Galois field $GF(2^m)$. The relations 
between probability of successful packet transmission $P$, parameters $N$ and $M$ and link 
failure probabilities are available from coding theory and will not be repeated here. One 
of the important results from that theory is that the block size has to satisfy the following 
inequality, so that the original information can be recovered [45]: 

$$w \ge \lceil \log_2(N + M + 1) \rceil \ge \log_2(a + 1) \qquad (13.2)$$

By incorporating the previous definitions in Equation (13.2), we obtain an inequality for 
the number of blocks into which we can split the $B$-bit packet: 

$$B \ge a \log_2(a + 1) = B_{\min} \qquad (13.3)$$
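
A small numerical example may help to tie the block quantities together. The Python sketch below is an illustration under stated assumptions: the function names and all numerical values (D = 60 bits, r = 3/2, w = 10 bits, three paths with failure probabilities 0.1, 0.2 and 0.3) are made up for the example. The success probability is computed by brute-force enumeration of path outcomes, assuming independent paths that each deliver either all or none of their blocks, and using only the stated property that any blocks totalling D or more bits suffice; it is not the closed-form expression from coding theory referred to in the text.

```python
from fractions import Fraction
from itertools import product
from math import ceil, log2


def block_parameters(D, r, w):
    """Return (B, a, N, M) for D data bits, overhead factor r and block size w bits."""
    B = Fraction(r) * D   # B = rD
    a = B / w             # total number of blocks
    N = Fraction(D, w)    # data blocks, N = D/w = a/r
    M = a - N             # overhead blocks, M = (r - 1)N
    if any(x.denominator != 1 for x in (B, a, N)):
        raise ValueError("D, r and w must give whole numbers of bits and blocks")
    return int(B), int(a), int(N), int(M)


def block_size_ok(w, N, M):
    """Check the block size condition w >= ceil(log2(N + M + 1))."""
    return w >= ceil(log2(N + M + 1))


def success_probability(p, v, N):
    """Probability that the surviving paths deliver at least N blocks."""
    total = 0.0
    for outcome in product((True, False), repeat=len(p)):
        delivered = sum(v_i for v_i, up in zip(v, outcome) if up)
        if delivered >= N:
            prob = 1.0
            for p_i, up in zip(p, outcome):
                prob *= (1 - p_i) if up else p_i
            total += prob
    return total


B, a, N, M = block_parameters(D=60, r=Fraction(3, 2), w=10)
B_min = a * log2(a + 1)  # lower bound on B for a blocks
P_uneven = success_probability((0.1, 0.2, 0.3), (4, 3, 2), N)
P_even = success_probability((0.1, 0.2, 0.3), (3, 3, 3), N)
```

With these assumed numbers the packet has B = 90 bits in a = 9 blocks (N = 6, M = 3). The allocation (4, 3, 2) succeeds with probability of about 0.846 and the even allocation (3, 3, 3) with about 0.902, which shows that the choice of the allocation vector matters even when the set of paths is fixed.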



13.5 CLUSTERING PROTOCOLS 
13.5.1 Introduction 

In dynamic cluster-based routing, described so far in this chapter, the network is dynamically 
organized into partitions called clusters, with the objective of maintaining a relatively stable 
effective topology [70]. The membership in each cluster changes over time in response to 
node mobility and is determined by the criteria specified in the clustering algorithm. In 
order to limit far-reaching reactions to topology dynamics, complete routing information 
is maintained only for intracluster routing. Intercluster routing is achieved by hiding the 
topology details within a cluster from external nodes and using hierarchical aggregation, 
reactive routing or a combination of both techniques. The argument made against dynamic 
clustering is that the rearrangement of the clusters and the assignment of nodes to clusters 
may require excessive processing and communications overhead, which outweighs its 
potential benefits. If the clustering algorithm is complex or cannot quantify a measure of 
cluster stability, these obstacles may be difficult to overcome. A desirable design objective 
for an architectural framework capable of supporting routing in large ad hoc networks 
subject to high rates of node mobility is to incorporate the advantages of cluster-based 
routing and to balance the tradeoff between reactive and proactive routing while minimizing the 
shortcomings of each. Furthermore, the consequences of node mobility suggest the need 
to include a quantitative measure of mobility directly in the network organization or path 
selection process. 

Specifically, a strategy capable of evaluating the probability of path availability over time 
and of basing clustering or routing decisions on this metric can help minimize the reaction 
to topological changes. Such a strategy can limit the propagation of far-reaching control 
information while supporting higher quality routing in highly mobile environments. 

In this section we present the (c,t) cluster framework, which defines a strategy for 
dynamically organizing the topology of an ad hoc network in order to adaptively balance the 
tradeoff between proactive and demand-based routing by clustering nodes according to 
node mobility. This is achieved by specifying a distributed asynchronous clustering 
algorithm that maintains clusters which satisfy the (c,t) criteria that there is a probabilistic bound 
c on the mutual availability of paths between all nodes in the cluster over a specified interval 
of time t. In order to evaluate the (c,t) criteria, a mobility model is used that characterizes 
the movement of nodes in large ad hoc networks. It is shown how this model is used to 
determine the probability of path availability when links are subject to failure due to node 
mobility. 

Based on the (c,t) cluster framework, intracluster routing requires a proactive strategy, 
whereas intercluster routing is demand-based. Consequently, the framework specifies an 
adaptive-hybrid scheme whose balance is dynamically determined by node mobility. In 
networks with low rates of mobility, (c,t) clustering provides an infrastructure that is more 
proactive. This enables more optimal routing by increasing the distribution of topology 
information when the rate of change is low. When mobility rates become very high, cluster 
size will be diminished and reactive routing will dominate. The (c,t) cluster framework 
decouples the routing algorithm specification from the clustering algorithm, and thus, it is 
flexible enough to support evolving ad hoc network routing strategies described so far in 
both the intra- and intercluster domains. 

Several dynamic clustering strategies have been proposed in the literature [70-73]. These 
strategies differ in the criteria used to organize the clusters and the implementation of the 
distributed clustering algorithms. McDonald and Znati [74] use prediction of node mobility 
as a criterion for cluster organization. Clustering decisions in [70-73] are based on static 
views of the network at the time of each topology change. Consequently, they do not 
provide for a quantitative measure of cluster stability. In contrast, the (c, t) cluster strategy 
[74] forms the cluster topology using criteria based directly on node mobility. According 
to Ramanathan and Steenstrup [73], the ability to predict the future state of an ad hoc 
network comprising highly mobile nodes is essential if the network control algorithms 
are expected to maintain any substantive QoS guarantees to real-time connections. The 
multimedia support for wireless network (MMWN) system proposed by Ramanathan and 
Steenstrup [73] is based upon a hybrid architecture that includes the characteristics of 
ad hoc and cellular networks. Their framework uses hierarchical routing over dynamic 
clusters that are organized according to a set of system parameters that control the size of 
each cluster and the number of hierarchical levels. Aggregation of routing information is 
used to achieve scalability and limit the propagation of topological change information. A 
multilevel strategy is used to repair virtual circuit (VC) connections that have been disturbed 
due to node mobility. MMWN does not predict node movement. Consequently, it is unable 
to provide a quantitative bound on the stability of its cluster organization. 

Vaidya et al. [72] proposed a scheme that dynamically organizes the topology into 
k-clusters, where nodes in a cluster are mutually reachable via k-hop paths. The algorithm 
considers k = 1 and reduces to finding cliques in the physical topology. Using a first-fit 
heuristic, the algorithm attempts to find the largest cliques possible. Although the algorithm 
does not form optimal clusters, it still requires a three-pass operation each time a topology 
change occurs: one for finding a set of feasible clusters, a second for choosing the largest 
of the feasible clusters that are essential to maintain cluster connectivity, and a third to 
eliminate any existing clusters that are made superfluous by the new clusters. 

The objective of the scheme proposed by Lin and Gerla [70] differs significantly from 
the previous examples. Rather than using clustering to minimize the network's reaction to 
topological changes, their scheme is intended to provide controlled access to the bandwidth 
and scheduling of the nodes in each cluster in order to provide QoS support. Hierarchical 
routing and path maintenance were a secondary concern. The proposed algorithm is very 
simple and uses node ID numbers to deterministically build clusters of nodes that are 
reachable by two-hop paths. 

The zone routing protocol (ZRP), described in Section 13.2, is a hybrid strategy that 
attempts to balance the tradeoff between proactive and reactive routing. The objective of 
ZRP is to maintain proactive routing within a zone and to use a query-response mechanism 
to achieve interzone routing. In ZRP, each node maintains its own hop-count constrained 
routing zone; consequently, zones do not reflect a quantitative measure of stability, and the 
zone topology overlaps arbitrarily. These characteristics differ from (c,t) clusters, which 
are determined by node mobility and do not overlap. Both strategies assume a proactive 
routing protocol for intrazone/cluster routing, and each organizes its topology based upon 
information maintained by that protocol. ZRP also defines the query control scheme to 
achieve interzone routing. Although ZRP is not a clustering algorithm and the (c,t) cluster 
framework is not a routing protocol, the comparison demonstrates a close relationship that 
could be leveraged by incorporating the (c, t) cluster into ZRP. The use of (c, t) clusters in ZRP 
could achieve more efficient and adaptive hybrid routing without significantly increasing 
its complexity. 

13.5.2 Clustering algorithm 

The objective of the clustering algorithm is to partition the network into several clusters. 
Optimal cluster size is dictated by the tradeoff between spatial reuse of the channel (which 
drives toward small sizes) and delay minimization (which drives toward large sizes). Other 
constraints also apply, such as power consumption and geographical layout. Cluster size is 
controlled through the radio transmission power. For the cluster algorithm, we assume that 
transmission power is fixed and is uniform across the network. Within each cluster, nodes 
can communicate with each other in at most two hops. The clusters can be constructed 
based on node ID. 

The following algorithm partitions the multihop network into nonoverlapping clusters. 
It relies on the following operational assumptions, which are common to most radio data 
link protocols [75-78]: 

(A1) Every node has a unique ID and knows the IDs of its one-hop neighbors. This 
can be provided by a physical layer for mutual location and identification of radio 
nodes. 

(A2) A message sent by a node is received correctly within a finite time by all of its 
one-hop neighbors. 

(A3) Network topology does not change during the algorithm execution. 

The distributed clustering algorithm is shown in Figure 13.37. 

Γ: the set of IDs of my one-hop neighbors and myself 
{ 
    if (my_id == min(Γ)) 
    { 
        my_cid = my_id; 
        broadcast cluster(my_id, my_cid); 
        Γ = Γ - {my_id}; 
    } 
    for (;;) 
    { 
        on receiving cluster(id, cid) 
        { 
            set the cluster ID of node id to cid; 
            if (id == cid and (my_cid == UNKNOWN or my_cid > cid)) 
                my_cid = cid; 
            Γ = Γ - {id}; 
            if (my_id == min(Γ)) 
            { 
                if (my_cid == UNKNOWN) my_cid = my_id; 
                broadcast cluster(my_id, my_cid); 
                Γ = Γ - {my_id}; 
            } 
        } 
        if (Γ == ∅) stop; 
    } 
} 

Figure 13.37 Distributed clustering algorithm (cluster ID - cid). (Reproduced by 
permission of IEEE [70].) 

As an example, the topology from Figure 13.38 after clustering is given in Figure 13.39. 
From Figures 13.37-13.39 one can see that the cluster ID of each node is equal to either its 
node ID or the lowest cluster ID of its neighbors. Every node must have its cluster ID once 
it becomes the lowest ID node in its locality. This cluster ID will be broadcast at this time, 
and will not be changed before the algorithm stops. Hence, every node can determine its 
cluster and only one cluster. 
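
The behaviour of the distributed algorithm can be checked on small topologies with a centralized simulation. The following Python sketch is only an illustration under stated assumptions: links are symmetric, the topology is static (A3), and broadcasts are delivered to all one-hop neighbors in FIFO order through a single queue (a simplification of A2). The function name `lowest_id_clustering` and the `adjacency` dictionary are hypothetical and do not come from [70].

```python
from collections import deque

UNKNOWN = None  # cluster ID not yet decided


def lowest_id_clustering(adjacency):
    """Simulate the distributed clustering algorithm on a static graph.

    adjacency maps each node ID to the set of its one-hop neighbour IDs
    (links are assumed symmetric). Returns {node ID: cluster ID}.
    """
    # gamma[n]: IDs of n and its one-hop neighbours that have not yet
    # broadcast their cluster ID.
    gamma = {n: set(nbrs) | {n} for n, nbrs in adjacency.items()}
    cid = {n: UNKNOWN for n in adjacency}
    pending = deque()  # broadcasts in flight: (sender ID, sender cluster ID)

    def broadcast(n):
        if cid[n] is UNKNOWN:
            cid[n] = n  # no lower-ID cluster head was heard: become a head
        pending.append((n, cid[n]))
        gamma[n].discard(n)

    # Nodes with the lowest ID in their neighbourhood start the algorithm.
    for n in sorted(adjacency):
        if n == min(gamma[n]):
            broadcast(n)

    while pending:
        sender, sender_cid = pending.popleft()
        for n in sorted(adjacency[sender]):
            # Adopt the sender as cluster head if it is a head with a lower ID.
            if sender == sender_cid and (cid[n] is UNKNOWN or cid[n] > sender_cid):
                cid[n] = sender_cid
            gamma[n].discard(sender)
            # n broadcasts once all lower-ID neighbours have done so.
            if gamma[n] and n == min(gamma[n]):
                broadcast(n)
    return cid


# Chain topology 1 - 2 - 3 - 4 - 5 (illustrative only).
chain = {1: {2}, 2: {1, 3}, 3: {2, 4}, 4: {3, 5}, 5: {4}}
print(lowest_id_clustering(chain))
```

On the chain, nodes 1, 3 and 5 become cluster heads, and nodes 2 and 4 join the heads with the lower ID among their neighbors, so every node ends up in exactly one cluster.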




Figure 13.38 System topology. 

13.5.3 Clustering with prediction 

13.5.3.1 (c,t) Cluster framework 

The objective of the (c,t) cluster framework is to maintain an effective topology that adapts 
to node mobility so that routing can be more responsive and optimal when mobility rates 
are low and more efficient when they are high. This is accomplished by a simple distributed 
clustering algorithm using a probability model for path availability as the basis for cluster- 
ing decisions. The algorithm dynamically organizes the nodes of an ad hoc network into 
clusters where probabilistic bounds can be maintained on the availability of paths to cluster 
destinations over a specified interval of time. 

The (c,t) cluster framework can also be used as the basis for the development of adaptive 
schemes for probabilistic QoS guarantees in ad hoc networks. Specifically, support for QoS 
in time-varying networks requires addressing: (1) connection-level issues related to path 
establishment and management to ensure the existence of a connection between the source 
and the destination; and (2) packet-level performance issues in terms of delay bounds, 
throughput and acceptable error rates. 

Ideally, it is desirable to guarantee that the QoS requirements of ongoing connections 
are preserved for their entire duration. Unfortunately, this is not possible in a time-varying 
network environment as connections may fail randomly due to user mobility. 

A more realistic and practical approach is to provide some form of probabilistic QoS 
guarantees by keeping connection failures below a prespecified threshold value and by 
ensuring with high probability that a minimum level of bandwidth is always available to 
ongoing connections. 

The basic idea of the (c,t) cluster strategy is to partition the network into clusters of nodes 
that are mutually reachable along cluster internal paths that are expected to be available for 
a period of time t with a probability of at least c. The union of the clusters in a network must 
cover all the nodes in the network. Assume, without loss of generality, that t is identical at 
every node in a cluster. If the cluster's topology remains stable over the interval of length t, 
then routing will be deterministic during this interval, and standard assumptions permit 
the ad hoc network to be modeled as a network of Jackson queues. Assuming that path 
availability is an ergodic process, c represents the average proportion of time a (c,t) path is 
available to carry data. Consequently, c places a lower bound on the effective capacity of 
the path over an interval of length t. 

Let the link capacity be C b/s and the mean packet length be $1/\mu$ bits. The effective 
packet service rate $\mu_{\mathrm{eff}}$ over the interval t can be determined based upon the path 
availability according to Equation (13.4). Based on the Jackson model, each node can be 
treated as an independent M/M/1 queue. Using knowledge of the current aggregate arrival 
rate $\lambda$ and the effective service rate $\mu_{\mathrm{eff}}$, the M/M/1 results can be applied to find the 
mean total packet delay T, Equation (13.5). Since this delay must be less than t, Equation 
(13.6), this approach establishes a lower bound on the path availability, as shown in 
Equation (13.7): 

    $\mu_{\mathrm{eff}} = cC\mu$    (13.4) 

    $T = \frac{1}{\mu_{\mathrm{eff}} - \lambda}$    (13.5) 

    $t > \frac{1}{cC\mu - \lambda}$    (13.6) 

    $c > \frac{1 + \lambda t}{\mu t C}$    (13.7) 

An effective adaptive strategy for determining the value of c controls the minimum level 
of cluster stability required to support the traffic load and QoS requirements of established 
connections. The choice of the parameter t is a system design decision that determines the 
maximum cluster size achievable for different rates of mobility when no traffic exists in the 
network. 
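
As a numerical illustration of Equations (13.4)-(13.7), the following Python sketch computes the bound on c for a given traffic load and checks that the M/M/1 delay at that bound equals t. The function names and all numerical values (1 Mb/s links, 1000-bit packets, 200 packets/s, t = 1 s) are assumptions chosen for illustration only; they are not taken from [74].

```python
def effective_service_rate(c, capacity_bps, mu):
    """Equation (13.4): packet service rate when the path is up a fraction c of the time."""
    return c * capacity_bps * mu


def mean_delay(c, capacity_bps, mu, lam):
    """Equation (13.5): M/M/1 mean total packet delay (requires a stable queue)."""
    mu_eff = effective_service_rate(c, capacity_bps, mu)
    if mu_eff <= lam:
        raise ValueError("unstable queue: effective service rate <= arrival rate")
    return 1.0 / (mu_eff - lam)


def availability_bound(capacity_bps, mu, lam, t):
    """Right-hand side of Equation (13.7)."""
    return (1.0 + lam * t) / (mu * t * capacity_bps)


# Illustrative values (assumptions, not from the text).
C = 1_000_000   # link capacity in b/s
mu = 1 / 1000   # 1/mu = 1000 bits mean packet length
lam = 200.0     # aggregate arrival rate in packets/s
t = 1.0         # interval length in s

c_bound = availability_bound(C, mu, lam, t)
print(c_bound)                          # approximately 0.201
print(mean_delay(c_bound, C, mu, lam))  # approximately t, i.e. the delay limit
```

Any c above the printed bound gives a mean delay below t, which is how the traffic load translates into a minimum level of cluster stability.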



13.5.3.2 (c,t) Cluster algorithm 

There are five events which drive the (c,t) cluster algorithm, namely, node activation, link 
activation, link failure, expiration of the c timer and node deactivation. 



Node activation 

The primary objective of an activating node is to discover an adjacent node and join its 
cluster. In order to accomplish this, it must be able to obtain topology information for the 
cluster from its neighbor and execute its routing algorithm to determine the (c,t) availability 
of all the destination nodes in that cluster. The source node can join a cluster if and only if 
all the destinations are reachable via (c,t) paths. The first step upon node activation is the 
initialization of the source node's CID (cluster ID) to a predefined value that indicates its 
unclustered status. The network-interface layer protocol is required to advertise the node's 
CID as part of the neighbor greeting protocol [79] and in the header of the encapsulation 
protocol. This enables nodes to easily identify the cluster status and membership of neigh- 
boring nodes and of the source of the routing updates - a necessary function to control the 
dissemination of routing information. When its network-interface layer protocol identifies 
one or more neighboring nodes, the source node performs the following actions. First, the 
source node identifies the CIDs associated with each neighbor. Next, it evaluates the link 
availability associated with each neighbor according to either a system default mobility 
profile or mobility information obtained through the network-interface layer protocol or 
physical-layer sensing. The precise methodology and the information required for the 
evaluation of link availability are described later in this section. 

Finally, the neighbors, having discovered the unclustered status of the source node, 
automatically generate and transmit complete cluster topology information, which they 
have stored locally as a result of participating in the cluster's intracluster routing protocol. 
This topology synchronization function is a standard feature of typical proactive routing 
protocols when a router discovers the activation of a link to a new router. The source node 
does not immediately send its topology information to any of the neighbors. 

Link activation 

A link activation detected by a clustered node that is not an orphan is treated as an intracluster 
routing event. Hence, the topology update will be disseminated throughout the cluster. 
In contrast to reactive routing, which responds only after a path failure, the dissemination 
of link activation updates is a key factor in a (c,t) cluster node's ability to find new (c,t) 
paths in anticipation of future link failures or the expiration of the c timer. 

Link failure 

The objective of a node detecting a link failure is to determine if the link failure has caused 
the loss of any (c,t) paths to destinations in the cluster. A node's response to a link failure 
event is twofold. First, each node must update its view of the cluster topology and re-evaluate 
the path availability to each of the cluster destinations remaining in the node's routing table. 
Second, each node forwards information regarding the link failure to the remaining cluster 
destinations. 

Expiration of c timer 

The c timer controls cluster maintenance through periodic execution of the intracluster 
routing algorithm at each node in a cluster. Using the topology information available at 
each node, the current link availability information is estimated and maximum availability 
paths are calculated to each destination node in the cluster. If any of the paths are not (c,t) 
paths, then the node leaves the cluster. 

Node deactivation 

The event of node deactivation encompasses four related events, namely, graceful deacti- 
vation, sudden failure, cluster disconnection and voluntary departure from the cluster. In 
general, each of these events triggers a response by the routing protocol. As a result, nodes 
determine that the node that has deactivated is no longer reachable. 

13.5.3.3 Ad hoc mobility model 

The random ad hoc mobility model used in this section is a continuous-time stochastic 
process, which characterizes the movement of nodes in a two-dimensional space. 

Figure 13.40 Ad hoc mobility model node movement: (a) epoch random mobility vectors; 
(b) ad hoc mobility model node movement. 

Based on the random ad hoc mobility model, each node's movement consists of a sequence 
of random length intervals called mobility epochs during which a node moves in a constant 
direction at a constant speed. The speed and direction of each node vary randomly from 
epoch to epoch. Consequently, during epoch i of duration $T_n^i$, node n moves a distance of 
$V_n^i T_n^i$ in a straight line at an angle of $\theta_n^i$. The number of epochs during an interval of 
length t is the discrete random process $N_n(t)$. Figure 13.40(a) illustrates the movement of 
the node over six mobility epochs, each of which is characterized by its direction, $\theta_n^i$, and 
distance, $V_n^i T_n^i$. The mobility profile of node n moving according to the random ad hoc 
mobility model requires three parameters: $\lambda_n$, $\mu_n$ and $\sigma_n^2$. The following list defines these 
parameters and states the assumptions made in developing this model: 

(1) The epoch lengths are independent and identically distributed (i.i.d.) exponentially 
with mean $1/\lambda_n$. 

(2) The direction of the mobile node during each epoch is i.i.d. uniformly distributed 
over $(0, 2\pi)$ and remains constant only for the duration of the epoch. 

(3) The speed during each epoch is an i.i.d. random variable (e.g. i.i.d. normal, i.i.d. 
uniform) with mean $\mu_n$ and variance $\sigma_n^2$ and remains constant only for the duration 
of the epoch. 

(4) Speed, direction and epoch length are uncorrelated. 

(5) Mobility is uncorrelated among the nodes of a network, and links fail independently. 

Nodes with limited transmission range are assumed to experience frequent random changes 
in speed and direction with respect to the length of time a link remains active between two 
nodes. Furthermore, it is assumed that the distributions of each node's mobility character- 
istics change slowly relative to the rate of link failure. Consequently, the distribution of the 
number of mobility epochs is stationary and relatively large while a link is active. Since 
the epoch lengths are i.i.d. exponentially distributed, $N_n(t)$ is a Poisson process with rate 
$\lambda_n$. Hence, the expected number of epochs experienced by node n during the interval (0, t) 
while a link is active is $\lambda_n t \gg 1$. 

These assumptions reflect a network environment in which there are a large number of 
heterogeneous nodes operating autonomously in an ad hoc fashion, which conceptually 
reflects the environment considered in the design of the (c,t) cluster framework. In order to 
characterize the availability of a link between two nodes over a period of time $(t_0, t_0 + t)$, 
the distribution of the mobility of one node with respect to the other must be determined. 
To characterize this distribution, it is first necessary to derive the mobility distribution of a 
single node in isolation. The single node distribution is extended to derive the joint mobility 
distribution that accounts for the mobility of one node with respect to the other. Using this 
joint mobility distribution, the link availability distribution is derived. 

Let $R_n(t)$ be the resulting random mobility vector of a mobile node which is located at 
position $[X(t_0), Y(t_0)]$ at time $t_0$ and moves according to a random ad hoc mobility profile 
$(\lambda_n, \mu_n, \sigma_n^2)$. The random mobility vector can be expressed as a random sum of the epoch 
random mobility vectors $R_n^i$, 

    $R_n(t) = \sum_{i=1}^{N_n(t)} R_n^i$ 

as shown in Figure 13.40(b). The phase $\theta_n$ of the resultant vector $R_n(t)$ is uniformly 
distributed over $(0, 2\pi)$, and its magnitude, which represents the aggregate distance moved 
by the node, is approximately Rayleigh distributed with parameter 
$\alpha_n = (2t/\lambda_n)(\sigma_n^2 + \mu_n^2)$: 

    $\Pr(\theta_n \le \varphi) = \varphi / 2\pi, \quad 0 \le \varphi \le 2\pi$    (13.8) 

    $\Pr[R_n(t) \le r] \approx 1 - \exp(-r^2/\alpha_n), \quad 0 \le r < \infty$    (13.9) 
The derivation of these distributions is an application of the classic theory of uniform random 
phasor sums [80], which applies the central limit theorem to a large number of i.i.d. variables. 

Joint node mobility 

Based on the assumption of random link failures, we can consider the mobility of two 
nodes at a time by fixing the frame of reference of one node with respect to the other. 
This transformation is accomplished by treating one of the nodes as if it were the base 
station of a cell, keeping it at a fixed position. For each movement of this node, the other 
node is translated an equal distance in the opposite direction. So, the vector 
$R_{m,n}(t) = R_m(t) - R_n(t)$, representing the equivalent random mobility vector of node m with 
respect to node n, is obtained by fixing m's frame of reference to n's position and moving m 
relative to that point. Its phase is uniformly distributed over $(0, 2\pi)$ and its magnitude has 
a Rayleigh distribution with parameter $\alpha_{m,n} = \alpha_m + \alpha_n$. 
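
The Rayleigh approximation can be sanity-checked by Monte Carlo simulation of the mobility model. The Python sketch below is a minimal illustration under stated assumptions: speeds are drawn from a normal distribution (one of the choices the model allows), the last epoch is truncated at the end of the interval, and the parameter values are arbitrary. For a Rayleigh magnitude with the distribution function of Equation (13.9), the mean squared distance equals the Rayleigh parameter, so the printed ratios should be close to 1 for a single node and for the relative motion of two independent nodes. The function names are hypothetical.

```python
import math
import random


def rayleigh_parameter(t, lam, mu, sigma2):
    """Rayleigh parameter (2t / lambda) * (sigma^2 + mu^2) of one node."""
    return (2.0 * t / lam) * (sigma2 + mu * mu)


def mobility_vector(t, lam, mu, sigma2, rng):
    """Sum the epoch mobility vectors of one node over an interval of length t."""
    x = y = 0.0
    remaining = t
    sigma = math.sqrt(sigma2)
    while remaining > 0.0:
        epoch = min(rng.expovariate(lam), remaining)  # exponential epoch, cut at t
        speed = rng.gauss(mu, sigma)                  # assumed normal speed
        theta = rng.uniform(0.0, 2.0 * math.pi)       # uniform direction
        x += speed * epoch * math.cos(theta)
        y += speed * epoch * math.sin(theta)
        remaining -= epoch
    return x, y


def mean_square_distance(vectors):
    return sum(x * x + y * y for x, y in vectors) / len(vectors)


rng = random.Random(1)
t, lam, mu, sigma2 = 100.0, 1.0, 10.0, 4.0  # illustrative values only
alpha = rayleigh_parameter(t, lam, mu, sigma2)

node_m = [mobility_vector(t, lam, mu, sigma2, rng) for _ in range(2000)]
node_n = [mobility_vector(t, lam, mu, sigma2, rng) for _ in range(2000)]
relative = [(xm - xn, ym - yn) for (xm, ym), (xn, yn) in zip(node_m, node_n)]

single_ratio = mean_square_distance(node_m) / alpha
joint_ratio = mean_square_distance(relative) / (2.0 * alpha)  # both nodes share one profile
print(single_ratio, joint_ratio)  # both expected to be close to 1
```

The approximation relies on a large expected number of epochs per interval; with few epochs the truncated last epoch and the non-Gaussian sum make the ratios drift away from 1.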

Random ad hoc link availability 

If $L_{m,n}(t) = 1$ denotes an active and $L_{m,n}(t) = 0$ an inactive link, then for nodes n and m, 
link availability is defined as 

    $A_{m,n}(t) = \Pr[L_{m,n}(t_0 + t) = 1 \mid L_{m,n}(t_0) = 1]$    (13.10) 

Note that a link is still considered available at time t even if it experienced failures during one 
or more intervals $(t_i, t_j)$; $t_0 < t_i < t_j < t_0 + t$. By definition, if m lies within the circular 
region of radius R centered at n, the link between the two nodes is considered to be active. 

Depending on the initial status and location of nodes m and n, two distinct cases of link 
availability can be identified. 

(1) Node activation: node m becomes active at time $t_0$, and it is assumed to be at a 
random location within range of node n. In this case we have 

    (13.11) 

(2) Link activation: node m moves within range of node n at time $t_0$ by reaching the 
boundary defined by R, and it is assumed to be located at a random point around the 
boundary. In this case we have 

    (13.12) 

Random ad hoc path availability 

Let $P_{m,n}^k(t)$ indicate the status of path k from node n to node m at time t: $P_{m,n}^k(t) = 1$ if all 
the links in the path are active at time t, and $P_{m,n}^k(t) = 0$ if one or more links in the path are 
inactive at time t. The path availability $\pi_{m,n}^k(t)$ between two nodes n and m at time $t > t_0$ 
is given by the following probability: 

    $\pi_{m,n}^k(t) = \Pr[P_{m,n}^k(t_0 + t) = 1 \mid P_{m,n}^k(t_0) = 1] = \prod_{(i,j) \in k} A_{i,j}(t_0 + t)$    (13.13) 

If $\pi_{m,n}^k(t)$ is the path availability of path k from node n to node m at time t, then path k is 
defined as a (c,t) path if and only if 

    $\pi_{m,n}^k(t) \ge c$    (13.14) 

Node n and node m are (c,t) available if they are mutually reachable over (c,t) paths. A 
(c,t) cluster is a set of (c,t) available nodes. This definition states that every node in a (c,t) 
cluster has a path to every other node in the cluster that will be available at time $t_0 + t$ with 
a probability of at least c. 
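
Because path availability is the product of the availabilities of independent links, the most available path between two nodes can be found with a shortest-path search on link costs equal to the negative logarithm of the link availability. The Python sketch below illustrates this and the resulting cluster membership test. It is a sketch under assumptions: link availabilities are taken as given inputs, links are symmetric (so reachability in one direction implies mutual reachability), and the names `max_availability_paths` and `can_join` are hypothetical.

```python
import heapq
import math


def max_availability_paths(link_availability, source):
    """Return {node: highest path availability from source}.

    link_availability maps node -> {neighbour: A}, with 0 < A <= 1.
    Maximizing a product of availabilities is equivalent to minimizing
    the sum of -log(A), which Dijkstra's algorithm handles directly.
    """
    cost = {source: 0.0}
    heap = [(0.0, source)]
    done = set()
    while heap:
        d, u = heapq.heappop(heap)
        if u in done:
            continue
        done.add(u)
        for v, a in link_availability.get(u, {}).items():
            nd = d - math.log(a)
            if nd < cost.get(v, math.inf):
                cost[v] = nd
                heapq.heappush(heap, (nd, v))
    return {v: math.exp(-d) for v, d in cost.items()}


def can_join(link_availability, source, cluster_members, c):
    """True if every cluster member is reachable from source with availability >= c."""
    best = max_availability_paths(link_availability, source)
    return all(best.get(m, 0.0) >= c for m in cluster_members)


# Illustrative three-node example: the two-hop path s-a-b (0.9 * 0.9 = 0.81)
# is more available than the direct link s-b (0.7).
links = {
    "s": {"a": 0.9, "b": 0.7},
    "a": {"s": 0.9, "b": 0.9},
    "b": {"s": 0.7, "a": 0.9},
}
print(max_availability_paths(links, "s")["b"])
print(can_join(links, "s", {"a", "b"}, c=0.80))
print(can_join(links, "s", {"a", "b"}, c=0.85))
```

With c = 0.80 the node may join the cluster {a, b}; with c = 0.85 it may not, since the best path to b falls below the bound.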

Path availability cost calculation 

The above discussion demonstrates how the link availability can be calculated, thereby 
providing a link metric that represents a probabilistic measure of path availability. This 
metric can be used by the routing algorithm in order to construct paths that support a lower 
bound c on availability of a path over an interval of length t. The availabilities of each of 
the links along a path are used by the (c,t) cluster protocol to determine if the path is an 
(c,t) path, and consequently, if a cluster satisfies the (c,t) criteria. In order to support this 
functionality in an ad hoc network, the routing protocol must maintain and disseminate the 
following status information for each link: 

(1) the initial link activation time, $t_0$; 

(2) the mobility profiles for each of the adjacent nodes, $(\lambda_i, \mu_i, \sigma_i^2)$, i = m, n; 

(3) the transmission range of each of the adjacent nodes, R; 

(4) the event which activated the link: (a) node activation at time $t_0$ or (b) nodes moving 
into range of each other at time $t_0$. 

Based on this information, any node in a (c,t) cluster can estimate, at any time $\tau$, the 
availability of a link at time $t + \tau$. This can be achieved because each node knows the initial 
link activation time $t_0$; hence, link availability is evaluated over the interval $(t_0, t + \tau)$. 
Nodes can use conditional probability to evaluate the availability of their own links because 
they have direct knowledge of such a link's status at time $\tau$, whereas remote nodes do not. 
Specifically, for an incident link that activated at time $t_0$, a node will evaluate the availability 
at time t, given that it is available at time $\tau > t_0$. 

13.5.3.4 Performance example 

A range of node mobility with mean speeds between 5.0 and 25.0 km/h was simulated in 
McDonald and Znati [74] . The speeds during each mobility epoch were normally distributed, 
and the direction was uniformly distributed over $(0, 2\pi)$. A node activation rate of 250 
nodes/h was used. The mean time to node deactivation was 1 h. Nodes were initially 
randomly activated within a bounded region of 5 x 5 km. Nodes that moved beyond this 
boundary were no longer considered to be part of the ad hoc network and were effectively 
deactivated. (c,t) path availability was evaluated using Dijkstra's algorithm. 

For each simulation run, data was collected by sampling the network status once per 
second over an observation interval of 1 h. The first 2 h of each run were discarded to 
eliminate transient effects, and each simulation was rerun 10 times with new random seeds. 
Simulation results for cluster size and cluster survival times are given in Figure 13.41. 
Finally, the logical relationships among MANET network-layer entities are given in 
Figure 13.42. 

Figure 13.41 Simulation results: (a) cluster size (R = 1000 m); (b) cluster size (R = 
500 m); (c) cluster survival (R = 1000 m); and (d) cluster survival (R = 
500 m). (Reproduced by permission of IEEE [74].) 

Figure 13.42 Logical relationships among MANET network-layer entities. 

13.6 CACHING SCHEMES FOR ROUTING 

A large class of routing protocols for MANETs, namely reactive protocols, employ some 
form of caching to reduce the number of route discoveries. The simplest form of caching 
is based on timeouts associated with cache entries. When an entry is cached, a timer starts. 
When the timeout elapses, the entry is removed from the cache. Each time the entry is used, 
the timer restarts. Therefore, the effectiveness of such a scheme depends on the timeout 
value associated with a cached route. If the timeout is well-tuned, the protocol performance 
improves; otherwise, a severe degradation arises as entries are removed either prematurely 
or too late from the cache. 
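
The timeout mechanism just described can be sketched in a few lines of Python. This is an illustrative sketch only, not the cache of any particular protocol implementation: the class name `TimeoutRouteCache`, the per-destination keying and the injectable `clock` parameter (used so that the behaviour can be exercised without waiting) are assumptions.

```python
import time


class TimeoutRouteCache:
    """Route cache in which an entry expires `timeout` seconds after its last use."""

    def __init__(self, timeout, clock=time.monotonic):
        self.timeout = timeout
        self.clock = clock      # injectable time source (assumption for testing)
        self._entries = {}      # destination -> (route, expiry time)

    def add(self, destination, route):
        # Caching an entry starts its timer.
        self._entries[destination] = (list(route), self.clock() + self.timeout)

    def lookup(self, destination):
        entry = self._entries.get(destination)
        if entry is None:
            return None
        route, expiry = entry
        now = self.clock()
        if now >= expiry:
            # The timeout has elapsed: the entry is removed.
            del self._entries[destination]
            return None
        # Each use of the entry restarts its timer.
        self._entries[destination] = (route, now + self.timeout)
        return route


# Usage with a manually advanced clock.
now = [0.0]
cache = TimeoutRouteCache(timeout=10.0, clock=lambda: now[0])
cache.add("D", ["S", "A", "D"])
now[0] = 6.0
first = cache.lookup("D")    # still valid, timer restarts at t = 6
now[0] = 15.0
second = cache.lookup("D")   # valid only because of the restart at t = 6
now[0] = 30.0
third = cache.lookup("D")    # expired and removed
print(first, second, third)
```

The sketch makes the tuning problem visible: a cached route is trusted for `timeout` seconds regardless of whether its links are still up, so a value that is too large keeps stale routes and a value that is too small discards valid ones.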

13.6.1 Cache management 

A cache scheme is characterized by the following set of design choices that specify cache 
management in terms of space (cache structure) and time (i.e. when to read/add/delete a 
cache entry): store policy, reading policy, writing policy and deletion policy. 

The store policy determines the structure of the route cache. Recently, two different cache 
structures were studied [81], namely link cache and path cache, and applied to DSR. In a 
link cache structure, each individual link in the routes returned in RREP packets is added 
to a unified graph data structure, managed at each node, that reflects the node's current 
view of the network topology. In so doing, new paths can be calculated by merging route 
information gained from different packets. In the path cache, however, each node stores a 
set of complete paths starting from itself. The implementation of the latter structure is easier 
compared with the former, but it does not permit inference of new routes and exploitation 
of all topology information available at a node. 
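
The difference between the two structures can be illustrated with a small Python sketch. It is a simplified illustration, not the DSR implementation studied in [81]: links are assumed bidirectional, a path cache entry is truncated at a broken link, and the link cache finds routes with a breadth-first search. The class and method names are hypothetical.

```python
from collections import deque


class PathCache:
    """Stores complete paths that start at the caching node."""

    def __init__(self):
        self.paths = []

    def add_path(self, path):
        self.paths.append(list(path))

    def remove_link(self, u, v):
        kept = []
        for p in self.paths:
            for i in range(len(p) - 1):
                if {p[i], p[i + 1]} == {u, v}:
                    p = p[: i + 1]  # keep only the part before the broken link
                    break
            if len(p) > 1:
                kept.append(p)
        self.paths = kept

    def find_route(self, dest):
        for p in self.paths:
            if dest in p:
                return p[: p.index(dest) + 1]
        return None


class LinkCache:
    """Merges all learned links into one graph of the known topology."""

    def __init__(self, owner):
        self.owner = owner
        self.graph = {}

    def add_path(self, path):
        for u, v in zip(path, path[1:]):
            self.graph.setdefault(u, set()).add(v)
            self.graph.setdefault(v, set()).add(u)

    def remove_link(self, u, v):
        self.graph.get(u, set()).discard(v)
        self.graph.get(v, set()).discard(u)

    def find_route(self, dest):
        parent = {self.owner: None}
        queue = deque([self.owner])
        while queue:
            u = queue.popleft()
            if u == dest:
                route = []
                while u is not None:
                    route.append(u)
                    u = parent[u]
                return route[::-1]
            for v in sorted(self.graph.get(u, ())):
                if v not in parent:
                    parent[v] = u
                    queue.append(v)
        return None


path_cache, link_cache = PathCache(), LinkCache("S")
for learned in (["S", "A", "B", "D"], ["S", "C", "B", "E"]):
    path_cache.add_path(learned)
    link_cache.add_path(learned)

# The link S-C breaks: the path cache loses its only route to E, whereas the
# link cache infers S-A-B-E by merging links learned from the two paths.
path_cache.remove_link("S", "C")
link_cache.remove_link("S", "C")
print(path_cache.find_route("E"), link_cache.find_route("E"))
```

The price of the extra inference is the graph search on every lookup and the risk of combining links whose validity was learned at different times.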

The reading policy determines the rules for using a cache entry. Besides the straightforward 
use by the source node when sending a new message, several other strategies are possible. 
For example, DSR defines the following policies: 

• cache reply - an intermediate node can reply to a route request with information stored 
in its own cache; 

• salvaging - an intermediate node can use a path from its own cache when a data packet 
meets a broken link on its source route; 

• gratuitous reply - a node runs the interface in the promiscuous mode and it listens for 
packets not directed to itself. If the node has a better route to the destination node of a 
packet, it sends a gratuitous reply to the source node with this new better route. 

The writing policy determines when and which information has to be cached. Owing to 
the broadcast nature of radio transmissions, it is quite easy for a node to learn about new 
paths by running its radio interface in the promiscuous mode. The main problem for the 
writing policy is indeed to cache valid paths. Negative caches are a technique proposed in 
Johnson and Maltz [82] and adapted in Marina and Das [83] to filter out the writing of cache 
entries in DSR. A node stores negative cache entries for broken links, seen either via route 
error control packets or via the link layer, for a period of $\delta t$ seconds. Within this time 
interval, the writing of a new route cache entry that contains a cached broken link is disabled. 
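
A negative cache acting as a write filter can be sketched as follows. The Python code is a minimal illustration of the idea, not the mechanism of [82] or [83] as implemented: the class name `NegativeCache`, the parameter `hold_time` (the period for which a broken link is remembered), the treatment of links as bidirectional and the injectable `clock` are assumptions.

```python
import time


class NegativeCache:
    """Remembers broken links for `hold_time` seconds and vetoes routes using them."""

    def __init__(self, hold_time, clock=time.monotonic):
        self.hold_time = hold_time
        self.clock = clock      # injectable time source (assumption for testing)
        self._broken = {}       # frozenset({u, v}) -> expiry time of the entry

    def report_broken(self, u, v):
        # Called when a route error or a link-layer failure reports link u-v.
        self._broken[frozenset((u, v))] = self.clock() + self.hold_time

    def is_blocked(self, u, v):
        key = frozenset((u, v))
        expiry = self._broken.get(key)
        if expiry is None:
            return False
        if self.clock() >= expiry:
            del self._broken[key]   # negative entry has expired
            return False
        return True

    def may_cache(self, route):
        """A route may be written to the route cache only if it avoids blocked links."""
        return not any(self.is_blocked(u, v) for u, v in zip(route, route[1:]))


now = [0.0]
negative = NegativeCache(hold_time=5.0, clock=lambda: now[0])
negative.report_broken("A", "B")
blocked_now = negative.may_cache(["S", "A", "B", "D"])   # contains the broken link
other_route = negative.may_cache(["S", "C", "D"])        # unaffected route
now[0] = 6.0
allowed_later = negative.may_cache(["S", "A", "B", "D"])  # negative entry expired
print(blocked_now, other_route, allowed_later)
```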

The deletion policy determines which information has to be removed from the cache 
and when. Deletion policy is actually the most critical part of a cache scheme. Two kinds 
of 'errors' can occur, owing to an imprecise erasure: (1) early deletion, a cached route is 
removed when it is still valid; and (2) late deletion, a cached route is not removed even if 
it is no longer valid. 

The visible effect of these kinds of errors is a reduction in the packet delivery fraction 
and an increase in the routing overhead (the total number of overhead packets) [84]. Late 
deletions create the potential risk of an avalanche effect, especially at high load. If a node 
replies with a stale route, the incorrect information may be cached by other nodes and, in 
turn, used as a reply to a discovery. Thus, cache 'pollution' can propagate fairly quickly [83]. 

Caching schemes in DSR 

All such schemes rely on a local timer-based deletion policy [81, 84]. The only exception 
has been proposed in Marina and Das [83]. They introduce a reactive caching deletion 
policy, namely, the wider error notification, that propagates route errors to all the nodes, 
forcing each node to delete stale entries from its cache. 

Simulation results reported in References [81, 83] show that performance of a timer- 
based caching deletion policy is highly affected by the choice of the timeout associated 
with each entry. In the path cache, for a value of timeout lower than the optimal one (i.e. 
early deletion), the packet delivery fraction and routing overhead are worse than caching 
schemes that do not use any timeout. In the link cache, late deletion errors increase the 
routing overhead while the packet delivery fraction falls sharply. 

The cache timeout can obviously be tuned dynamically. However, adaptive timer-based 
deletion policies have their own drawbacks. Such a policy suffers from late or early deletion 
errors during the transition from one optimal timeout value to the next. So, the more 
variable the network and the data load are, the worse the performance will be. 

To reduce the effect of such imprecise deletions, the adaptive timer-based cache scheme 
has been combined with the wider error notification deletion technique and studied for DSR 
in Perkins and Royer [17]. According to such a combined scheme, routes that become stale 
before their timeout expiration are removed reactively from all the sources using that route. 
In this combined technique, however, two points remain unresolved: (1) owing to the 
reactive nature of the deletions, if a cache entry is not used, it remains in the cache even if 
it is no longer valid, and thus it can be used as a reply to a path discovery; and (2) the effect 
of early deletions cannot be avoided. 

Caching schemes in ZRP 

The caching zone with radius k* for a cache leader n is defined as the set of nodes at a 
distance of at most k* hops from n. An active path is created as a result of the discovery 
phase and it is composed of a set of nodes, referred to as active nodes, forming a path from a 
source node S to a destination node D. Cache leader nodes are a subset of the active nodes. 

The key consideration is to avoid the possibility that nodes can cache route information 
autonomously. Therefore, a cache leader n is the only node that is authorized to advertise 
route information inside its caching zone which is written into caches. On receiving the 
advertising message, a node proactively maintains a path to n so that it can be used as the 
next-hop node to any of the advertised routes. A cache leader is responsible for the validity 
of the advertised routes. Thus, it monitors such routes and forces each node in its caching 
zone to remove a route as soon as it becomes stale, so the deletion policy is proactive. Let 
us note that, if we consider k* = k and each node of a ZRP interzone path as a cache leader, 
we get the same underlying zone structure of ZRP (this implies that each active node is a 
cache leader). However, more generally, a cache leader can decide to advertise paths only to 
those nodes located at a distance k* < k, and not all active nodes need to be cache leaders. 

Implementation of C-ZRP 

For simplicity, the implementation assumes: 

(1) k = k*. 

(2) All active nodes act as cache leader nodes and vice versa. 

(3) Only paths to active nodes are advertised as external routes. 

(4) Caches are managed using explicit injection/deletion messages. 

(5) To stop redundant query threads, the LT (loop back termination), QD2 (query 
detection) and ET (early termination) redundant filtering rules, described earlier in 
this chapter, are used. 

When a node S, in Figure 13.43, executes a route request for a node D, an interzone 
path from S to D is identified. A node $B_i$ belonging to an interzone path is an active node 
for the caching scheme. In Figure 13.43 an interzone path between S and D is formed by 
nodes b, e, p and t. Thus, an interzone path is also an active path. An interzone path is stored 
in a distributed next-hop fashion, where the next-hop node is an active node. $B_i$ 
stores $B_{i+1}$ as the next-hop active node for all the downstream nodes from $B_{i+2}$ to $B_{M+1}$, 
and $B_{i-1}$ as the next-hop active node for all the upstream nodes from $B_0$ to $B_{i-2}$. These two 
active nodes will be referred to as companion nodes (as an example, the companion nodes 
of node b, with respect to the interzone path from S to D, are S and e). 

Figure 13.43 An example of values of data structures used in C-ZRP, k = 2. 

All routing information concerning nodes belonging to an interzone path is advertised 
inside the caching zone of each member of the path, which thus acts as cache leader for that 
information. Such routes are then maintained proactively by the IARP. If a new node joins 
$B_i$'s zone, it acquires, by means of the IARP, all the routing information previously 
advertised by $B_i$. Since a node may belong to more than one overlapping zone, it can acquire 
more than a single path to the same destination. 

When a node, say $B_{i+1}$, leaves $B_i$'s routing zone, not all the routing information gathered 
during the route request/reply is lost. Roughly speaking, two active paths, from S to $B_i$ and 
from $B_{i+1}$ to D, are still up. Hence, all the routing information concerning these subpaths is 
still valid. However, nodes $B_0, \ldots, B_i$ ($B_{i+1}, \ldots, B_{M+1}$) notify the nodes inside their own 
zones, using a delete control message, that the destinations $B_{i+1}, \ldots, B_{M+1}$ ($B_0, \ldots, B_i$) are 
no longer reachable. 



Data structures 

Each node X uses the following local data structures: 

• Internal zone routing table (IZT) - an entry of IZT is a triple (d, n, #h), where d is the 
destination node, n is the next-hop node (located in X's transmission range), and #h is 
the path cost in number of hops. 

• External zone routing table (EZT) - a row of EZT is a triple (d, n, #z), where d is the 
destination node, n is the next-hop active node (n belongs to X's routing zone and is not 
restricted to be in its transmission range), and #z is the cost of the path from X to d, given 
as the number of active nodes that have to be traversed. For example, in Figure 13.43, 
node b sets node e as the next-hop active node for p with cost two (nodes e and p). 

• Interzone path table (IZP) - an interzone path corresponds to an entry in X's IZP table 
provided that X is an active node and $X \neq S, D$. In this case, let the path id be ID and 
X = $B_i$. The entry is the triple (ID, $B_{i-1}$, $B_{i+1}$). 

• Reachable nodes (RN) list - this is a sequence of pairs (d, #z), where d is an active node 
belonging to an interzone path and #z is the cost of the path from X, expressed as the number 
of active nodes that must be traversed to reach d. A node X advertises RN to the nodes 
belonging to its routing zone $Z_k(X)$. RN includes the projection of EZT along the first and 
third components. For example, node b of Figure 13.43 will include the pairs (p, 2), (t, 3) 
and (D, 4) in RN. 

• Unreachable nodes (UN) set - this set of nodes is used to advertise destinations that 
become unreachable. 
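
As a rough illustration of the tables listed above, the following Python sketch holds them in one record per node and derives RN as the projection of EZT. The class name `NodeTables`, the field names and the choice of dictionaries keyed by destination are assumptions made for this sketch only (a real node may hold several routes to one destination); the example values follow node b on the path S, b, e, p, t, D of Figure 13.43.

```python
from dataclasses import dataclass, field


@dataclass
class NodeTables:
    """Local C-ZRP state of one node X (class and field names are illustrative)."""
    izt: dict = field(default_factory=dict)  # d -> (next hop, cost in hops)
    ezt: dict = field(default_factory=dict)  # d -> (next-hop active node, cost in active nodes)
    izp: dict = field(default_factory=dict)  # path ID -> (upstream companion, downstream companion)
    un: set = field(default_factory=set)     # destinations to advertise as unreachable

    def reachable_nodes(self):
        """RN as the projection of EZT onto (destination, cost), nearest first."""
        pairs = [(d, cost) for d, (_next_hop, cost) in self.ezt.items()]
        return sorted(pairs, key=lambda pair: pair[1])


# Node b of Figure 13.43: companions S and e, next-hop active node e.
b = NodeTables()
b.izp["ID"] = ("S", "e")
b.ezt.update({"p": ("e", 2), "t": ("e", 3), "D": ("e", 4)})
print(b.reachable_nodes())
```

For node b this yields the pairs (p, 2), (t, 3) and (D, 4) quoted in the text.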



Interzone path creation 

A single interzone path from S to D is created during a route request/reply cycle by allowing 
only the destination D to send a single reply for a given request. The path is tagged with a 
unique identifier ID, for example, obtained by using increasing sequence numbers generated 
by the requesting node. When S triggers a new route discovery for a node D, it bordercasts 
a query message to all its border nodes. The message contains the identifier ID and a route 
accumulation vector AV[ ], initialized with AV[0] = S. Let M be the number of active nodes 
(not including S and D). 

(1) When a border node $X \neq D$ receives a query message, if the message is received for 
the first time and the redundant query filter rules are satisfied: 

(a) It adds its own identification to the accumulation vector. As an example, if the 
node X corresponds to node $B_j$ in the interzone path, then AV[j] = X. 

(b) If D belongs to X's routing zone, then X unicasts the query message to 
D. Otherwise, it executes a bordercast. 

(2) When the destination node D receives a query message with identifier ID for the 
first time: 

(a) It stores the tuples (AV[i], AV[M], M + 1 - i), for $0 \le i \le M - 1$, in EZT. 

(b) It prepares the list RN = [(AV[i], M + 1 - i)], for $0 \le i \le M$. 

(c) It sets AV[M + 1] = D. 

(d) It sends a reply message to AV[M]. The message contains the AV vector accumulated 
in the query message. 

An example of path creation is given in Figure 13.44(a). 

(3) When a border node $B_j$ receives a reply message: 

(a) If $B_j \neq S$, then it stores the triple (ID, AV[j - 1], AV[j + 1]) in the IZP table, 
thus becoming an active node. 

(b) It stores the following tuples in EZT: (AV[i], AV[j - 1], j - i), for $0 \le i \le j - 2$; 
(AV[i], AV[j + 1], i - j), for $j + 2 \le i \le M + 1$. 

(c) It prepares RN = [(AV[j + i], |i|)], for $-j \le i \le M + 1 - j$, $i \neq 0$. 

(d) If $B_j \neq S$, then it forwards the reply message to the node AV[j - 1]. 

Figure 13.44 An example of interzone path creation and deletion. [Panel (a): the path 
S, B1, B2, B3, B4, D, with the accumulation vector growing hop by hop: AV = [S], 
AV = [S,B1], AV = [S,B1,B2].] 

Figure 13.44(b) shows the state at node B2 after the reception of the reply message with 
AV = [S,B1,B2,B3,B4,D], which causes the execution of the following actions: 

(1) B2 becomes a member of an interzone path [it stores the triple (ID,B1,B3) in IZP]. 

(2) B2 adds the entries (S,B1,2), (B4,B3,2), (D,B3,3) in EZT. 

(3) B2 prepares the list of reachable nodes, RN = [(S,2),(B1,1),(B3,1),(B4,2),(D,3)]. 

(4) B2 forwards the reply to B1. 
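
The reply handling at an active node can be sketched in a few lines of Python. This is only an illustration of step (3) under the reading that the cost of a downstream entry is i - j (which is what the B2 example gives); the function name `process_reply` and the plain lists and dictionaries are assumptions of the sketch, and it is meant for nodes with $0 \le j \le M$ (the destination itself never receives the reply).

```python
def process_reply(av, j, path_id="ID"):
    """Tables built by node B_j = av[j] when a reply carrying av arrives."""
    last = len(av) - 1                     # index M + 1, i.e. the destination D
    izp = {}
    if j > 0:                              # B_j != S: becomes an active node
        izp[path_id] = (av[j - 1], av[j + 1])
    ezt = []
    for i in range(0, j - 1):              # upstream nodes B_0 .. B_{j-2}
        ezt.append((av[i], av[j - 1], j - i))
    for i in range(j + 2, last + 1):       # downstream nodes B_{j+2} .. B_{M+1}
        ezt.append((av[i], av[j + 1], i - j))
    # Reachable nodes: every other node of the path with its distance |i - j|.
    rn = [(av[i], abs(i - j)) for i in range(last + 1) if i != j]
    forward_to = av[j - 1] if j > 0 else None
    return izp, ezt, rn, forward_to


av = ["S", "B1", "B2", "B3", "B4", "D"]
izp, ezt, rn, forward_to = process_reply(av, j=2)
print(izp, ezt, rn, forward_to)
```

With j = 2 the sketch reproduces the four actions listed for B2: the IZP triple (ID,B1,B3), the three EZT entries, the five-element RN list, and B1 as the node the reply is forwarded to.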

Interzone path deletion 

An interzone path is broken at node $B_j$ when $B_{j-1}$ (or $B_{j+1}$) is no longer in $B_j$'s routing zone. 
In this case, the path is divided into two subpaths and the source node is notified with an 
error message. An active node $B_j$ executes the following actions (in the following notation 
'-' means any): 

(1) Deletes the entry (-, $B_{j-1}$, -) or (-, $B_{j+1}$, -) from EZT. 

(2) Checks for the companion node $B_{j+1}$ or $B_{j-1}$ in the IZP table. 

(3) If the companion node is found, then it prepares the following list of unreachable 
nodes: UN = [$B_0, B_1, \ldots, B_{j-1}$] (UN = [$B_{j+1}, B_{j+2}, \ldots, B_{M+1}$]), and sends a 
Delete_Path message, containing UN and the path identifier ID, to the companion 
node. 

(4) Deletes the entry (ID, $B_{j-1}$, $B_{j+1}$) from IZP after the successful transmission of the 
message. 

When an active path is broken, the source node either receives the Delete_Path message 
from $B_1$ [if the link is broken between ($B_j$, $B_{j+1}$), with j > 0], or is able to detect the break 
autonomously via IARP. The source node thus triggers a new route discovery if it is required 
to send other packets, while the two subpaths ($B_0, B_1, \ldots, B_j$ and $B_{j+1}, B_{j+2}, \ldots, B_{M+1}$) 
remain active. Figure 13.44(c) shows the case when the 'link' between B2 and B3 is broken 
(i.e. their distance becomes higher than k). Two interzone subpaths, (S,B1,B2) and (B3,B4,D), 
are generated. In the figure, B2's EZT data structure is also shown. 

When an active node receives a Delete_Path message from one of its companion nodes 
X, it deletes the entries stored in the UN list from EZT and forwards the message to the 
other companion node. If the receiving node has another route to a node stored in UN, 
then it does not include such a node when forwarding UN. 
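
The effect of a break between $B_j$ and $B_{j+1}$ can be illustrated with a short Python sketch. The helper names `split_path` and `forward_un` are hypothetical, and message transmission is reduced to returning the lists that would be carried by the Delete_Path messages.

```python
def split_path(av, j):
    """Break the link (B_j, B_{j+1}): return the two subpaths and the UN lists."""
    upstream, downstream = av[: j + 1], av[j + 1:]
    return {
        "subpaths": (upstream, downstream),
        # B_j reports the downstream nodes to its upstream companion B_{j-1}.
        "un_sent_upstream": list(downstream),
        # B_{j+1} reports the upstream nodes to its downstream companion B_{j+2}.
        "un_sent_downstream": list(upstream),
    }


def forward_un(un, still_reachable):
    """Drop from UN the nodes the receiver can still reach over another route."""
    return [d for d in un if d not in still_reachable]


result = split_path(["S", "B1", "B2", "B3", "B4", "D"], j=2)
print(result["subpaths"])
print(forward_un(result["un_sent_upstream"], still_reachable={"B4"}))
```

For the break between B2 and B3 this gives the subpaths (S,B1,B2) and (B3,B4,D); a receiver that has another route to B4 forwards only B3 and D.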

Cache management 

In order to allow all the nodes of $B_j$'s routing zone to use the acquired information, $B_j$ 
broadcasts RN inside its zone. Such a message is referred to as the inject message. On 
receiving an inject message carrying the reachable node list RN from a node X = $B_j$, a 
node Y creates a set of entries (RN[i].d, X, RN[i].#z) in its own EZT, $0 \le i < |RN|$, where 
RN[i].d is the first component (destination node) of the ith pair of RN, RN[i].#z is the second 
component (i.e. the length), and |RN| is the number of elements of RN. Figure 13.45(a) 
shows node B2 injecting the external routes to nodes S, B1, B3, B4 and D into its zone. Note that 
Y now has two routes to node B1, since such a node is in Y's routing zone. 

Figure 13.45 An example of (a) injection and (b) deletion of external nodes. 



Deleting external routes 

When a node $B_j$ either detects a path breakage or receives a Delete_Path message, it 
broadcasts a Delete message into its zone containing the list of unreachable nodes UN. When 
an internal node receives a Delete message, it deletes all the matching entries from EZT. 
Figure 13.45(b) shows the delete mechanism on node Y. 
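
The injection and deletion of external routes at an internal node Y can be sketched as two small handlers. This is a minimal Python illustration, not the protocol's message format: EZT is modelled as a set of triples, the function names are hypothetical, and 'matching entries' is read here as entries learned from the advertising cache leader whose destination appears in UN, which is an assumption of the sketch.

```python
def on_inject(ezt, leader, rn):
    """Add one EZT entry (d, leader, cost) for each pair of the injected RN list."""
    for d, cost in rn:
        ezt.add((d, leader, cost))


def on_delete(ezt, leader, un):
    """Remove the entries advertised by `leader` whose destination is listed in UN."""
    stale = {entry for entry in ezt if entry[1] == leader and entry[0] in un}
    ezt.difference_update(stale)


# Node Y in B2's zone: injection of B2's RN, then deletion after the B2-B3 break.
ezt_y = set()
on_inject(ezt_y, "B2", [("S", 2), ("B1", 1), ("B3", 1), ("B4", 2), ("D", 3)])
on_delete(ezt_y, "B2", {"B3", "B4", "D"})
print(sorted(ezt_y))
```

After the deletion only the routes towards S and B1 via B2 remain in Y's EZT.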



13.7 DISTRIBUTED QoS ROUTING 



In this section a distributed QoS routing scheme for ad hoc networks is discussed. Two 
routing problems are presented, delay constrained least cost routing (DCLC) and bandwidth 
constrained least cost routing (BCLC). As before, the path that satisfies the delay (or 
bandwidth) constraint is called a feasible path. The algorithms can tolerate the imprecision 
of the available state information. Good routing performance in terms of success ratio, 
message overhead and average path cost is achieved even when the degree of information 
imprecision is high. Note that the problem of information imprecision exists only for QoS 
routing; all best-effort routing algorithms, such as DSR and ABR, do not consider this 
problem because they do not need the QoS state in the first place. Multipath parallel routing 
is used to increase the probability of finding a feasible path. In contrast to the flooding-based 
path discovery algorithms, these algorithms search only a small number of paths, which 
limits the routing overhead. In order to maximize the chance of finding a feasible path, the 
state information at the intermediate nodes is collectively utilized to make intelligent 
hop-by-hop path selection. The logic behind this is closely analogous to using the Viterbi 
algorithm instead of the ML (maximum likelihood) algorithm in trellis-based demodulation 
processes. 

The algorithms consider not only the QoS requirements, but also the optimality of the 
routing path. Low-cost paths are given preference in order to improve the overall network 
performance. In order to reduce the level of QoS disruption, fault-tolerance techniques are 
brought in for the maintenance of the established paths. Different levels of redundancy 
provide a tradeoff between reliability and overhead. The dynamic path repairing 
algorithm repairs the path at the breaking point, shifts the traffic to a neighbor node, and 
reconfigures the path around the breaking point without rerouting the connection along a 
completely new path. Rerouting is needed in two cases. One case is when the primary path 
and all secondary paths are broken. The other case is when the cost of the path grows large 
and hence it becomes beneficial to route the traffic to another path with a lower cost. 

13.7.1 Wireless links reliability 

One element of the cost function will be reliability of the wireless links. The links between 
the stationary or slowly moving nodes are likely to exist continuously. Such links are called 
stationary links. The links between the fast moving nodes are likely to exist only for a short 
period of time. Such links are called transient links. A routing path should use stationary links 
whenever possible in order to reduce the probability of a path breaking when the network 
topology changes. A stationary neighbor is connected to a node with a stationary link. As 
in Chapter 7, delay of path P between two nodes equals the sum of the link delays on the 
path between the two nodes and will be denoted as delay(P). Similarly bandwidth(P) equals 
the minimum link bandwidth on the path P, and cost(P) equals the sum of the link costs. 

13.7.2 Routing 

Given a source node s, a destination node t, and a delay requirement D, the problem of 
delay-constrained routing is to find a feasible path P from s to t such that $delay(P) \le D$. 
When there are multiple feasible paths, we want to select the one with the least cost. Another 
problem is bandwidth-constrained routing, i.e. finding a path P such that $bandwidth(P) \ge B$, 
where B is the bandwidth requirement. When there are multiple such paths, the one with 
the least cost is selected. Finding a feasible path is only the first part of the problem. 
The second part is to maintain the path when the network topology changes. 
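
To make the two problem statements concrete, the following Python sketch evaluates delay(P), bandwidth(P) and cost(P) for every loop-free path of a tiny graph and picks the least-cost feasible one. It is a brute-force reference under assumed data (the graph, the link attribute names `delay`, `bw`, `cost` and the function names are all hypothetical), not the distributed algorithm of this section, and it is practical only for very small topologies.

```python
def simple_paths(graph, s, t, path=None):
    """Yield every loop-free path from s to t."""
    path = (path or []) + [s]
    if s == t:
        yield path
        return
    for nxt in graph[s]:
        if nxt not in path:
            yield from simple_paths(graph, nxt, t, path)


def metrics(graph, path):
    """delay(P) = sum of link delays, bandwidth(P) = min link bandwidth, cost(P) = sum of costs."""
    links = [graph[u][v] for u, v in zip(path, path[1:])]
    return (sum(l["delay"] for l in links),
            min((l["bw"] for l in links), default=float("inf")),
            sum(l["cost"] for l in links))


def least_cost_feasible(graph, s, t, max_delay=None, min_bw=None):
    """Return (cost, path) of the cheapest path meeting the constraint, or None."""
    best = None
    for p in simple_paths(graph, s, t):
        delay, bw, cost = metrics(graph, p)
        if max_delay is not None and delay > max_delay:
            continue
        if min_bw is not None and bw < min_bw:
            continue
        if best is None or cost < best[0]:
            best = (cost, p)
    return best


graph = {
    "s": {"a": {"delay": 10, "bw": 5, "cost": 1}, "b": {"delay": 2, "bw": 2, "cost": 4}},
    "a": {"t": {"delay": 10, "bw": 5, "cost": 1}},
    "b": {"t": {"delay": 3, "bw": 2, "cost": 4}},
    "t": {},
}
print(least_cost_feasible(graph, "s", "t", max_delay=10))
print(least_cost_feasible(graph, "s", "t", min_bw=3))
```

In this toy graph the cheap path s-a-t violates a delay bound of 10, so the delay-constrained query falls back to the costlier path s-b-t, while the bandwidth-constrained query selects s-a-t.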

13.7.3 Routing information 

The following end-to-end state information is required to be maintained at every node i for 
every possible destination t. The information is updated periodically by a distance-vector 
protocol discussed in Section 13.1: 

(1) Delay variation - $\Delta D_i(t)$ keeps the estimated maximum change of $D_i(t)$ before the 
next update. That is, based on the recent state history, the actual minimum end-to-end 
delay from i to t is expected to be between $D_i(t) - \Delta D_i(t)$ and $D_i(t) + \Delta D_i(t)$ in 
the next update period. 

(2) Bandwidth variation - $\Delta B_i(t)$ keeps the estimated maximum change of $B_i(t)$ before 
the next update. The actual maximum bandwidth from i to t is expected to be between 
$B_i(t) - \Delta B_i(t)$ and $B_i(t) + \Delta B_i(t)$ in the next period. 

(3) The cost metric $C_i(t)$ is used for optimization, in contrast to the delay and bandwidth 
metrics used in QoS constraints. 

Consider an arbitrary update of $\Delta D_i(t)$ and $D_i(t)$. Let $\Delta D_i(t)$ and $\Delta D_i'(t)$ be the values 
of the delay variation before and after the update, respectively. Similarly, let $D_i(t)$ and 
$D_i'(t)$ be the values of the delay before and after the update, respectively. $D_i'(t)$ is provided 
by a distance-vector protocol. $\Delta D_i'(t)$ is calculated as follows: 

$$\Delta D_i'(t) = \alpha\,\Delta D_i(t) + (1 - \alpha)\,\left|D_i'(t) - D_i(t)\right| \tag{13.15}$$

The factor $\alpha$ (< 1) determines how fast the history information $\Delta D_i(t)$ is forgotten, and 
$1 - \alpha$ determines how fast $\Delta D_i'(t)$ converges to $|D_i'(t) - D_i(t)|$. By the previous formula, it 
is still possible for the actual delay to be out of the range $[D_i(t) - \Delta D_i(t),\ D_i(t) + \Delta D_i(t)]$. 

One way to make such probability sufficiently small is to enlarge $\Delta D_i(t)$. Hence, we 
shall modify the formula and introduce another factor $\beta$ (> 1): 

$$\Delta D_i'(t) = \alpha\,\Delta D_i(t) + (1 - \alpha)\,\beta\,\left|D_i'(t) - D_i(t)\right| \tag{13.16}$$

$\Delta D_i'(t)$ converges to $\beta\,|D_i'(t) - D_i(t)|$ at a speed determined by $1 - \alpha$. 

13.7.4 Token-based routing 

There are numerous paths from s to t. We shall not randomly pick several paths to search. 
Instead, we want to make an intelligent hop-by-hop path selection to guide the search along 
the best candidate paths. This is analogous to what the Viterbi algorithm does when it avoids 
searching through all possible trajectories of a trellis (the ML approach) in a signal 
demodulation/decoding process. 

The basic idea of token-based probing is outlined below. A token is the permission to 
search one path. The source node issues a number of tokens based on the available state 
information. One guideline is that more tokens are issued for the connections with tighter 
requirements. Probes (routing messages) are sent from the source toward the destination to 
search for a low-cost path that satisfies the QoS requirement. Each probe is required to carry 
at least one token. At an intermediate node, a probe with more than one token is allowed to 
be split into multiple ones, each searching a different downstream subpath. The maximum 
number of probes at any time is bounded by the total number of tokens. Since each probe 
searches a path, the maximum number of paths searched is also bounded by the number 
of tokens. See Figure 13.46 for an example. Upon receipt of a probe, an intermediate node 
decides, based on its state: (1) whether the received probe should be split; and (2) to which 
neighbor nodes the probe(s) should be forwarded. The goal is to collectively utilize the 
state information at the intermediate nodes to guide the limited tokens (and the probes 
carrying them) along the best paths to the destination, so that the probability of finding a 
low-cost feasible path is maximized. 

Figure 13.46 Generation of probes p (number of tokens). 

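
Returning to the delay-variation update of Section 13.7.3, the recursion with the forgetting factor and the enlargement factor can be sketched in Python as below. The function name and the default values of `alpha` and `beta` are placeholders chosen for illustration; setting `beta = 1` gives the first form of the update and `beta > 1` the enlarged one.

```python
def update_delay_variation(var_old, d_old, d_new, alpha=0.5, beta=1.5):
    """One update of the delay variation from the old and new delay estimates.

    var_old: variation before the update; d_old, d_new: delay before and after.
    alpha (< 1) weights the history, beta (>= 1) enlarges the tolerated range.
    """
    return alpha * var_old + (1.0 - alpha) * beta * abs(d_new - d_old)


# A steady change of 6 ms per update: the variation approaches beta * 6 = 9 ms.
variation = 0.0
for _ in range(60):
    variation = update_delay_variation(variation, 20.0, 26.0)
print(variation)
```

The loop illustrates the convergence statement: with a constant change between updates, the stored variation approaches the enlargement factor times that change, at a rate set by one minus the forgetting factor.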
13.7.5 Delay-constrained routing 

When a connection request arrives at the source node, a certain number $N_0$ of tokens are 
generated, and probes are sent toward the destination t. Each probe carries one or more 
tokens. Since no new tokens are allowed to be created by the intermediate nodes, the total 
number of tokens is always $N_0$, and the number of probes is at most $N_0$ at any time. When a 
node receives a probe p with N(p) tokens, it makes at most N(p) copies of p, distributes the 
received tokens among the new probes, and then forwards them along the selected outgoing 
links toward t. Each probe accumulates the delay of the path it has traversed so far. A probe 
can proceed only when the accumulated delay does not violate the delay requirement. 
Hence, any probe arriving at the destination detects a feasible path, which is the one it has 
traversed. 

There are two basic guidelines for how to determine $N_0$ and how to distribute the tokens 
in a received probe among the new probes: 

(1) Different numbers of tokens are assigned to different connections based on their 
'needs'. For a connection whose delay requirement is large and can be easily satisfied, 
one token is issued to search a single path; for a connection whose delay requirement 
is smaller, more tokens are issued to increase the chance of finding a feasible path; 
for a connection whose delay requirement is too small to be satisfied, no tokens are 
issued, and the connection is immediately rejected. 

(2) When a node i forwards the received tokens to its neighbors, the tokens are distributed 
unevenly among the neighbors, depending on their chances of leading to reliable 
low-cost feasible paths. A neighbor having a smaller end-to-end delay to the destination 
should receive more tokens than a neighbor having a larger delay; a neighbor 
having a smaller end-to-end cost to the destination should receive more tokens than a 
neighbor having a larger cost; a neighbor having a stationary link to i should be given 
preference over a neighbor having a transient link to i. Note that some neighbors 
may not receive any tokens because i may have only a few or just one token to 
forward. 



13.7.6 Tokens 

The two types of tokens, constraint (limitation) tokens (CT) and optimization tokens (OT), 
have different purposes. CT tokens prefer paths with smaller delays, so that the chance of 
satisfying a given delay (constraint) requirement is higher. OT tokens prefer the paths with 
smaller costs. The overall strategy is to use the more aggressive OT tokens to find a low-cost 
feasible path with relatively low success probability and to use the CT tokens as a backup 
to guarantee a high success probability of finding a feasible path. The numbers of CT tokens, 
L, and OT tokens, O, are determined based on the delay requirement D. If D is very large and 
can definitely be satisfied, a single CT token will be sufficient to find a feasible path. If D is 
too small to be possibly satisfied, no CT token is necessary, and the connection is rejected. 
Otherwise, more than one CT token is issued to search multiple paths for a feasible one. 
Based on the previous guideline, the linear token curve shown in Figure 13.47(a) was chosen 
in Chen and Nahrstedt [85] for simplicity and efficient computation. 

Parameter $\Phi$ is a system parameter specifying the maximum allowable number of CT tokens. 
The curve shows that more CT tokens are assigned for smaller D. The number of optimization 
tokens O is also determined based on the delay requirement D, as shown in Figure 13.47(b). 




Figure 13.47 Token curves. 

13.7.7 Forwarding the received tokens 

13.7.7.1 Candidate neighbors 

If L + O = 0, the connection request is rejected. Otherwise, probes carrying the tokens 
are sent from s to t. A probe proceeds only when the path has a delay of no more than D. 
Hence, once a probe reaches t, it detects a delay-constrained path. Each probe accumulates 
the delay of the path it has traversed so far. A data field, denoted as delay(p), is defined 
in a probe p. Initially, $delay(p) \leftarrow 0$; whenever p proceeds over another link (i, j), 
$delay(p) \leftarrow delay(p) + delay(i, j)$. Suppose a node i receives a probe p with L(p) constraint 
tokens and O(p) optimization tokens. Suppose k is the sender of the probe p. The set $R_i^p(t)$ of 
candidate neighbors, to which i will forward the received tokens, is determined as follows. 
We first consider only the stationary neighbors ($V_i^s$) of i. Let 

$$R_i^p(t) = \{\, j \mid \mathrm{delay}(p) + \mathrm{delay}(i,j) + D_j(t) - \Delta D_j(t) \le D,\ j \in V_i^s - \{k\} \,\}$$

$R_i^p(t)$ is the set of neighbors to which the tokens should be forwarded. If $R_i^p(t) = \emptyset$, we 
take the transient neighbors into consideration and redefine $R_i^p(t)$ to be 

$$R_i^p(t) = \{\, j \mid \mathrm{delay}(p) + \mathrm{delay}(i,j) + D_j(t) - \Delta D_j(t) \le D,\ j \in V_i - \{k\} \,\}$$

If we still have $R_i^p(t) = \emptyset$, all received tokens are invalidated and discarded. If 
$R_i^p(t) \neq \emptyset$, then for every $j \in R_i^p(t)$, i makes a copy of p, denoted as $p_j$. Let $p_j$ have 
$L(p_j)$ constraint tokens and $O(p_j)$ optimization tokens. These parameters are calculated as 

$$L(p_j) = \frac{[\mathrm{delay}(i,j) + D_j(t)]^{-1}}{\sum_{j' \in R_i^p(t)} [\mathrm{delay}(i,j') + D_{j'}(t)]^{-1}} \times L(p)$$

$$O(p_j) = \frac{[\mathrm{cost}(i,j) + C_j(t)]^{-1}}{\sum_{j' \in R_i^p(t)} [\mathrm{cost}(i,j') + C_{j'}(t)]^{-1}} \times O(p) \tag{13.17}$$

These numbers will be rounded to the closest integer. 
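
The candidate selection and token split at an intermediate node can be sketched as follows. This Python fragment is an illustration under assumptions: the per-neighbor record layout (`link_delay`, `link_cost`, `D_j`, `dD_j`, `C_j`, `stationary`) and the function names are hypothetical, delays and costs are taken to be strictly positive, and rounding uses floor(x + 0.5) as one reading of 'closest integer'. The text does not say how totals are reconciled after rounding, so the sketch does not attempt it.

```python
import math


def nearest_int(x):
    """Round to the closest integer (halves rounded up)."""
    return math.floor(x + 0.5)


def forward_tokens(L_p, O_p, acc_delay, D, neighbors, sender):
    """Return {neighbor: (constraint tokens, optimization tokens)} for the copies of p."""
    def candidates(pool):
        return [j for j, n in pool.items()
                if j != sender
                and acc_delay + n["link_delay"] + n["D_j"] - n["dD_j"] <= D]

    # Stationary neighbors first; transient ones only if none qualifies.
    cand = candidates({j: n for j, n in neighbors.items() if n["stationary"]})
    if not cand:
        cand = candidates(neighbors)
    if not cand:
        return {}                      # all received tokens are discarded

    inv_delay = {j: 1.0 / (neighbors[j]["link_delay"] + neighbors[j]["D_j"]) for j in cand}
    inv_cost = {j: 1.0 / (neighbors[j]["link_cost"] + neighbors[j]["C_j"]) for j in cand}
    sum_delay, sum_cost = sum(inv_delay.values()), sum(inv_cost.values())
    return {j: (nearest_int(inv_delay[j] / sum_delay * L_p),
                nearest_int(inv_cost[j] / sum_cost * O_p)) for j in cand}


neighbors = {
    "a": {"link_delay": 5, "D_j": 5, "dD_j": 0, "link_cost": 2, "C_j": 2, "stationary": True},
    "b": {"link_delay": 10, "D_j": 20, "dD_j": 0, "link_cost": 6, "C_j": 6, "stationary": True},
    "c": {"link_delay": 1, "D_j": 3, "dD_j": 0, "link_cost": 9, "C_j": 9, "stationary": False},
    "k": {"link_delay": 1, "D_j": 1, "dD_j": 0, "link_cost": 1, "C_j": 1, "stationary": True},
}
print(forward_tokens(4, 4, acc_delay=10, D=60, neighbors=neighbors, sender="k"))
```

In the example, neighbor a has one third of b's delay and cost estimates, so it receives three of the four tokens of each type; a copy that ends up with zero tokens of both types would simply not be sent.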

The data structure carried by a probe p is shown in Table 13.3. The last six fields, k, 
path, L(p), O(p), delay(p) and cost(p), are modified as the probe traverses. Tokens are logical 
entities, and only the number of tokens is important: there can be at most L(p) + O(p) new 
probes descending from p, among which probes with constraint tokens choose paths based 
on delay, probes with optimization tokens choose paths based on cost, and probes with both 
types of tokens choose paths based on both delay and cost. 



13.7.8 Bandwidth-constrained routing 

The algorithm shares the same computational structure with the delay-constrained routing 
algorithm. The differences are the metric-dependent token curves and token distribution 
formulas. The token curves are given in Figure 13.48. 
Table 13.3 Data structure carried by a probe p 

Field      Description 
id         System-wide unique identification for the connection request 
s          Source node 
t          Destination node 
D          Delay requirement 
L + O      Total number of tokens 
k          Sender of p 
path       Path p has traversed so far 
L(p)       Number of constraint tokens carried by p 
O(p)       Number of optimization tokens carried by p 
delay(p)   Accumulated delay of the path traversed so far 
cost(p)    Accumulated cost of the path traversed so far 




Figure 13.48 Token curves as functions of B. [Horizontal axis of both panels: bandwidth 
requirement B, marked at $B_s(t) - \Delta B_s(t)$, $B_s(t)$ and $B_s(t) + \Delta B_s(t)$.] 

13.7.9 Forwarding the received tokens 

Suppose a node i receives a probe p with L(p) constraint tokens and O(p) optimization 
tokens. Suppose k is the sender of the probe p. The set $R_i^p(t)$ of candidate neighbors, to 
which i will forward the received tokens, is determined as follows. Define 

$$R_i^p(t) = \{\, j \mid \mathrm{bandwidth}(i,j) \ge B \ \wedge\ B_j(t) + \Delta B_j(t) \ge B,\ j \in V_i^s - \{k\} \,\}$$

If $R_i^p(t) = \emptyset$, we take the transient neighbors into consideration and redefine $R_i^p(t)$ to be 

$$R_i^p(t) = \{\, j \mid \mathrm{bandwidth}(i,j) \ge B \ \wedge\ B_j(t) + \Delta B_j(t) \ge B,\ j \in V_i - \{k\} \,\}$$

If we still have $R_i^p(t) = \emptyset$, all received tokens are invalidated and discarded. If 
$R_i^p(t) \neq \emptyset$, then for every $j \in R_i^p(t)$, i makes a copy of p, denoted as $p_j$. Let $p_j$ have 
$L(p_j)$ constraint tokens and $O(p_j)$ optimization tokens. $L(p_j)$ is determined based on the 
observation that a probe sent toward the direction with a larger residual bandwidth should 
have more L tokens. These parameters are now calculated as 

$$L(p_j) = \frac{\min[\mathrm{bandwidth}(i,j),\ B_j(t)]}{\sum_{j' \in R_i^p(t)} \min[\mathrm{bandwidth}(i,j'),\ B_{j'}(t)]} \times L(p)$$

$$O(p_j) = \frac{[\mathrm{cost}(i,j) + C_j(t)]^{-1}}{\sum_{j' \in R_i^p(t)} [\mathrm{cost}(i,j') + C_{j'}(t)]^{-1}} \times O(p) \tag{13.18}$$
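
For the bandwidth-constrained case, the same forwarding step can be sketched with the bandwidth test and the residual-bandwidth weights. As before, this Python fragment is only an illustration: the record fields (`link_bw`, `B_j`, `dB_j`, `link_cost`, `C_j`, `stationary`) and the function name are hypothetical, bandwidths and costs are assumed positive, and the rounding rule is not stated in the text for this case, so floor(x + 0.5) is an assumption carried over from the delay case.

```python
import math


def forward_tokens_bw(L_p, O_p, B, neighbors, sender):
    """Return {neighbor: (constraint tokens, optimization tokens)} for requirement B."""
    def candidates(pool):
        return [j for j, n in pool.items()
                if j != sender and n["link_bw"] >= B and n["B_j"] + n["dB_j"] >= B]

    # Stationary neighbors first; transient ones only if none qualifies.
    cand = candidates({j: n for j, n in neighbors.items() if n["stationary"]})
    if not cand:
        cand = candidates(neighbors)
    if not cand:
        return {}                      # all received tokens are discarded

    # Constraint tokens follow the residual bandwidth min[bandwidth(i,j), B_j(t)].
    resid = {j: min(neighbors[j]["link_bw"], neighbors[j]["B_j"]) for j in cand}
    inv_cost = {j: 1.0 / (neighbors[j]["link_cost"] + neighbors[j]["C_j"]) for j in cand}
    sum_resid, sum_cost = sum(resid.values()), sum(inv_cost.values())
    return {j: (math.floor(resid[j] / sum_resid * L_p + 0.5),
                math.floor(inv_cost[j] / sum_cost * O_p + 0.5)) for j in cand}


neighbors_bw = {
    "a": {"link_bw": 10, "B_j": 6, "dB_j": 0, "link_cost": 2, "C_j": 2, "stationary": True},
    "b": {"link_bw": 4, "B_j": 2, "dB_j": 1, "link_cost": 6, "C_j": 6, "stationary": True},
}
print(forward_tokens_bw(4, 4, B=3, neighbors=neighbors_bw, sender="k"))
```

Here neighbor b qualifies only because its bandwidth estimate plus its variation reaches the requirement; with a residual bandwidth of 2 against 6 for a, it receives one of the four constraint tokens.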



13.7.10 Performance example 

Three performance metrics are analysed by simulation in Chen and Nahrstedt [85]: 
(1) success ratio = number of connections accepted/total number of connection requests; 
(2) avg msg overhead = total number of messages sent/total number of connection requests; 
and (3) avg path cost = total cost of all established paths/number of established paths. 

Sending a probe over a link is counted as one message. Hence, for a probe that has 
traversed a path of l hops, l messages are counted. The network topology used in simulation is 
randomly generated. Forty nodes are placed randomly within a 15 x 15 m area. The 
transmission range of a node is bounded by a circle with a radius of 3 m. A link is added between 
two nodes that are in the transmission range of each other. The average degree of a node is 3.4. 

The source node, the destination node, and parameter D of each connection request are 
randomly generated. D is uniformly distributed in the range [30, 160] ms. The cost of 
each link is uniformly distributed in [0, 200]. Each link (j, k) is associated with two delay 
values: delay-old, denoted as delay(j,k), and delay-new, denoted as delay'(j,k); delay(j,k) is 
the last delay value advertised by the link to the network, while delay'(j,k) is the actual 
delay of the link at the time of routing. Parameter delay(j,k) is uniformly distributed in 
[0, 50] ms, while delay'(j,k) is uniformly distributed in 
$[(1 - \xi)\,\mathrm{delay}(j,k),\ (1 + \xi)\,\mathrm{delay}(j,k)]$, 
where $\xi$ is a simulation parameter, called the imprecision rate, defined as 

$$\xi = \sup\left\{ \frac{\left|\mathrm{delay}'(j,k) - \mathrm{delay}(j,k)\right|}{\mathrm{delay}(j,k)} \right\}$$
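
The link model and the three metrics are simple enough to sketch directly. The Python fragment below draws the advertised delay, the actual delay and the cost of one link from the stated uniform ranges and computes the metrics from raw counts; the function names and the use of a seeded `random.Random` are choices of this sketch, and the counts in the usage line are made-up inputs, not results from [85].

```python
import random


def sample_link(xi, rng):
    """Draw (advertised delay, actual delay, cost) for one link, delays in ms."""
    delay_old = rng.uniform(0.0, 50.0)
    # The actual delay deviates from the advertised one by at most a fraction xi.
    delay_new = rng.uniform((1.0 - xi) * delay_old, (1.0 + xi) * delay_old)
    cost = rng.uniform(0.0, 200.0)
    return delay_old, delay_new, cost


def summarize(num_requests, established_costs, messages_sent):
    """Success ratio, average message overhead and average path cost."""
    accepted = len(established_costs)
    return {
        "success_ratio": accepted / num_requests,
        "avg_msg_overhead": messages_sent / num_requests,
        "avg_path_cost": sum(established_costs) / accepted if accepted else None,
    }


rng = random.Random(1)
print(sample_link(0.5, rng))
print(summarize(num_requests=10, established_costs=[4.0, 6.0], messages_sent=30))
```

Every sampled pair satisfies the definition of the imprecision rate: the relative deviation of the actual delay from the advertised delay never exceeds the chosen rate.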

Three algorithms are simulated: the flooding algorithm, the token-based probing algorithm 
(TBP) and the shortest-path algorithm (SP). The flooding algorithm is equivalent 
to TBP with an infinite number of constraint tokens and zero optimization tokens. It floods 
routing messages from the source to the destination. Each routing message accumulates 
the delay of the path it has traversed, and the message proceeds only if the accumulated 
delay does not exceed the delay bound. The system parameters of the TBP algorithm are 
$\Phi = 4$, $\theta = 1.5$, $\Omega = 3$. The values are obtained by extensive simulation runs. The SP 
algorithm maintains a state vector at each node i by a distance-vector protocol. 

Figure 13.49 Success ratio [imprecision rate: (a) 5% and (b) 50%]. (Reproduced by 
permission of IEEE [85].) 



Comparison of the performance results shown in Figures 13.49-13.52 demonstrates 
the advantages of TBP. The protocol presented in this section is a brief interpretation 
of the Ticket Based Protocol presented in Chen and Nahrstedt [85], with the terminology 
slightly modified to match that of the rest of the book. The problem of QoS routing in wireline 
and wireless networks has been attracting much attention in both academia and industry. 
For more information see References [86-88]. A comprehensive overview of the literature 
is given in Chen and Nahrstedt [88]. The problem will be revisited in more detail in 
Chapter 21. 

Figure 13.50 Messages overhead (imprecision rate: 10%). (Reproduced by permission of 
IEEE [85].) 

Figure 13.51 Cost per established path (imprecision rate: 5%). (Reproduced by permission 
of IEEE [85].) 

Figure 13.52 Cost per established path (imprecision rate: 50%). (Reproduced by 
permission of IEEE [85].) 



14 

Sensor Networks 



14.1 INTRODUCTION 

A sensor network is composed of a large number of sensor nodes, which are densely 
deployed either inside the phenomenon they are observing, or very close to it. Most of the 
time the nodes are randomly deployed in inaccessible terrains or disaster relief operations. 
This also means that sensor network protocols and algorithms must possess self-organizing 
capabilities. Another unique feature of sensor networks is the cooperative effort of sensor 
nodes. Sensor nodes are fitted with an on-board processor. Instead of sending the raw 
data to the nodes responsible for the fusion, sensor nodes use their processing abilities to 
locally carry out simple computations and transmit only the required and partially processed 
data. 

The above described features ensure a wide range of applications for sensor networks. 
Some of the application areas are health, military and security. For example, the 
physiological data about a patient can be monitored remotely by a doctor. While this is more 
convenient for the patient, it also allows the doctor to better understand the patient's 
current condition. 
Sensor networks can also be used to detect foreign chemical agents in the air and the water. 
They can help to identify the type, concentration and location of pollutants. In essence, 
sensor networks will provide the end user with intelligence and a better understanding of 
the environment. It is expected that, in future, wireless sensor networks will be an integral 
part of our lives, even more than the present-day personal computers. 

Realization of these and other sensor network applications requires wireless ad hoc 
networking techniques. Although many protocols and algorithms have been proposed for 
traditional wireless ad hoc networks, as described in Chapter 13, they are not well suited 
to the unique features and application requirements of sensor networks. To illustrate this 
point, the differences between sensor networks and ad hoc networks (see Chapter 13, and 
references therein) are outlined below [1-5]: 

• sensor nodes are densely deployed; 

• sensor nodes are prone to failure; 

• the number of sensor nodes in a sensor network can be several orders of magnitude higher 
than the nodes in an ad hoc network; 

• the topology of a sensor network changes very frequently; 

• sensor nodes mainly use broadcast communication paradigms whereas most ad hoc 
networks are based on point-to-point communications; 

• sensor nodes are limited in power, computational capacities and memory; 

• sensor nodes may not have global identification (ID) because of the large amount of 
overhead and large number of sensors. 

One of the most important constraints on sensor nodes is the low power consumption 
requirement. Sensor nodes carry limited, quite often irreplaceable, power sources. Therefore, 
while traditional networks aim to achieve high QoS provisions, sensor network protocols 
must focus primarily on power conservation. They must have inbuilt trade-off mechanisms 
that give the end user the option of prolonging network lifetime at the cost of lower 
throughput or higher transmission delay. This problem will be the focus of this chapter. 

Sensor networks may consist of many different types of sensors such as seismic, low 
sampling rate magnetic, thermal, visual, infrared, acoustic and radar. These sensors are 
able to monitor a wide variety of ambient conditions, including the current characteristics 
of an object (such as its speed, direction and size), temperature, humidity, vehicular 
movement, lighting conditions, pressure, soil makeup, noise levels, the presence or absence 
of certain kinds of objects, and mechanical stress levels on attached objects. 

Sensor nodes can be used for continuous sensing, event detection, event ID, location 
sensing, and local control of actuators. The concept of micro-sensing and wireless connec- 
tion of these nodes promises many new application areas. Usually these applications are 
categorized into military, environment, health, home and other commercial areas. It is pos- 
sible to expand this classification with more categories such as space exploration, chemical 
processing and disaster relief. 

Wireless sensor networks can be an integral part of military command, control, com- 
munications, computing, intelligence, surveillance, reconnaissance and targeting (C4ISRT) 
systems. They are used for monitoring friendly forces, equipment and ammunition and
battlefield surveillance (Figure 14.1). Sensor networks can be deployed in critical terrains, and
some valuable, detailed and timely intelligence about the opposing forces and terrain can 
be gathered within minutes before the opposing forces can intercept them. Sensor networks 
can also be incorporated into the guidance systems of intelligent ammunition. 

Sensor networks deployed in the friendly region and used as a chemical or biological 
warning system can provide friendly forces with critical reaction time, which decreases ca- 
sualty numbers drastically. Environmental applications of sensor networks include tracking 
the movements of birds, small animals and insects; monitoring environmental conditions 
that affect crops and livestock; irrigation; macroinstruments for large-scale Earth monitoring 
and planetary exploration; chemical/biological detection; precision agriculture; biological, 
Earth and environmental monitoring in marine, soil and atmospheric contexts; forest fire
detection; meteorological or geophysical research; flood detection; bio-complexity mapping
of the environment; and pollution study.

Figure 14.1 Battlefield surveillance.

Health applications of sensor networks include providing interfaces for the disabled,
integrated patient monitoring, diagnostics, drug administration in hospitals, monitoring
the movements and internal processes of insects or other small animals, telemonitoring
of human physiological data, and tracking and monitoring doctors and patients inside a
hospital. More details on sensor network applications can be found in References [6-64].



14.2 SENSOR NETWORKS PARAMETERS 

A sensor network design is influenced by many parameters, which include fault tolerance, 
scalability, production costs, operating environment, sensor network topology, hardware 
constraints, transmission media and power consumption. The failure of sensor nodes should 
not affect the overall task of the sensor network. This is the reliability or fault tolerance 
issue. Fault tolerance is the ability to sustain sensor network functionalities without any 
interruption due to sensor node failures [25, 49]. 

The protocols and algorithms may be designed to address the level of fault tolerance 
required by the sensor networks. If the environment where the sensor nodes are deployed 
has little interference, then the protocols can be more relaxed. For example, if sensor nodes 
are being deployed in a house to keep track of humidity and temperature levels, the fault 
tolerance requirement may be low since this kind of sensor network is not easily damaged
or interfered with by environmental noise. On the other hand, if sensor nodes are being 
deployed in a battlefield for surveillance and detection, then the fault tolerance has to be 
high because the sensed data are critical and sensor nodes can be destroyed by hostile actions. 
As a result, the fault tolerance level depends on the application of the sensor networks, and
the schemes must be developed with this in mind.

The number of sensor nodes deployed in studying a phenomenon may be in the order of
hundreds or thousands. The networks must be able to work with this number of nodes. The
density can range from a few sensor nodes to a few hundred sensor nodes in a region, which
can be less than 10 m in diameter [12]. The node density depends on the application in which
the sensor nodes are deployed. For machine diagnosis applications, the node density is around
300 sensor nodes in a 5 × 5 m² region, and the density for the vehicle tracking application
is around 10 sensor nodes per region [51]. In some cases, the density can be as high as
20 sensor nodes/m³ [51]. A home may contain around two dozen home appliances containing
sensor nodes [42], but this number will grow if sensor nodes are embedded into furniture and
other miscellaneous items. For habitat monitoring applications, the number of sensor nodes
ranges from 25 to 100 per region. The density will be extremely high when a person normally
carrying hundreds of sensor nodes, which are embedded in eye glasses, clothing, shoes, watch,
jewelry and the human body, is sitting inside a stadium watching a basketball, football or
baseball game.

As a consequence, the cost of each sensor node has to be kept low. The state-of-the-art 
technology allows a Bluetooth radio system to be less than US$10 [46]. Also, the price 
of a PicoNode is targeted to be less than US$1 [45]. The cost of a sensor node should 
be much less than US$1 in order for the sensor network to be feasible [45]. A Bluetooth
radio, which is known to be a low-cost device, is thus still 10 times more expensive
than the targeted price for a sensor node. Note that a sensor node also has some additional
units such as sensing and processing units. In addition, it may be equipped with a location- 
finding system, mobilizer or power generator depending on the applications of the sensor 
networks. As a result, the cost of a sensor node is a very challenging issue given the amount 
of functionality with a price of much less than a dollar. 

An illustration of sensor network topology is shown in Figure 14.2. Deploying a high
number of nodes densely requires careful handling of topology maintenance. Issues related 
to topology maintenance and change can be classified in three phases [66-77]. 

14.2.1 Pre-deployment and deployment phase 

Sensor nodes can be either thrown en masse or placed one by one in the sensor field. They
can be deployed by dropping from a plane, delivering in an artillery shell, rocket or missile,
throwing by a catapult (from a ship, etc.), placing in a factory or placing one by one either by
a human or a robot. Although the sheer number of sensors and their unattended deployment 
usually preclude placing them according to a carefully engineered deployment plan, the 
schemes for initial deployment must reduce the installation cost, eliminate the need for 
any preorganization and preplanning, increase the flexibility of arrangement and promote 
self-organization and fault tolerance. 

14.2.2 Post-deployment phase 

After deployment, topology changes are due to changes in sensor nodes' position [27, 34],
reachability (due to jamming, noise, moving obstacles, etc.), available energy, malfunctioning
and task details. Sensor nodes may be statically deployed. However, device failure is a
regular or common event due to energy depletion or destruction. It is also possible to have
sensor networks with highly mobile nodes. In addition, sensor nodes and the network
experience varying task dynamics, and they may be a target for deliberate jamming. Therefore,
all these factors cause frequent changes in sensor network topologies after deployment.

Figure 14.2 Sensor network topology.



14.2.3 Re-deployment of additional nodes phase 

Additional sensor nodes can be re-deployed at any time to replace the malfunctioning nodes 
or due to changes in task dynamics. Addition of new nodes poses a need to re-organize the 
network. Coping with frequent topology changes in an ad hoc network that has myriad 
nodes and very stringent power consumption constraints requires special routing protocols. 

In a multihop sensor network, communicating nodes are linked by a wireless medium. 
One option for radio links is the use of industrial, scientific and medical (ISM) bands, listed 
in Table 14.1, which offer license-free communication in most countries. 

Some of these frequency bands are already being used for communication in cordless 
phone systems and wireless local area networks (WLANs). For sensor networks, a small-sized,
low-cost, ultralow power transceiver is required. According to Porret et al. [43], certain
hardware constraints and the trade-off between antenna efficiency and power consumption 
limit the choice of a carrier frequency for such transceivers to the ultrahigh frequency range. 
They also propose the use of the 433 MHz ISM band in Europe and the 915 MHz ISM 
band in North America. The transceiver design issues in these two bands are addressed in 
References [16, 35]. 



14.3 SENSOR NETWORKS ARCHITECTURE 

The sensor nodes are usually scattered in a sensor field as shown in Figure 14.2. Each of 
these scattered sensor nodes has the capabilities to collect data and route data back to the 
sink and the end users. Data are routed back to the end user by a multihop infrastructureless
architecture through the sink, as shown in Figure 14.2. The sink may communicate with the
task manager node via Internet or satellite.

Table 14.1 Frequency bands available for ISM applications

Frequency band            Center frequency

6765-6795 kHz             6780 kHz
13 553-13 567 kHz         13 560 kHz
26 957-27 283 kHz         27 120 kHz
40.66-40.70 MHz           40.68 MHz
433.05-434.79 MHz         433.92 MHz
902-928 MHz               915 MHz
2400-2500 MHz             2450 MHz
5725-5875 MHz             5800 MHz
24-24.25 GHz              24.125 GHz
61-61.5 GHz               61.25 GHz
122-123 GHz               122.5 GHz
244-246 GHz               245 GHz

Figure 14.3 The sensor networks protocol stack.

The protocol stack used by the sink and all sensor nodes is given in Figure 14.3. This 
protocol stack combines power and routing awareness, integrates data with networking 
protocols, communicates power efficiently through the wireless medium and promotes 
cooperative efforts of sensor nodes. The protocol stack consists of the application layer, 
transport layer, network layer, data link layer, physical layer, power management plane, 
mobility management plane and task management plane [81-94]. Depending on the sensing 
tasks, different types of application software can be built and used on the application layer. 
The transport layer helps to maintain the flow of data if the sensor networks application 
requires it. The network layer takes care of routing the data supplied by the transport layer. 
Since the environment is noisy and sensor nodes can be mobile, the MAC protocol must
be power aware and able to minimize collisions with neighbors' broadcasts. The physical
layer addresses the need for simple but robust modulation, transmission and receiving
techniques. In addition, the power, mobility and task management planes monitor the power,
movement and task distribution among the sensor nodes. These planes help the sensor nodes
coordinate the sensing task and lower the overall power consumption.



14.3.1 Physical layer 

The physical layer is responsible for frequency selection, carrier frequency generation, 
signal detection, modulation and data encryption. The choice of a good modulation scheme 
is critical for reliable communication in a sensor network. Binary and M-ary modulation 
schemes are compared in Shih et al. [51]. While an M-ary scheme can reduce the transmit 
on-time by sending multiple bits per symbol, it results in complex circuitry and increased 
radio power consumption. These tradeoff parameters are formulated in Shih et al. [51] 
and it is concluded that under start-up power dominant conditions, the binary modulation 
scheme is more energy-efficient. Hence, M-ary modulation gains are significant only for 
low start-up power systems. 



14.3.2 Data link layer 

The data link layer is responsible for the multiplexing of data streams, data frame detection, 
medium access and error control. 



14.3.2.1 Medium access control

The MAC protocol in a wireless multihop self-organizing sensor network must achieve two 
goals [61, 62, 138]. The first is the creation of the network infrastructure. Since thousands 
of sensor nodes are densely scattered in a sensor field, the MAC scheme must establish 
communication links for data transfer. This forms the basic infrastructure needed for wireless 
communication hop by hop and gives the sensor network self-organizing ability. The second 
objective is to fairly and efficiently share communication resources between sensor nodes. 
Traditional MAC schemes, described in Chapter 5, are summarized in Figure 14.4 and 
Table 14.2. 

Existing MAC protocols need considerable modification before they can be applied in sensor
networks. In a cellular system, the base stations form a wired backbone. A mobile node is
only a single hop away from the nearest base station. The primary goal of the MAC protocol in
such systems is the provision of high QoS and bandwidth efficiency. Power conservation
assumes only secondary importance as base stations have unlimited power supply and the
mobile user can replenish exhausted batteries in the handset. Hence, medium access is
focused on dedicated resource assignment strategy. Such an access scheme is impractical
for sensor networks as there is no central controlling agent like the base station.

Figure 14.4 Classification of MAC schemes.

Table 14.2 Categorization of MAC protocols

Category              Resource sharing mode      Application domain       Disadvantages

Dedicated             Pre-determined fixed       Continuous               Inefficient for bursty
assignment or         allocation                 traffic/provides         traffic
fixed allocation                                 bounded delay

Demand based          On demand or user          Variable rate and        Overhead and delay
                      request                    multimedia traffic       due to reservation
                                                                          process

Random access or      Contention when            Bursty traffic           Inefficient for delay-
contention-based      transmission packets                                sensitive traffic
                      are available

The mobile ad hoc networks (MANET), discussed in Chapter 13, are probably the closest 
peers to the sensor networks. The MAC protocol in a MANET has the task of forming the 
network infrastructure and maintaining it in the face of mobility. Hence, the primary goal 
is the provision of high QoS under mobile conditions. Although the nodes are portable
battery-powered devices, their batteries can be replaced by the user and hence power
consumption is only of secondary importance.

Therefore, the MAC protocol for sensor networks must have built-in power conservation, 
mobility management and failure recovery strategies. Although many schemes for medium 
access have been proposed for MANETs (see Chapter 13), the design of an efficient MAC 
scheme for the new regime of sensor networks is still an open research issue. The fixed 
allocation and random access versions of medium access have been discussed in References 
[54, 60]. Demand-based MAC schemes may be unsuitable for sensor networks due to their
large messaging overhead and link set-up delay. Power conservation is achieved by the 
use of power saving operation modes and by preferring time-outs to acknowledgements, 
wherever possible. 

Since radios must be turned off during idling for precious power savings, the MAC
scheme should include a variant of TDMA [39]. Such a medium access mechanism is
presented in Sohrabi et al. [54]. Further, contention-based channel access is deemed
unsuitable due to its requirement to monitor the channel at all times. It must be noted,
however, that random medium access can also support power conservation, as in the IEEE 802.11
standard for WLANs, by turning off radios depending on the status of the network allocation
vector. Constant listening times and adaptive rate control schemes can also help achieve
energy efficiency in random access schemes for sensor networks [60].

14.3.2.2 Self-organizing MAC for sensor networks (SMACS)

The SMACS protocol [54] achieves network start-up and link-layer organization, and the
eavesdrop-and-register (EAR) algorithm enables seamless connection of mobile nodes in
a sensor network. SMACS is a distributed infrastructure-building protocol which enables
nodes to discover their neighbors and establish transmission/reception schedules for
communication without the need for any local or global master nodes. In this protocol, the
neighbor discovery and channel assignment phases are combined so that, by the time nodes
hear all their neighbors, they will have formed a connected network. A communication
link consists of a pair of time slots operating at a randomly chosen, but fixed frequency
(or frequency hopping sequence). This is a feasible option in sensor networks, since the
available bandwidth can be expected to be much higher than the maximum data rate for
sensor nodes. Such a scheme avoids the necessity for network-wide synchronization, although
communicating neighbors in a subnet need to be time-synchronized. Power conservation is
achieved using a random wake-up schedule during the connection phase and by turning the
radio off during idle time slots. The process is based on using an ultralow power radio to
wake up the neighbors. This second radio uses much less power via either a low duty cycle
or hardware design. Usually this second radio can only transmit a busy tone. This broadcast
tone should not disrupt any on-going data transmission, for example by using a different
channel.

The amount of time and power needed to wake up (start up) a radio is not negligible and
thus just turning off the radio whenever it is not being used is not necessarily efficient. The
energy characteristics of the start-up time should also be taken into account when designing
the size of the data link packets.
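
The effect of start-up energy on packet sizing can be illustrated with a minimal sketch.
It assumes a simple model in which every packet pays one fixed start-up cost plus a
transmission cost proportional to its length. The function name and all numerical values
(data rate, transmit power, start-up power and start-up time) are hypothetical and chosen
only for illustration; they are not taken from the text or from any particular radio.

```python
def energy_per_bit(payload_bits, rate_bps=1e6, p_tx_w=0.08,
                   p_start_w=0.06, t_start_s=450e-6):
    """Energy per payload bit (joules) for one packet, including radio start-up.

    All default values are illustrative assumptions, not measured figures.
    """
    t_on = payload_bits / rate_bps     # time the transmitter is actually sending
    e_start = p_start_w * t_start_s    # fixed cost paid once per wake-up
    e_tx = p_tx_w * t_on               # cost proportional to the packet length
    return (e_start + e_tx) / payload_bits


if __name__ == "__main__":
    # Short packets are dominated by the start-up cost; longer ones amortize it.
    for bits in (100, 1_000, 10_000):
        print(f"{bits:>6} bits: {energy_per_bit(bits) * 1e9:.1f} nJ/bit")
```

Under these assumptions the energy per bit falls as the packet grows, because the fixed
start-up cost is spread over more payload bits. A real design would also have to weigh
this against the higher loss probability of long packets, which the sketch ignores.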

The EAR protocol [49] attempts to offer continuous service to the mobile nodes under 
both mobile and stationary conditions. Here, the mobile nodes assume full control of 
the connection process and also decide when to drop connections, thereby minimizing 
messaging overhead. The EAR is transparent to the SMACS, so that the SMACS is 
functional until the introduction of mobile nodes into the network. In this model, the 
network is assumed to be mainly static, i.e. any mobile node has a number of stationary 
nodes in its vicinity. A drawback of such a time-slot assignment scheme is the possibility 
that members already belonging to different subnets might never get connected. For more 
details see Sohrabi et al. [54].

A carrier sense multiple access (CSMA)-based MAC scheme for sensor networks is presented
in Woo and Culler [60]. Traditional CSMA-based schemes are deemed inappropriate as they
all make the fundamental assumption of stochastically distributed traffic and tend to
support independent point-to-point flows. On the contrary, the MAC protocol for sensor
networks must be able to support variable, but highly correlated and dominantly periodic
traffic. Any CSMA-based medium access scheme has two important components, the listening
mechanism (sensing) and the backoff scheme. Based on simulations in References [60-64],
constant listen periods are energy-efficient and the introduction of random delay
(p-persistence) provides robustness against repeated collisions. Fixed window and binary
exponential decrease backoff schemes are recommended to maintain proportional fairness in
the network. A phase change at the application level is also advocated to get over any
capturing effects. It is proposed in this work that the ratio of energy consumed to
throughput can serve as a good indicator of energy efficiency.



14.3.3 Network layer 

The ad hoc routing techniques, already discussed in Chapter 13, do not usually fit the
requirements of the sensor networks. The networking layer of sensor networks is usually
designed according to the following principles. First of all, power efficiency is always an
important design parameter, see Figure 14.5. Sensor networks are mostly data centric. Data
aggregation is useful only when it does not hinder the collaborative effort of the sensor
nodes. An ideal sensor network has attribute-based addressing and location awareness.

Figure 14.5 Multihop routing due to limited transmission range.

One of the following approaches can be used to select an energy efficient route.

(1) Maximum available power (PA) route - the route that has the maximum total available
power is preferred.

(2) Minimum energy (ME) route - the route that consumes the minimum energy to transmit
the data packets between the sink and the sensor node is preferred.

(3) Minimum hop (MH) route - the route that takes the minimum number of hops to reach the
sink is preferred. Note that the ME scheme selects the same route as the MH scheme when
the same amount of energy is used on every link. Therefore, when nodes broadcast with
the same power level without any power control, MH is equivalent to ME.

(4) Maximum minimum PA node route - the route along which the minimum PA is larger
than the minimum PAs of the other routes is preferred. This scheme precludes the
risk of using up a sensor node with low PA much earlier than the others merely because
it lies on a route whose other nodes have very high PAs.
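
The four selection criteria above can be compared side by side in a short sketch. The
route table, its field names (`node_pa`, `link_energy`) and all numbers are hypothetical
and serve only to show how each criterion can pick a different route.

```python
# Each candidate route from a sensor node to the sink is described by the available
# power (PA) of its intermediate nodes and the energy cost of each of its links.
# All values are hypothetical, in arbitrary units.
ROUTES = {
    "route_1": {"node_pa": [4, 3],    "link_energy": [2, 2, 2]},
    "route_2": {"node_pa": [2],       "link_energy": [3, 2]},
    "route_3": {"node_pa": [2, 2, 2], "link_energy": [1, 1, 1, 1]},
    "route_4": {"node_pa": [7, 1],    "link_energy": [2, 2, 3]},
}


def max_total_pa(routes):
    """(1) Route whose nodes have the largest total available power."""
    return max(routes, key=lambda r: sum(routes[r]["node_pa"]))


def min_energy(routes):
    """(2) Route that consumes the least total energy over its links."""
    return min(routes, key=lambda r: sum(routes[r]["link_energy"]))


def min_hop(routes):
    """(3) Route with the fewest hops (one hop per link)."""
    return min(routes, key=lambda r: len(routes[r]["link_energy"]))


def max_min_pa(routes):
    """(4) Route whose weakest node has the largest available power."""
    return max(routes, key=lambda r: min(routes[r]["node_pa"]))


if __name__ == "__main__":
    for criterion in (max_total_pa, min_energy, min_hop, max_min_pa):
        print(criterion.__name__, "->", criterion(ROUTES))
```

In this example route_4 has the largest total PA but contains a nearly exhausted node,
which is exactly the situation criterion (4) is meant to avoid. If every link energy is
set to the same value, `min_energy` and `min_hop` return the same route, in line with the
note under item (3).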

14.3.3.1 Data-centric routing

In data-centric routing, the interest dissemination is performed to assign the sensing tasks to 
the sensor nodes. There are two approaches used for interest dissemination: sinks broadcast 
the interest [27], and sensor nodes broadcast an advertisement for the available data [23] 
and wait for a request from the interested sinks. For an illustration see Figure 14.6.

Figure 14.6 Broadcasting the interest (are there any soldiers in the area?) and advertising
(there are soldiers in the area).




Figure 14.7 The data aggregation. 

The data-centric routing requires attribute-based naming [49, 65]. For attribute-based 
naming, the users are more interested in querying an attribute of the phenomenon, rather 
than querying an individual node. For instance 'the areas where the moisture is over 70%'
is a more common query than 'the moisture read by a certain node'. The attribute-based
naming is used to carry out queries using the attributes of the phenomenon. The attribute- 
based naming also makes broadcasting, attribute-based multicasting, geo-casting and 
any-casting important for sensor networks. 
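
A minimal sketch of such an attribute-based query follows. The `Reading` record, its
fields and the sample values are hypothetical; the point is only that the query names an
attribute and a condition, not a node address.

```python
from dataclasses import dataclass


@dataclass
class Reading:
    node_id: int      # hypothetical local identifier, never used by the query
    area: str         # region in which the node is located
    moisture: float   # relative moisture in percent


READINGS = [
    Reading(1, "north", 82.0),
    Reading(2, "north", 64.5),
    Reading(3, "east", 71.3),
    Reading(4, "south", 55.0),
]


def areas_where(readings, attribute, predicate):
    """Return the areas in which at least one reading satisfies the predicate."""
    return {r.area for r in readings if predicate(getattr(r, attribute))}


if __name__ == "__main__":
    # 'The areas where the moisture is over 70%'
    print(areas_where(READINGS, "moisture", lambda m: m > 70))
```

The answer is a set of areas, so the user never needs to know which individual nodes
produced the matching readings.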

Data aggregation is a technique used to solve the implosion and overlap problems in
data-centric routing [23]. In this technique, a sensor network is usually perceived as a
reverse multicast tree, as shown in Figure 14.7, where the sink asks the sensor nodes to
report the ambient condition of the phenomena. Data coming from multiple sensor nodes
are aggregated if they are about the same attribute of the phenomenon when they reach
the same routing node on the way back to the sink. Data aggregation can be perceived as a
set of automated methods of combining the data that come from many sensor nodes into
a set of meaningful information [22]. In this respect, data aggregation is known as data
fusion [23]. Also, care must be taken when aggregating data, because the specifics of the
data, e.g. the locations of reporting sensor nodes, should not be left out. Such specifics may
be needed by certain applications.
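
As a minimal sketch of this idea, the function below combines the reports that arrive at
one routing node into a single summary per attribute while keeping the reporting
locations. The report format (keys `attribute`, `value`, `location`) and the chosen
summary statistics are assumptions made for illustration only.

```python
from collections import defaultdict


def aggregate(reports):
    """Combine reports about the same attribute into one summary per attribute.

    Each report is a dict with the hypothetical keys 'attribute', 'value' and
    'location'. Locations are preserved so that applications needing them can
    still recover where each value was sensed.
    """
    grouped = defaultdict(list)
    for report in reports:
        grouped[report["attribute"]].append(report)

    summary = {}
    for attribute, group in grouped.items():
        values = [r["value"] for r in group]
        summary[attribute] = {
            "count": len(values),
            "mean": sum(values) / len(values),
            "max": max(values),
            "locations": [r["location"] for r in group],
        }
    return summary


if __name__ == "__main__":
    incoming = [
        {"attribute": "temperature", "value": 20.0, "location": (0, 0)},
        {"attribute": "temperature", "value": 22.0, "location": (1, 0)},
        {"attribute": "humidity", "value": 60.0, "location": (0, 1)},
    ]
    print(aggregate(incoming))
```

Three incoming packets leave the routing node as two summaries, which is the traffic
reduction aggregation aims for; the retained location list is the price paid for not
discarding specifics.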

14.3.3.2 Internetworking 

One other important function of the network layer is to provide internetworking with external 
networks such as other sensor networks, command and control systems and the Internet. In 
one scenario, the sink nodes can be used as a gateway to other networks. Another option is 
creating a backbone by connecting sink nodes together and making this backbone access 
other networks via a gateway. 

14.3.3.3 Flooding and gossiping 

Flooding has already been described in Chapters 7 and 13 as a technique used to disseminate
information across a network. The drawbacks are [23]: (1) implosion, when duplicated
messages are sent to the same node; (2) overlap, when two or more nodes that share the same
observing region sense the same stimuli at the same time, so that neighbor
nodes receive duplicated messages; and (3) resource blindness, not taking into account the
available energy resources. Control of the energy consumption is of paramount importance
in WSNs; a promiscuous routing technique such as flooding wastes energy unnecessarily.

Gossiping is a variation of flooding attempting to correct some of its drawbacks [21]. 
Nodes do not indiscriminately broadcast but instead send a packet to a randomly selected 
neighbor who, once it receives the packet, repeats the process. It is not as simple to implement 
as the flooding mechanism and it takes longer for the propagation of messages across the 
network. 
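
The contrast between the two mechanisms can be made concrete with a small simulation
sketch. The six-node topology and the function names are hypothetical, and the model is
deliberately simple: flooding lets every node forward the packet once to all its
neighbors, whereas gossiping hands the packet to one randomly chosen neighbor per step.

```python
import random

# Hypothetical 6-node topology given as a symmetric adjacency list.
TOPOLOGY = {
    "A": ["B", "C"],
    "B": ["A", "C", "D"],
    "C": ["A", "B", "D", "E"],
    "D": ["B", "C", "F"],
    "E": ["C", "F"],
    "F": ["D", "E"],
}


def flood(graph, source):
    """Every node forwards the packet once to all of its neighbors.

    Returns (deliveries, duplicates): the number of packet copies delivered and
    how many of them reached a node that already had the packet (implosion).
    """
    have = {source}
    frontier = [source]
    deliveries = duplicates = 0
    while frontier:
        next_frontier = []
        for node in frontier:
            for neighbor in graph[node]:
                deliveries += 1
                if neighbor in have:
                    duplicates += 1
                else:
                    have.add(neighbor)
                    next_frontier.append(neighbor)
        frontier = next_frontier
    return deliveries, duplicates


def gossip(graph, source, rng, max_steps=1000):
    """The packet is passed to one randomly selected neighbor at each step.

    Returns the number of steps taken until every node has seen the packet,
    or max_steps if the limit is reached first.
    """
    seen = {source}
    node = source
    steps = 0
    while len(seen) < len(graph) and steps < max_steps:
        node = rng.choice(graph[node])
        seen.add(node)
        steps += 1
    return steps


if __name__ == "__main__":
    print("flooding (deliveries, duplicates):", flood(TOPOLOGY, "A"))
    print("gossiping steps:", gossip(TOPOLOGY, "A", random.Random(1)))
```

In this topology flooding delivers more copies than there are nodes to reach, which is
the implosion problem, while gossiping sends one copy per step but needs a number of
sequential steps that varies with the random choices made.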

Data funneling by data aggregation concentrates, i.e. funnels, the packet flow into a
single stream from the group of sensors to the sink. It reduces (compresses) the data by 
taking advantage of the fact that the destination is not that interested in a particular order 
of how the data packets arrive. 

In the setup phase, the controller divides the sensing area into regions and performs a 
directional flood towards each region. When the packet reaches the region the first receiving 
node becomes a border node and modifies the packet (adds fields) for route cost estimations
within the region. Border nodes flood the region with modified packets. Sensor nodes in the 
region use cost information to schedule which border nodes to use. 

In the data communication phase, when a sensor has data it uses the schedule to choose 
the border node that is to be used. It then waits for a time inversely proportional to the 
number of hops from the border. Along the way to the border node, the data packets are joined
together until they reach the border node. The border node collects all packets and then sends
one packet with all the data back to the controller. These steps are illustrated in Figure 14.8.







Figure 14.8 Data funneling: (a) setup phase; (b) data communication phase. (Legend:
controller, sensors, border nodes.)

14.3.3.4 Sensor protocols for information via negotiation (SPIN) 

A family of adaptive protocols called SPIN [23] is designed to address the deficiencies
of classic flooding by negotiation and resource adaptation. The SPIN family of protocols
is designed based on two basic ideas: sensor nodes operate more efficiently and conserve
energy by sending data that describe the sensor data (meta-data) instead of sending the
whole data, e.g. an image, and sensor nodes must monitor the changes in their energy resources.
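
The two ideas can be sketched as a three-message negotiation. The message names ADV,
REQ and DATA follow the SPIN proposal [23]; the class, its method names, the energy
budget and the low-energy threshold are hypothetical and only illustrate the principle.

```python
class SpinNode:
    """Toy node that negotiates with descriptors before any payload is sent."""

    def __init__(self, name, energy=10.0, low_energy=2.0):
        self.name = name
        self.store = {}               # descriptor (meta-data) -> payload
        self.energy = energy          # hypothetical energy budget, arbitrary units
        self.low_energy = low_energy  # below this the node stops requesting data

    def advertise(self, descriptor):
        """ADV: announce only the short descriptor of newly available data."""
        return ("ADV", descriptor)

    def handle_adv(self, descriptor):
        """Answer with REQ only if the data are new and enough energy remains."""
        if descriptor in self.store or self.energy <= self.low_energy:
            return None
        return ("REQ", descriptor)

    def handle_req(self, descriptor):
        """DATA: send the full payload to a node that asked for it."""
        return ("DATA", descriptor, self.store[descriptor])

    def handle_data(self, descriptor, payload, cost=1.0):
        """Store the payload and account for the energy spent receiving it."""
        self.store[descriptor] = payload
        self.energy -= cost


if __name__ == "__main__":
    a, b = SpinNode("a"), SpinNode("b")
    a.store["image-42"] = b"...large image bytes..."
    _, descriptor = a.advertise("image-42")
    if b.handle_adv(descriptor) is not None:
        _, descriptor, payload = a.handle_req(descriptor)
        b.handle_data(descriptor, payload)
    print(b.store.keys(), b.energy)
```

A node that already holds the data, or whose energy has dropped to the threshold,
simply does not answer the advertisement, so the large payload is never transmitted to it.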

The sequential assignment routing (SAR) [54] algorithm creates multiple trees where the 
root of each tree is a one-hop neighbor from the sink. Each tree grows outward from the 
sink while avoiding nodes with very low QoS (i.e. low-throughput/high delay) and energy 
reserves. At the end of this procedure, most nodes belong to multiple trees. This allows a 
sensor node to choose a tree to relay its information back to the sink. 

The low-energy adaptive clustering hierarchy (LEACH) [22] is a clustering-based 
protocol that minimizes energy dissipation in sensor networks. The purpose of LEACH 
is to randomly select sensor nodes as cluster-heads, so the high-energy dissipation in com- 
municating with the base station is spread to all sensor nodes in the sensor network. During 
the set-up phase, a sensor node chooses a random number between 0 and 1 . If this random 
number is less than the threshold T(n), the node is a cluster-head. T(n) is calculated as 



$$
T(n) =
\begin{cases}
\dfrac{P}{1 - P\,[r \bmod (1/P)]} & \text{if } n \in G \\
0 & \text{otherwise,}
\end{cases}
$$

where P is the desired percentage of nodes to become cluster-heads; r, the current round;
and G, the set of nodes that have not been selected as a cluster-head in the last 1/P rounds.
After the cluster-heads are selected, the cluster-heads advertise to all sensor nodes in the
network that they are the new cluster-heads. During the steady phase, the sensor nodes can
begin sensing and transmitting data to the cluster-heads. After a certain period of time
spent on the steady phase, the network goes into the set-up phase again and enters another
round of selecting the cluster-heads.

Figure 14.9 An example of directed diffusion [27]: (a) propagate interest, (b) set up gradient
and (c) send data.
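
The LEACH set-up phase described above can be sketched in a few lines. The threshold
used is the LEACH expression from [22], T(n) = P / (1 - P [r mod (1/P)]) for nodes in G
and 0 otherwise. The function names are hypothetical, and the sketch assumes that 1/P is
an integer number of rounds.

```python
import random


def leach_threshold(p, r, in_g):
    """Threshold T(n) for round r; in_g tells whether node n belongs to G."""
    if not in_g:
        return 0.0
    period = round(1.0 / p)            # assumes 1/P is an integer
    return p / (1.0 - p * (r % period))


def elect_cluster_heads(node_ids, eligible, p, r, rng):
    """Each node draws a number in [0, 1) and becomes cluster-head if it is below T(n)."""
    return [n for n in node_ids
            if rng.random() < leach_threshold(p, r, n in eligible)]


if __name__ == "__main__":
    rng = random.Random(0)
    nodes = list(range(100))
    eligible = set(nodes)              # G: nobody has served as cluster-head yet
    for r in range(3):
        heads = elect_cluster_heads(nodes, eligible, p=0.05, r=r, rng=rng)
        eligible -= set(heads)         # elected nodes leave G for this cycle
        print(f"round {r}: {len(heads)} cluster-heads")
```

The threshold grows from P in the first round of a cycle to 1 in the last one, so every
node that is still in G is certain to be elected before the cycle of 1/P rounds ends.
This is how the energy cost of acting as cluster-head is spread over all nodes.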

14.3.3.5 Directed diffusion 

The directed diffusion data dissemination is discussed in Intanagonwiwat et al. [27]. The 
sink sends out interest, which is a task description, to all sensors, as shown in Figure 14.9(a). 
The task descriptors are named by assigning attribute-value pairs that describe the task. Each 
sensor node then stores the interest entry in its cache. The interest entry contains a timestamp 
field and several gradient fields. As the interest is propagated throughout the sensor network, 
the gradients from the source back to the sink are set up as shown in Figure 14.9(b). When 
the source has data for the interest, the source sends the data along the interest's gradient 
path as shown in Figure 14.9(c). The interest and data propagation and aggregation are 
determined locally. Also, the sink must refresh and reinforce the interest when it starts to 
receive data from the source. The directed diffusion is based on data-centric routing where 
the sink broadcasts the interest. 
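
The three steps of Figure 14.9 can be sketched as follows. This is a strong
simplification made for illustration: each node keeps a single gradient, pointing to the
neighbor from which it first heard the interest, whereas directed diffusion [27] keeps
several gradients per interest and lets the sink reinforce preferred paths. The topology
and function names are hypothetical.

```python
from collections import deque

# Hypothetical topology: sink S, relays A, B and C, and a source X.
NETWORK = {
    "S": ["A", "B"],
    "A": ["S", "C"],
    "B": ["S", "C"],
    "C": ["A", "B", "X"],
    "X": ["C"],
}


def propagate_interest(graph, sink):
    """Spread the interest from the sink and set up one gradient per node.

    The gradient of a node is the neighbor from which it first received the
    interest, i.e. its next hop towards the sink.
    """
    gradient = {sink: None}
    queue = deque([sink])
    while queue:
        node = queue.popleft()
        for neighbor in graph[node]:
            if neighbor not in gradient:
                gradient[neighbor] = node
                queue.append(neighbor)
    return gradient


def send_data(gradient, source):
    """Follow the gradients from the source back to the sink; return the path."""
    path = [source]
    while gradient[path[-1]] is not None:
        path.append(gradient[path[-1]])
    return path


if __name__ == "__main__":
    gradients = propagate_interest(NETWORK, "S")
    print(send_data(gradients, "X"))
```

Every forwarding decision uses only the locally stored gradient, which reflects the
statement above that interest and data propagation are determined locally.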

14.3.4 Transport layer 

The need for a transport layer is discussed in References [44, 46]. This layer is especially
needed when access to the system is planned through the Internet or other external networks.
TCP with its current transmission window mechanisms, as discussed in Chapter 9, is not
well matched to the extreme characteristics of the sensor network environment. In Bakre and
Badrinath [62] an approach called TCP splitting is considered to make sensor networks
interact with other networks such as the Internet. In this approach, TCP connections are
ended at sink nodes, and a special transport layer protocol can handle the communications
between the sink node and sensor nodes. As a result, the communication between the user
and the sink node is by UDP or TCP via the Internet or satellite. Owing to the limited
memory of sensor nodes, the communication between the sink and sensor nodes may be purely
by UDP-type protocols.






Unlike protocols such as TCP, the end-to-end communication schemes in sensor net- 
works are not based on global addressing. These schemes must consider that attribute-based 
naming is used to indicate the destinations of the data packets. 

In conclusion, TCP variants developed in Chapter 9 for the traditional wireless networks
are not suitable for WSNs, where the notion of end-to-end reliability has to be redefined
owing to the specific nature of the sensor network, which comes with features such as: (1)
multiple senders (the sensors) and one destination (the sink), which creates a reverse
multicast type of data flow; (2) for the same event there is a high level of redundancy or
correlation in the data collected by the sensors and thus there is no need for end-to-end
reliability between individual sensors and the sink, but instead between the event and the
sink; (3) on the other hand there is a need for end-to-end reliability between the sink and
individual nodes for situations such as re-tasking or reprogramming; (4) the protocols
developed should be energy aware and simple enough to be implemented in the low-end type of
hardware and software of many WSN applications.

Pump slowly, fetch quickly (PSFQ) is designed to distribute data from a source node 
by pacing the injection of packets into the network at relatively low speed (pump slowly), 
which allows nodes that experience data loss to aggressively recover missing data from 
their neighbors (fetch quickly). The goals of this protocol are: (1) to ensure that all data 
segments are delivered to the intended destinations with minimal special requirements 
on the nature of the lower layers; (2) to minimize the number of transmissions to recover 
lost information; (3) to operate correctly even in situations where the quality of the wireless 
links is very poor; and (4) to provide loose delay bounds for data delivery to all intended 
receivers. PSFQ has been designed to guarantee sensor-to-sensor delivery and to provide 
end-to-end reliability for control management distribution from the control node (sink) to 
the sensors. It does not address congestion control. 
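
The fetch step can be illustrated with a minimal sketch. It assumes that data segments carry consecutive integer sequence numbers and that a node notices a loss when a higher-numbered segment arrives; the function name and the data layout are hypothetical and serve only to illustrate the idea.

```python
def missing_segments(received, newest):
    """Return the sequence numbers a node should fetch from its neighbors.

    received: set of sequence numbers already stored at this node
    newest:   sequence number of the segment that has just arrived
    """
    # Every number below the newest segment that is not stored counts as lost.
    return [seq for seq in range(newest) if seq not in received]


# A node holding segments 0, 1 and 4 receives segment 5. It requests the
# gaps from its neighbors before the slowly pumped stream moves on.
to_fetch = missing_segments({0, 1, 4}, 5)
```

Because the source injects packets slowly, a node has time to issue such requests and repair the gap locally instead of relying on end-to-end retransmission.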

Event-to-sink reliable transport (ESRT), illustrated in Figure 14.10, is designed to 
achieve reliable event detection (at the sink node) with a protocol that is energy aware 
and has congestion control mechanisms. 

The protocol provides self-configuration, even in the case of a dynamic topology. For 
energy awareness sensor nodes are notified to decrease their frequency of reporting if the 
reliability level at the sink node is above the minimum. Congestion control takes advantage 
of the high level of correlation between the data flows corresponding to the same event. 
The sink is only interested in the collective information from a group of sensors, not in their 
individual reports. Most of the complexity of the protocol falls on the sink node, minimizing 
the requirements on the sensor nodes. 

Figure 14.10 Event-to-sink reliable transport (ESRT). 
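
The energy-awareness rule described above can be sketched as a sink-side decision. This is a deliberately simplified illustration, not the ESRT update rule itself: the multiplicative factor `step` and the function name are assumptions introduced here.

```python
def next_reporting_frequency(freq_hz, observed, required, step=0.9):
    """Sink-side choice of the next reporting frequency (simplified sketch).

    observed: event reports received at the sink in the last interval
    required: minimum number of reports needed for reliable detection
    step:     hypothetical multiplicative back-off factor (an assumption)
    """
    if observed > required:
        # Reliability exceeds the minimum: slow the sensors down to save energy.
        return freq_hz * step
    # In this sketch the frequency is left unchanged otherwise.
    return freq_hz
```

The decision is taken entirely at the sink, which matches the remark that most of the protocol complexity is placed there.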

14.3.5 Application layer 

In this section, we discuss three application layer protocols, i.e. sensor management protocol 
(SMP), task assignment and data advertisement protocol (TADAP), and sensor query and 
data dissemination protocol (SQDDP). Additional work is expected in this segment of 
sensor networks. 

14.3.5.1 Sensor management protocol 

Sensor networks have many different application areas, and accessing them through 
networks such as the Internet is an option [44]. Designing an application layer manage- 
ment protocol has several advantages. It makes the hardware and software of the lower 
layers transparent to the sensor network management applications. System administrators 
interact with sensor networks using the sensor management protocol (SMP). Unlike many 
other networks, sensor networks consist of nodes that do not have global IDs, and they are 
usually infrastructureless. Therefore, SMP needs to access the nodes by using attribute- 
based naming and location-based addressing, which are explained in detail earlier in this 
section. SMP is a management protocol that provides the software operations needed to 
perform the following administrative tasks: introducing the rules related to data aggrega- 
tion, attribute-based naming and clustering to the sensor nodes, exchanging data related to 
the location-finding algorithms, time synchronization of the sensor nodes, moving sensor 
nodes, turning sensor nodes on and off, querying the sensor network configuration and the 
status of nodes, re-configuring the sensor network, and authentication, key distribution and 
security in data communications. Discussion of these issues is given in References [15, 41, 
48, 49, 65, 95-97]. 

The task assignment and data advertisement protocol controls interest dissemination in 
sensor networks. As discussed earlier in this section (Figure 14.6), users send their interest 
to a sensor node, a subset of the nodes or whole network. This interest may be about a certain 
attribute of the phenomenon or a triggering event. Another approach is the advertisement 
of available data in which the sensor nodes advertise the available data to the users, and the 
users query the data they are interested in. An application layer protocol that provides the 
user software with efficient interfaces for interest dissemination is useful for lower layer 
operations, such as routing, as explained in Figure 14.6. 

The sensor query and data dissemination protocol (SQDDP) provides user applications 
with interfaces to issue queries, respond to queries and collect incoming replies. Note that 
these queries are generally not issued to particular nodes. Instead, attribute-based or 
location-based naming is preferred. For instance, 'the locations of the nodes in the supermarket 
where the stock of items for sale is below a threshold and should be re-supplied' is an 
attribute-based query. Similarly, 'the stock size of the item for sale at node A' is an example 
of location-based naming. 
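
The two kinds of query can be contrasted with a small sketch. The node records, field names and helper functions below are hypothetical; they only mirror the supermarket example.

```python
nodes = [
    {"id": "A", "location": (2, 5), "stock": 12},
    {"id": "B", "location": (7, 1), "stock": 3},
    {"id": "C", "location": (4, 4), "stock": 1},
]


def locations_below(nodes, threshold):
    """Attribute-based query: where is the stock below the threshold?"""
    return [n["location"] for n in nodes if n["stock"] < threshold]


def stock_at(nodes, node_id):
    """Query directed at one place, node A in the example of the text."""
    for n in nodes:
        if n["id"] == node_id:
            return n["stock"]
    return None


low_stock_locations = locations_below(nodes, 5)
stock_at_a = stock_at(nodes, "A")
```

The first query never names a node; any node whose attribute satisfies the condition answers it.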

Sensor query and tasking language (SQTL) [49] is proposed as an application that 
provides an even larger set of services. SQTL supports three types of events, which are defined 
by the keywords receive, every and expire. The receive keyword defines events generated by 
a sensor node when the sensor node receives a message; the every keyword defines events 
occurring periodically due to a timer time-out; and the expire keyword defines the events 
occurring when a timer has expired. If a sensor node receives a message that is intended for it 
and contains a script, the sensor node then executes the script. Although SQTL is proposed, 
different types of SQDDP can be developed for various applications. The use of SQDDPs 
may be unique to each application. 

14.4 MOBILE SENSOR NETWORKS DEPLOYMENT 

A mobile sensor network is composed of a distributed collection of nodes, each of which 
in addition to sensing, computation and communication also has locomotion capabilities. 
Locomotion facilitates a number of useful network capabilities, including the ability to self- 
deploy and self-repair. The use of mobile sensor networks includes applications ranging 
from urban combat scenarios to search-and-rescue operations and emergency environment 
monitoring. An example is a scenario involving a hazardous materials leak in an urban 
environment. In general, we would like to be able to throw a number of sensor nodes into a 
building through a window or doorway. The nodes are equipped with chemical sensors that 
allow them to detect the relevant hazardous material, and deploy themselves throughout 
the building in such a way that they maximize the area 'covered' by these sensors. Data 
from the nodes are transmitted to a base station located safely outside the building, where 
they are assembled to form a live map showing the concentration of hazardous compounds 
within the building. 

For the sensor network to be useful in this scenario, the location of each node must 
be determined. In urban environments, accurate localization using GPS is generally not 
possible (due to occlusions or multipath effects), while landmark-based approaches require 
prior models of the environment that may be either unavailable (destroyed), incomplete or 
inaccurate. This is particularly true in disaster scenarios, where the environment may have 
undergone recent (and unplanned) structural modifications. Therefore it is of interest to 
determine the location of network nodes using the nodes themselves as landmarks. This 
particular technique, however, requires that nodes maintain line-of-sight relationships with 
one another. This condition also enables establishment of line-of-sight communications 
links which operate with minimum energy consumption. An additional demand is that 
nodes should deploy in such a way that they maximize the area 'covered' by the network, 
while simultaneously ensuring that each node can be seen by at least one other node. 

The concept of coverage as a paradigm for evaluating multi-robot [98-114] systems was 
introduced in Gage [93]. Three basic types of coverage were defined: blanket coverage, 
where the objective is to achieve a static arrangement of nodes that maximizes the total 
detection area; barrier coverage, where the objective is to minimize the probability of 
undetected penetration through the barrier; and sweep coverage, which is more or less 
equivalent to a moving barrier. The problem of exploration and map-building by a single 
robot in an unknown environment has been considered by a number of authors [113-115]. 
The frontier-based approach described in References [113, 114] proceeds by incrementally 
building a global occupancy map of the environment, which is then analyzed to find the 
'frontiers' between free and unknown space. The robot is directed to the nearest such 
frontier. The network deployment algorithm described in Howard et al. [98] shares a number 
of similarities with [113]. It also builds a global occupancy grid of the environment and 
directs nodes to the frontier between free and unknown space. However, in this deployment 
algorithm the map is built entirely from live, rather than stored, sensory data. It also satisfies 
an additional constraint: that each node must be visible to at least one other node. 

Multi-robot exploration and map-building has been explored by a number of authors 
[100, 106, 109, 111] who use a variety of techniques ranging from topological matching [100] 
to fuzzy inference [106] and particle filters [111]. Once again, there are two key differences 
between these earlier works and the work described in Howard et al. [98], where maps are 
built entirely from live, not stored, sensory data, and the deployment algorithm must satisfy 
an additional constraint (i.e. line-of-sight visibility). 

A distributed algorithm for the deployment of mobile robot teams has been described 
in [108], where the concept of 'virtual pheromones' is introduced. These are localized 
messages that are emitted by one robot and detected by nearby robots. Virtual pheromones 
can be used to generate either 'gas expansion' or 'guided growth' deployment models. The 
key advantage of this approach is that the deployment algorithm is entirely distributed, and 
has the potential to respond dynamically to changes in the environment. This algorithm 
does, however, lead to relatively slow deployment; it is also unclear, from the published 
results, how effective this algorithm is at producing good area coverage. A somewhat similar 
algorithm based on artificial potential fields is described in Howard et al. [104]. 

The algorithm elaborated in this section is an incremental deployment algorithm [98], 
where nodes are deployed one at a time, with each node making use of information gathered 
by the previously deployed nodes to determine its ideal deployment location. The algorithm 
aims to maximize the total network coverage, i.e. the total area that can be 'seen' by the 
network. At the same time, the algorithm must ensure that the visibility constraint is satisfied, 
i.e. each node must be visible to at least one other node. The algorithm relies on a number 
of key assumptions: 

(1) Homogeneous nodes - all nodes are assumed to be identical. We also assume that 
each node is equipped with a range sensor, a broadcast communications device, and 
is mounted on some form of mobile platform. 

(2) Static environment - the environment is assumed to be static, at least to the extent 
that gross topology remains unchanged while the network is deploying. 

(3) Model-free - this algorithm is intended for applications in which environment models 
are unavailable; indeed, a key task for the network may be to generate such models. 

(4) Full communication - all nodes in the network can communicate with some remote 
base station on which the deployment algorithm is executed. Note that this does not 
automatically imply that all nodes must be within radio range of the base station; 
the nodes may, for example, form an ad hoc multihop network. 

(5) Localization - the position of each node is known in some arbitrary global coordinate 
system. This technique does not require external landmarks or prior models of the 
environment, but does require that each node is visible to at least one other node. It 
is this requirement that gives rise to the visibility constraint, i.e. each node must be 
visible to at least one other node at its deployed location. 

Two performance metrics are of interest: coverage, i.e. the total area visible to the 
network's sensors; and time, i.e. the total deployment time, including both the time taken to 
perform the necessary computations and the time taken to physically move the nodes. The 
objective is to maximize the coverage while minimizing the deployment time. 

The algorithm has four phases: initialization, selection, assignment and execution. 

• Initialization - nodes are assigned one of three states: waiting, active or deployed. As 
the names suggest, a waiting node is waiting to be deployed, an active node is in the 
process of deploying, and a deployed node has already been deployed. Initially, the state 
of all nodes is set to waiting, with the exception of a single node that is set to deployed. 
This node provides a starting point, or 'anchor', for the network, and is not subject to the 
visibility constraint. 

• Selection - sensor data from the deployed nodes is combined to form a common map 
of the environment (occupancy grid). This map is analyzed to select the deployment 
location, or goal, for the next node. Each cell in this grid is assigned one of three states: 
free, occupied or unknown. A cell is free if it is known to contain no obstacles, occupied 
if it is known to contain one or more obstacles, and unknown otherwise. In the combined 
occupancy grid, any cell that can be seen by one or more nodes will be marked as either 
free or occupied; only those cells that cannot be seen by any node will be marked as unknown. 
We can therefore ensure that the visibility constraint is satisfied by always selecting goals 
that lie somewhere in free space. 

• Assignment - in the simplest case, the selected goal is assigned to a waiting node, and 
the node's state is changed from waiting to active. More commonly, assignment is com- 
plicated by the fact that deployed nodes tend to obstruct waiting nodes, necessitating a 
more complex assignment algorithm. That is, the algorithm may have to re-assign the 
goals of a number of previously deployed nodes, changing their state from deployed to 
active. 

• Execution - active nodes are deployed sequentially to their goal locations. The state of 
each node is changed from active to deployed upon arrival at the goal. The algorithm 
iterates through the selection, assignment and execution phases, terminating only when 
all nodes have been deployed. Performance examples of the algorithm can be found in 
Howard et al. [98]. 
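
The four phases can be sketched as a short loop. The sketch below makes several simplifying assumptions that are not part of the algorithm in [98]: the environment is an obstacle-free square grid, each node sees a square neighbourhood of fixed radius, the goal is chosen greedily as the seen cell that would reveal the most unseen cells, and no previously deployed node is ever re-assigned. All names are hypothetical.

```python
from itertools import product


def visible_cells(pos, radius, size):
    """Cells a node at pos can see (obstacle-free square range, an assumption)."""
    x0, y0 = pos
    return {(x, y)
            for x, y in product(range(size), repeat=2)
            if abs(x - x0) <= radius and abs(y - y0) <= radius}


def deploy(num_nodes, anchor, radius, size):
    """Incremental deployment; returns positions in deployment order."""
    deployed = [anchor]                 # initialization: a single anchor node
    waiting = num_nodes - 1
    while waiting > 0:
        # Selection: combine the views of all deployed nodes into one map.
        seen = set().union(*(visible_cells(p, radius, size) for p in deployed))
        candidates = sorted(seen - set(deployed))
        if not candidates:
            break
        # Greedy goal (assumption): the seen cell that reveals most new cells.
        goal = max(candidates,
                   key=lambda c: len(visible_cells(c, radius, size) - seen))
        # Assignment and execution: the next waiting node moves to the goal.
        deployed.append(goal)
        waiting -= 1
    return deployed


positions = deploy(num_nodes=4, anchor=(0, 0), radius=1, size=6)
```

Because every goal is taken from cells that some deployed node already sees, each new node is visible to at least one other node, which is how the visibility constraint is met in this sketch.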



14.5 DIRECTED DIFFUSION 

As already indicated in Section 14.3.3, directed diffusion consists of several elements. Data 
is named using attribute-value pairs. A sensing task (or a subtask thereof) is disseminated 
throughout the sensor network as an interest for named data. This dissemination sets up 
gradients within the network designed to 'draw' events (i.e. data matching the interest). 
Events start flowing towards the originators of interests along multiple paths. The sensor 
network reinforces one, or a small number of these paths as illustrated in Figure 14.9. In 
this section we elaborate these elements in more detail. 

In directed diffusion, task descriptions are named by, for example, a list of attribute-value 
pairs that describe a task. For example a surveillance system (military or civil application) 
which is expected to report an intrusion in a given area might be described as 

    type = human                    // detect location 
    interval = 20 ms                // send back events every 20 ms 
    duration = 10 s                 // ... for the next 10 s 
    rect = [-100, 100, 200, 400]    // from sensors within rectangle 

For simplicity, we choose the subregion representation to be a rectangle defined on some 
coordinate system; in practice, this might be based on GPS coordinates. The task description 
specifies an interest for data matching the attributes. For this reason, such a task description 
is called an interest. The data sent in response to interests are also named using a similar 
naming scheme. Thus, for example, a sensor that detects an intrusion might generate the 
following data: 

    type = human              // type of intruder seen 
    instance = military       // instance of this type 
    location = [125, 220]     // node location 
    intensity = 0.6           // signal amplitude measure 
    confidence = 0.85         // confidence in the match 
    timestamp = 01:20:40      // event generation time 
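
As a minimal sketch, the interest and the data above can be held as attribute-value dictionaries and compared. The reading of rect as [x_min, y_min, x_max, y_max] and the key names are assumptions made for this illustration only.

```python
interest = {"type": "human", "interval_ms": 20, "duration_s": 10,
            "rect": (-100, 100, 200, 400)}

event = {"type": "human", "instance": "military", "location": (125, 220),
         "intensity": 0.6, "confidence": 0.85, "timestamp": "01:20:40"}


def matches(interest, event):
    """True if the event has the requested type and lies inside the rect."""
    # Assumed corner order: x_min, y_min, x_max, y_max.
    x_min, y_min, x_max, y_max = interest["rect"]
    x, y = event["location"]
    return (event["type"] == interest["type"]
            and x_min <= x <= x_max
            and y_min <= y <= y_max)


is_match = matches(interest, event)
```

Under the assumed corner order, the node at [125, 220] lies inside the rectangle, so its event answers the interest.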

Given our choice of naming scheme, we now describe how interests are diffused through 
the sensor network. Suppose that a task, with a specified type and rect, a duration of 
10 min and an interval of 10 ms, is instantiated at a particular node in the network. The 
interval parameter specifies an event data rate; thus, in our example, the specified data rate is 
100 events per second. This sink node records the task; the task state is purged from the 
node after the time indicated by the duration attribute. 

For each active task, the sink periodically broadcasts an interest message to each of 
its neighbors. This initial interest contains the specified rect and duration attributes, but 
contains a much larger interval attribute. Intuitively, this initial interest may be thought of 
as exploratory; it tries to determine if there indeed are any sensor nodes that detect the 
human intrusion. To do this, the initial interest specifies a low data rate (in our example, 
one event per second). Then, the initial interest takes the following form: 

    type = human 
    interval = 1 s 
    rect = [-100, 200, 200, 400] 
    timestamp = 01:20:40 
    expiresAt = 01:30:40 

The interest is periodically refreshed by the sink. To do this, the sink simply re-sends the 
same interest with a monotonically increasing timestamp attribute. Every node maintains 
an interest cache. Each item in the cache corresponds to a distinct interest. Two interests 
are distinct if their type, interval or rect attributes are different. Interest entries in the cache 
do not contain information about the sink. Thus, interest state scales with the number of 
distinct active interests. The definition of distinct interests also allows interest aggregation. 
Two interests $I_1$ and $I_2$, with identical types and completely overlapping rect attributes, can, in 
some situations, be represented by a single interest entry. 

An entry in the interest cache has several fields. A timestamp field indicates the timestamp 
of the last received matching interest. The interest entry also contains several gradient fields, 
up to one per neighbor. Each gradient contains a data rate field requested by the specified 
neighbor, derived from the interval attribute of the interest. It also contains a duration 
field, derived from the timestamp and expiresAt attributes of the interest, and indicating the 
approximate lifetime of the interest. 

When a node receives an interest, it checks to see if the interest exists in the cache. If no 
matching entry exists (where a match is determined by the definition of distinct interests 
specified above), the node creates an interest entry. The parameters of the interest entry 
are instantiated from the received interest. This entry has a single gradient towards the 
neighbor from which the interest was received, with the specified event data rate. In the 
above example, a neighbor of the sink will set up an interest entry with a gradient of one 
event per second towards the sink. For this, it must be possible to distinguish individual 
neighbors. Any locally unique neighbor identifier may be used for this purpose. 

If there exists an interest entry, but no gradient for the sender of the interest, the node 
adds a gradient with the specified value. It also updates the entry's timestamp and duration 
fields appropriately. Finally, if there exist both an entry and a gradient, the node simply 
updates the timestamp and duration fields. 
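
The three cases above can be summarized in a short sketch. The dictionary layout, the key names and the neighbor identifiers are hypothetical; any locally unique neighbor identifier would do.

```python
def handle_interest(cache, interest, neighbor):
    """Update a node's interest cache for an interest heard from `neighbor`."""
    # Two interests are distinct if their type, interval or rect differ.
    key = (interest["type"], interest["interval_s"], interest["rect"])
    entry = cache.get(key)
    if entry is None:
        # No matching entry: create one (its first gradient is added below).
        entry = {"timestamp": None, "gradients": {}}
        cache[key] = entry
    # Add the gradient towards this neighbor if missing, or refresh it.
    entry["gradients"][neighbor] = {
        "rate_hz": 1.0 / interest["interval_s"],
        "expires_at": interest["expires_at"],
    }
    entry["timestamp"] = interest["timestamp"]
    return entry


cache = {}
interest = {"type": "human", "interval_s": 1, "rect": (-100, 200, 200, 400),
            "timestamp": "01:20:40", "expires_at": "01:30:40"}
handle_interest(cache, interest, "sink")        # creates entry and gradient
handle_interest(cache, interest, "neighbor_b")  # same entry, second gradient
```

Note that the entry stores nothing about the originating sink, only about the neighbor from which the interest was heard.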

After receiving an interest, a node may decide to re-send the interest to some subset of 
its neighbors. To its neighbors, this interest appears to originate from the sending node, 
although it might have come from a distant sink. This is an example of a local interaction. In 
this manner, interests diffuse throughout the network. Not all received interests are re-sent. 
A node may suppress a received interest if it recently re-sent a matching interest. 

Generally speaking, there are several possible choices for neighbors, as presented in 
Figure 14.11. The simplest alternative is to re-broadcast the interest to all neighbors. It may 
also be possible to perform geographic routing, using some of the techniques described 
in Chapters 7 and 13. This can limit the topological scope for interest diffusion, thereby 
resulting in energy savings. Finally, in an immobile sensor network, a node might use cached 
data to direct interests. For example, if in response to an earlier interest, a node heard from 
some neighbor A data sent by some sensor within the region specified by the rect attribute, 
it can direct this interest to A, rather than broadcasting to all neighbors. 

Figure 14.11 Diffusion: (a) gradient establishment; (b) reinforcement; (c) multiple sources; 
(d) multiple sinks; and (e) repair. 

Figure 14.11(a) shows the gradients established in the case where interests are flooded 
through a sensor field. Unlike the simplified description in Figure 14.9(b), notice that every 
pair of neighboring nodes establishes a gradient towards each other. This is a consequence 
of local interactions. 

For a sensor network, a gradient specifies both a data rate and a direction in which to 
send events. More generally, a gradient specifies a value and a direction. Figure 14.9(c) 
implicitly depicts binary valued gradients. In the sensor networks presented in this section, 
gradients have two values that determine the event reporting rate. In other sensor networks, 
gradient values might be used to, for example, probabilistically forward data along different 
paths, achieving some measure of load balancing. 



14.5.1 Data propagation 

A sensor node that is within the specified rect processes interests as described in the previous 
section. In addition, the node tasks its local sensors to begin collecting samples. A sensor 
node that detects a target searches its interest cache for a matching interest entry. In this 
case, a matching entry is one whose rect encompasses the sensor location, and the type 
of the entry matches the detected target type. When it finds one, it computes the highest 
requested event rate among all its outgoing gradients. The node tasks its sensor subsystem 
to generate event samples at this highest data rate. In the previous example, this data rate 
is initially one event per second (until reinforcement is applied). The source then sends to 
each neighbor for whom it has a gradient, an event description every second of the form: 



    type = human              // type of intruder seen 
    instance = military       // instance of this type 
    location = [125, 220]     // node location 
    intensity = 0.6           // signal amplitude measure 
    confidence = 0.85         // confidence in the match 
    timestamp = 01:20:40      // local time when event was generated 



This data message is unicast individually to the relevant neighbors. A node that receives 
a data message from its neighbors attempts to find a matching interest entry in its cache. The 
matching rule is as described in the previous paragraph. If no match exists, the data message 
is silently dropped. If a match exists, the node checks the data cache associated with the 
matching interest entry for loop prevention. If a received data message has a matching data 
cache entry, the data message is silently dropped. Otherwise, the received message is added 
to the data cache and the data message is re-sent to the node's neighbors. 
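
A minimal sketch of this receive rule follows. Identifying a message by its (location, timestamp) pair is an assumption made here for illustration, as are the function and key names.

```python
def handle_data(interest_entry, data_cache, message):
    """Return the neighbors to which a received data message is re-sent."""
    if interest_entry is None:
        return []                       # no matching interest: drop silently
    ident = (message["location"], message["timestamp"])
    if ident in data_cache:
        return []                       # seen before: drop to prevent loops
    data_cache.add(ident)
    # Forward along the gradients stored with the matching interest entry.
    return sorted(interest_entry["gradients"])


entry = {"gradients": {"n1": {}, "n2": {}}}
seen = set()
msg = {"type": "human", "location": (125, 220), "timestamp": "01:20:40"}
first = handle_data(entry, seen, msg)    # new message: forwarded
second = handle_data(entry, seen, msg)   # duplicate: dropped
```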

By examining its data cache, a node can determine the data rate of received events. To re- 
send a received data message, a node needs to examine the matching interest entry's gradient 
list. If all gradients have a data rate that is greater than or equal to the rate of incoming events, 
the node may simply send the received data message to the appropriate neighbors. However, 
if some gradients have a lower data rate than others (caused by selectively reinforcing paths), 
then the node may downconvert to the appropriate gradient. 

14.5.2 Reinforcement 

The sink initially diffuses an interest for a low event-rate notification (one event per second). 
Once sources detect a matching target, they send low-rate events, possibly along multiple 
paths, towards the sink. After the sink starts receiving these low data rate events, it reinforces 
one particular neighbor in order to 'draw down' higher quality (higher data rate) events. 
In general, this feature of directed diffusion is achieved by data driven local rules. One 
example of such a rule is to reinforce any neighbor from which a node receives a previously 
unseen event. To reinforce this neighbor, the sink re-sends the original interest message but 
with a smaller interval (higher data rate). The reinforcement propagates back, and one way 
to choose the reinforced path is to pick one with low delay, as shown in Figure 14.11(b). 
Similar mechanisms are used to handle multiple-source and multiple-sink scenarios, as 
shown in Figure 14.11(c) and (d). 

In directed diffusion, intermediate nodes on a previously reinforced path can also apply 
the reinforcement rules. This is useful to enable local repair of failed or degraded paths as 
indicated in Figure 14.11(e). 
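
The example rule from this section, reinforcing any neighbor that delivers a previously unseen event, can be sketched as follows. The event identifiers, neighbor names and the new interval value are hypothetical.

```python
def neighbors_to_reinforce(arrivals):
    """arrivals: (neighbor, event_id) pairs in order of reception.

    Returns the neighbors that delivered at least one unseen event.
    """
    seen, chosen = set(), []
    for neighbor, event_id in arrivals:
        if event_id not in seen:
            seen.add(event_id)
            if neighbor not in chosen:
                chosen.append(neighbor)
    return chosen


def reinforcement_interest(interest, new_interval_s):
    """Copy of the original interest with a smaller interval (higher rate)."""
    boosted = dict(interest)
    boosted["interval_s"] = new_interval_s
    return boosted


# Neighbor B always delivers each event before neighbor C does.
arrivals = [("B", 1), ("C", 1), ("B", 2), ("C", 2)]
chosen = neighbors_to_reinforce(arrivals)
boosted = reinforcement_interest({"type": "human", "interval_s": 1.0}, 0.01)
```

With this rule the neighbor on the lower-delay path is the one that gets reinforced, since it is the first to deliver each new event.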



14.6 AGGREGATION IN WIRELESS SENSOR NETWORKS 

Data fusion or aggregation is an important concept in sensor networks. The key idea is to 
combine data from different sensors to eliminate redundant transmissions, and still provide 
a rich, multidimensional view of the environment being monitored. This concept shifts 
the focus from address-centric approaches (finding routes between pairs of end nodes) to 
a more data-centric approach (finding routes from multiple sources to a destination that 
allows in-network consolidation of data). 

Consider a network of n sensor nodes 1,2, ... ,n and a sink node t labeled n + 1 
distributed over a region. The locations of the sensors and the sink are fixed and known 
a priori. Each sensor produces some information as it monitors its vicinity. We assume that 
each sensor generates one data packet per time unit to be transmitted to the base station. For 
simplicity, we refer to each time unit as a transmission cycle or simply cycle. We assume 
that all data packets have size k bits. The information from all the sensors needs to be 
gathered at each cycle and sent to the sink for processing. We assume that each sensor has 
the ability to transmit its packet to any other sensor in the network or directly to the sink. 
Further, each sensor $i$ has a battery with finite, non-replenishable energy $E_i$. Whenever 
a sensor transmits or receives a data packet, it consumes some energy from its battery. 
The sink has an unlimited amount of energy available to it. Typical assumptions used in 
the modeling of energy consumption are that a sensor consumes $\epsilon_{elec} = 50$ nJ/bit to run the 
transmitter or receiver circuitry and $\epsilon_{amp} = 100$ pJ/bit/m$^2$ for the transmitter amplifier. 
Thus, the energy consumed by a sensor $i$ in receiving a $k$-bit data packet is given by 
$Rx_i = \epsilon_{elec} \times k$, while the energy consumed in transmitting a data packet to sensor $j$ is 
given by $Tx_{i,j} = \epsilon_{elec} \times k + \epsilon_{amp} \times d_{i,j}^2 \times k$, where $d_{i,j}$ is the distance between nodes $i$ 
and $j$. 
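
As a quick numerical illustration of this energy model, the sketch below evaluates both expressions. The packet size of 1000 bits and the distance of 100 m are example values chosen here, not taken from the text.

```python
E_ELEC = 50e-9     # J/bit, transmitter or receiver circuitry
E_AMP = 100e-12    # J/bit/m^2, transmitter amplifier


def rx_energy(k_bits):
    """Energy (J) consumed in receiving a k-bit packet."""
    return E_ELEC * k_bits


def tx_energy(k_bits, distance_m):
    """Energy (J) consumed in transmitting a k-bit packet over distance_m."""
    return E_ELEC * k_bits + E_AMP * distance_m ** 2 * k_bits


rx = rx_energy(1000)        # about 50 microjoules
tx = tx_energy(1000, 100)   # about 1.05 millijoules
```

At 100 m the amplifier term dominates: transmitting costs roughly twenty times more than receiving, which is why short hops with aggregation can extend the lifetime.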

We define the lifetime T of the system to be the number of cycles until the first sensor 
is drained of its energy. A data gathering schedule specifies, for each cycle, how the data 
packets from all the sensors are collected and transmitted to the base station. A schedule 
can be thought of as a collection of T directed trees, each rooted at the base station and 
spanning all the sensors, i.e. a schedule has one tree for each round. The lifetime of a 
schedule equals the lifetime of the system under that schedule. The objective is to find a 
schedule that maximizes the system lifetime T . 

Data aggregation performs in-network fusion of data packets, coming from different 
sensors enroute to the sink, in an attempt to minimize the number and size of data trans- 
missions and thus save sensor energies. Such aggregation can be performed when the data 
from different sensors are highly correlated. As usual, we make the simplistic assumption 
that an intermediate sensor can aggregate multiple incoming packets into a single outgoing 
packet. 

The problem is to find a data gathering schedule with maximum lifetime for a given 
collection of sensors and a sink, with known locations and the energy of each sensor, where 
sensors are permitted to aggregate incoming data packets. 

Consider a schedule S with lifetime T cycles. Let $f_{i,j}$ be the total number of packets 
that node $i$ (a sensor) transmits to node $j$ (a sensor or sink) in S. The energy constraints at 
each sensor impose 

$$\sum_{j=1}^{n+1} f_{i,j} \cdot Tx_{i,j} + \sum_{j=1}^{n} f_{j,i} \cdot Rx_i \le E_i, \quad i = 1, 2, \ldots, n.$$ 

The schedule S induces a flow network G = (V, E). The flow network G is a directed graph 
having as nodes all the sensors and the sink, and edges $(i, j)$ with capacity $f_{i,j}$ whenever 
$f_{i,j} > 0$. 

If S is a schedule with lifetime T, and G is the flow network induced by S, then, for 
each sensor $s$, the maximum flow from $s$ to the sink $t$ in G is $\ge T$. This is due to the fact 
that each data packet transmitted from a sensor must reach the base station. The packets 
from $s$ could possibly be aggregated with one or more packets from other sensors in the 
network. Intuitively, we need to guarantee that each of the T values from $s$ influences the 
final value(s) received at the sink. In terms of network flows, this implies that sensor $s$ must 
have a maximum $s$-$t$ flow of size $\ge T$ to the sink in the flow network G. Thus, a necessary 
condition for a schedule to have lifetime T is that each node in the induced flow network 
can push flow T to the sink. 
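
This necessary condition can be checked numerically. The sketch below uses a standard Edmonds-Karp maximum-flow routine, which is our choice of method and not one prescribed by the text; the three-node network in the usage example is hypothetical.

```python
from collections import deque


def max_flow(capacity, source, sink):
    """Edmonds-Karp maximum flow; capacity maps (u, v) to packet counts."""
    residual = dict(capacity)
    for (u, v) in capacity:
        residual.setdefault((v, u), 0)       # reverse edges start empty
    neighbors = {}
    for (u, v) in residual:
        neighbors.setdefault(u, []).append(v)
    total = 0
    while True:
        # Breadth-first search for a shortest augmenting path.
        parent = {source: None}
        queue = deque([source])
        while queue and sink not in parent:
            u = queue.popleft()
            for v in neighbors.get(u, []):
                if v not in parent and residual[(u, v)] > 0:
                    parent[v] = u
                    queue.append(v)
        if sink not in parent:
            return total
        # Collect the path, find its bottleneck, then push flow along it.
        path, v = [], sink
        while parent[v] is not None:
            path.append((parent[v], v))
            v = parent[v]
        push = min(residual[e] for e in path)
        for (u, v) in path:
            residual[(u, v)] -= push
            residual[(v, u)] += push
        total += push


def supports_lifetime(capacity, sensors, sink, lifetime):
    """True if every sensor can push `lifetime` units of flow to the sink."""
    return all(max_flow(capacity, s, sink) >= lifetime for s in sensors)


# Hypothetical network: sensors 1 and 2, sink 3.
f = {(2, 1): 30, (1, 3): 50, (2, 3): 20}
ok_50 = supports_lifetime(f, [1, 2], 3, 50)
ok_51 = supports_lifetime(f, [1, 2], 3, 51)
```

In this example sensor 2 reaches the sink with 30 units through sensor 1 and 20 units directly, so a lifetime of 50 is feasible while 51 is not.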

Now, we consider the problem of finding a flow network G with maximum T, that allows 
each sensor to push flow T to the base station, while respecting the energy constraints at 
all the sensors. What needs to be found are the capacities of the edges of G. Such a flow 
network G will be referred to as an admissible flow network with lifetime T. An admissible 
flow network with maximum lifetime is called an optimal admissible flow network. 

An optimal admissible flow network can be found using an integer program with linear 
constraints. If, for each sensor $k = 1, 2, \ldots, n$, $\pi_{i,j}^{(k)}$ is a flow variable indicating the flow 
that $k$ sends to the sink $t$ over the edge $(i, j)$, the integer program is given by: 

Maximize $T$ with 

$$\sum_{j=1}^{n+1} f_{i,j} \cdot Tx_{i,j} + \sum_{j=1}^{n} f_{j,i} \cdot Rx_i \le E_i, \quad i = 1, 2, \ldots, n$$ 

$$\sum_{j=1}^{n} \pi_{j,i}^{(k)} = \sum_{j=1}^{n+1} \pi_{i,j}^{(k)}, \quad \text{for all } i = 1, 2, \ldots, n \text{ and } i \ne k$$ 

$$T + \sum_{j=1}^{n} \pi_{j,k}^{(k)} = \sum_{j=1}^{n+1} \pi_{k,j}^{(k)}$$ 

$$0 \le \pi_{i,j}^{(k)} \le f_{i,j}, \quad \text{for all } i = 1, 2, \ldots, n \text{ and } j = 1, 2, \ldots, n+1$$ 

$$\sum_{i=1}^{n} \pi_{i,n+1}^{(k)} = T$$ 

The first line imposes the energy constraint per node; the next two lines enforce the flow 
conservation principle at a sensor; the next line ensures that the capacity constraints on the 
edges of the flow network are respected and the last line ensures that T flow from sensor k 
reaches the sink. 

Now we can get a schedule from an admissible flow network. A schedule is a collection 
of directed trees rooted at the sink that span all the sensors, with one such tree for each 
cycle. Each such tree specifies how data packets are gathered and transmitted to the sink. 
These trees are referred to as aggregation trees. An aggregation tree may be used for one 
or more cycles. The number of cycles f for which an aggregation tree is used is indicated 
by associating the value f with each one of its edges. In the following f is referred to as 
the lifetime of the aggregation tree. The depth of a sensor v is the average of its depths in 
each of the aggregation trees, and the depth of the schedule is max{depth(v) : v ∈ V}. 

Figure 14.12 shows an admissible flow network G with lifetime T = 50 and two 
aggregation trees $A_1$ and $A_2$, with lifetimes 30 and 20, respectively. By looking at one of 
these trees, say $A_1$, we see that, for each one of 30 cycles, sensor 2 transmits one packet to 
sensor 1, which in turn aggregates it with its own data packet and then sends one data packet 
to the base station. 

Figure 14.12 An admissible flow network G with lifetime 50 rounds and two aggregation 
trees $A_1$ and $A_2$ with lifetimes 30 and 20 rounds, respectively. The depth of 
the schedule with aggregation trees $A_1$ and $A_2$ is 2. 

Given an admissible flow network G with lifetime T and a directed tree 
A rooted at the sink t with lifetime f, we define the (A, f)-reduction G' of G to be the flow 
network that results from G after reducing the capacities of all of its edges that are also 
in A by f. We call G' the (A, f)-reduced G. An (A, f)-reduction G' of G is feasible if 
the maximum flow from v to the sink t in G' is $\ge T - f$ for each vertex v in G'. Note that 
A does not have to span all the vertices of G, and thus it is not necessarily an aggregation 
tree. Moreover, if A is an aggregation tree, with lifetime f, for an admissible flow network 
G with lifetime T, and the (A, f)-reduction of G is feasible, then the (A, f)-reduced flow 
network G' of G is an admissible flow network with lifetime T - f. Therefore, we can 
devise a simple iterative algorithm to construct a schedule for an admissible flow network 
G with lifetime T, provided we can find such an aggregation tree A. 

Aggretree (G, T, t) 

    initialize f ← 1 
    let A = (V0, E0) where V0 = {t} and E0 = ∅ 
    while A does not span all the nodes of G do 
        for each edge e = (i, j) ∈ G such that i ∉ V0 and j ∈ V0 do 
            let A' be A together with the edge e 
            // check if the (A', 1)-reduction of G is feasible 
            let Gr be the (A', 1)-reduction of G 
            if MAXFLOW(v, t, Gr) ≥ T - 1 for all nodes v of G 
                // replace A with A' 
                V0 ← V0 ∪ {i}, E0 ← E0 ∪ {e} 
                break 
    let c_min be the minimum capacity of the edges in A 
    let Gr be the (A, c_min)-reduction of G 
    if MAXFLOW(v, t, Gr) ≥ T - c_min for all nodes v of G 
        f ← c_min 
    replace G with the (A, f)-reduction of G 
    return f, G, A 

The Aggretree (G, T, t) algorithm can be used to obtain an aggregation tree A with 
lifetime f from an admissible flow network G with lifetime $T \ge f$. Tree A is formed as 
follows. Initially A contains just the sink t. While A does not span all the sensors, we find 
and add to A an edge e = (i, j), where i ∉ A and j ∈ A, provided that the (A', f)-reduction 
of G is feasible - here A' is the tree A together with the edge e and f is the minimum of the 
capacities of the edges in A'. Given a flow network G and sink t such that each sensor has 
a minimum s-t cut of size $\ge T$ (i.e. the maximum flow from s to t in G is $\ge T$), we can 
prove that it is always possible to find a sequence of aggregation trees, via the algorithm, that 
can be used to aggregate T data packets from each of the sensors. The proof of correctness 
is based on minimax theorems in graph theory [116, 117]. 
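The tree-extraction procedure can be sketched in a few lines. This is a minimal illustration, assuming the flow network is stored as a dictionary of dictionaries of integer capacities and using a textbook Edmonds-Karp routine as the MAXFLOW subroutine (the source does not say which max-flow algorithm is used). The function names and the three-node network are hypothetical; the network is not the one in Figure 14.12.

```python
from collections import deque


def max_flow(cap, s, t):
    """Edmonds-Karp maximum flow; cap is a dict of dicts of edge capacities."""
    res = {u: dict(nbrs) for u, nbrs in cap.items()}
    for u, nbrs in cap.items():
        for v in nbrs:
            res.setdefault(v, {}).setdefault(u, 0)  # reverse residual edge
    res.setdefault(s, {})
    flow = 0
    while True:
        parent = {s: None}
        queue = deque([s])
        while queue and t not in parent:  # breadth-first search for a path
            u = queue.popleft()
            for v, c in res[u].items():
                if c > 0 and v not in parent:
                    parent[v] = u
                    queue.append(v)
        if t not in parent:
            return flow
        path, v = [], t
        while parent[v] is not None:
            path.append((parent[v], v))
            v = parent[v]
        push = min(res[u][v] for u, v in path)
        for u, v in path:
            res[u][v] -= push
            res[v][u] += push
        flow += push


def reduction(cap, tree, f):
    """The (A, f)-reduction: lower the capacity of every tree edge by f."""
    out = {u: dict(nbrs) for u, nbrs in cap.items()}
    for i, j in tree:
        out[i][j] -= f
    return out


def feasible(cap, nodes, sink, target):
    """True if every non-sink node can still push `target` units to the sink."""
    return all(max_flow(cap, v, sink) >= target for v in nodes if v != sink)


def aggretree(cap, T, sink):
    """Return (f, reduced network, tree edges) for one aggregation tree."""
    nodes = set(cap) | {v for nbrs in cap.values() for v in nbrs}
    in_tree, tree = {sink}, []
    while in_tree != nodes:
        # Edges (i, j) leaving a node outside the tree and entering the tree.
        candidates = [(i, j) for i in cap if i not in in_tree
                      for j, c in cap[i].items() if j in in_tree and c >= 1]
        for e in candidates:
            if feasible(reduction(cap, tree + [e], 1), nodes, sink, T - 1):
                in_tree.add(e[0])
                tree.append(e)
                break
        else:
            raise ValueError("no feasible edge; G is not admissible for T")
    f = 1
    c_min = min(cap[i][j] for i, j in tree)
    if feasible(reduction(cap, tree, c_min), nodes, sink, T - c_min):
        f = c_min
    return f, reduction(cap, tree, f), tree


# Hypothetical network: sensors 1 and 2, sink "t", lifetime T = 50.
G = {1: {"t": 50}, 2: {1: 30, "t": 20}}
f1, G1, A1 = aggretree(G, 50, "t")
f2, G2, A2 = aggretree(G1, 50 - f1, "t")
print(f1, A1, f2, A2)
```

Calling the function repeatedly on the reduced network, with the remaining lifetime, yields the sequence of aggregation trees that makes up the schedule.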

Experimental results show [118] that, for a network with 60 nodes, the above algorithm 
can improve the network lifetime by a factor of more than 20. 

14.7 BOUNDARY ESTIMATION 

An important problem in sensor networking applications is boundary estimation [79, 80, 
127, 129]. Consider a network sensing a field composed of two or more regions of distinct 
behavior (e.g. differing mean values for the sensor measurements). An example of such a 
field is depicted in Figure 14.13(a). In practice this may represent the boundary of an area 
on fire or of a contaminated area. Boundary estimation is the process of determining the 
delineation between homogeneous regions. By transmitting to the sink only the information 
about the boundary instead of the transmission from each sensor, a significant aggregation 
effect can be achieved. There are two fundamental limitations in the boundary estimation 
problem. First, the accuracy of a boundary estimate is limited by the spatial density of sensors 
in the network and by the amount of noise associated with the measurement process. Second, 
energy constraints may limit the complexity of the boundary estimate that is ultimately 
transmitted to a desired destination. 

The objective is to consider measurements from a collection of sensors and determine 
the boundary between two fields of relatively homogeneous measurements. 







Figure 14.13 Sensing an inhomogeneous field. (a) Points are sensor locations. The 
environment has two conditions indicated by the gray and white regions of the square. 
(b) The sensor network domain is partitioned into square cells. (c) Sensors 
within the network operate collaboratively to determine a pruned partition 
that matches the boundary. (d) Final approximation to the boundary between 
the two regions which is transmitted to a remote point. 
We presume a hierarchical structure of 'clusterheads' which manage measurements 
from nodes below them in the hierarchy. Thus, the nodes in each square of the partition 
communicate their measurements to a clusterhead in the square. Index the squares at the 
finest scale by row and column (i, j). The clusterhead in square (i, j) computes the average 
of these measurements to obtain a value $x_{i,j} \sim N(\mu_{i,j}, \sigma^2/m_{i,j})$, where $\mu_{i,j}$ is the mean value, 
$\sigma^2$ is the noise variance for each sensor measurement, and $m_{i,j}$ is the number of nodes in 
square (i, j). Thus we assume sensor measurements that have a Gaussian distribution. For 
simplicity, we assume $m_{i,j} = 1$. The random distribution is to account for noise in the 
system as well as for the small probability of node failure. 

A possible approach to the boundary estimation problem is to devise a hierarchical 
processing strategy that enables the nodes to collaboratively determine a nonuniform rect- 
angular partition of the sensor domain that is adapted to the boundaries [119-125]. The 
partition will have high, fine resolution along the boundary, and low, coarse resolution in 
homogeneous regions of the field, as depicted in Figure 14.13. The partition effectively 
provides a 'staircase'-like approximation to the boundary. 

The estimation process partitions the sensor domain of a normalized unit square $[0, 1]^2$ 
into n sub-squares of sidelength $1/\sqrt{n}$, as shown in Figure 14.13(b). The sidelength $1/\sqrt{n}$ 
is the finest resolution in the analysis. In principle, this initial partition can be generated by 
a recursive dyadic partition (RDP). First divide the domain into four sub-squares of equal 
size. Repeat this process again on each sub-square. Repeat this $(1/2)\log_2 n = J$ times. This 
gives rise to a complete RDP of resolution $1/\sqrt{n}$ (the rectangular partition of the sensing 
domain shown in Figure 14.13(b)). The RDP process can be represented with a quadtree 
structure. The quadtree can be pruned back to produce an RDP with nonuniform resolution 
as shown in Figure 14.13(c). The key issues are: (1) how to implement the pruning process 
in the sensor network; and (2) how to determine the best pruned tree. 

Let $\mathcal{P}_n$ denote the set of all RDPs, including the initial complete RDP and all possible 
prunings. For a certain RDP $P \in \mathcal{P}_n$, on each square of the partition, the estimator of the 
field averages the measurements from the sensors in that square and sets the estimate of the 
field to that average value. This results in a piecewise constant estimate, denoted by $\theta$, of 
the field. This estimator will be compared with the data $x = \{x_{i,j}\}$. The empirical measure 
of performance is the sum-of-squared errors between $\theta = \theta(P)$ and the data $x = \{x_{i,j}\}$: 

$$\Delta(\theta, x) = \sum_{i,j=1}^{\sqrt{n}} [\theta(i, j) - x_{i,j}]^2 \qquad (14.1)$$

The complexity penalized estimator is defined by References [119-125]: 

$$\hat{\theta}_n = \arg\min_{\theta(P):\, P \in \mathcal{P}_n} \Delta[\theta(P), x] + 2\sigma^2 p(n) N_{\theta(P)} \qquad (14.2)$$

where $\sigma^2$ is the noise variance, $N_{\theta(P)}$ denotes the total number of squares in the partition 
P, and p(n) is a certain monotonically increasing function of n that discourages 
unnecessarily high-resolution partitions [appropriate choices of p(n) will be discussed below]. The 
optimization in Equation (14.2) can be solved using a bottom-up tree pruning algorithm in 
O(n) operations [122, 126, 128]. At each level of the hierarchy, the clusterhead receives 
the best sub-partition/subtree estimates from the four clusterheads below it, and compares 
the total cost of these estimates with the cost of the estimate equal to the average of all 
sensors in that cluster to make the decision on pruning. 
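The bottom-up pruning rule can be illustrated on a small grid. The sketch below is an assumption-laden toy, not the authors' implementation: measurements are held in a square list of lists whose side is a power of two, recursion stands in for the clusterhead hierarchy, and the penalty per leaf is $2\sigma^2 p(n)$ with $p(n) = \gamma \log n$ (natural logarithm assumed). The names `prune` and `estimate_leaves` are hypothetical.

```python
import math


def prune(x, r, c, size, lam):
    """Best pruned sub-partition of the size-by-size block with corner (r, c).

    Returns (penalized cost, number of leaves, sum, sum of squares)."""
    if size == 1:
        v = x[r][c]
        return lam, 1, v, v * v  # a single cell fits itself exactly
    h = size // 2
    kids = [prune(x, r + dr, c + dc, h, lam) for dr in (0, h) for dc in (0, h)]
    total = sum(k[2] for k in kids)
    total_sq = sum(k[3] for k in kids)
    # Cost of replacing the four subtrees by one leaf set to the block average.
    sse_merged = total_sq - total * total / (size * size)
    cost_merged = sse_merged + lam
    cost_split = sum(k[0] for k in kids)
    if cost_merged <= cost_split:
        return cost_merged, 1, total, total_sq
    return cost_split, sum(k[1] for k in kids), total, total_sq


def estimate_leaves(x, sigma2, gamma=2 / 3):
    """Number of squares in the partition minimizing SSE + penalty."""
    n = len(x) ** 2
    lam = 2 * sigma2 * gamma * math.log(n)
    return prune(x, 0, 0, len(x), lam)[1]


flat = [[0.0] * 4 for _ in range(4)]                 # homogeneous field
step = [[0.0, 0.0, 10.0, 10.0] for _ in range(4)]    # vertical boundary
print(estimate_leaves(flat, 1.0), estimate_leaves(step, 1.0))
```

Each block needs only four tuples from the level below, which mirrors the message a clusterhead would receive from its four children.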
14.7.1 Number of RDPs in P 

The set $\mathcal{P}_n$ of RDPs consists of all RDPs resulting from pruning $P_J$, the uniform partition of the 
unit square into n squares of sidelength $1/\sqrt{n}$. We need to determine how many RDPs there 
are in $\mathcal{P}_n$ or, more specifically, we need to know how many partitions there are with exactly 
$\ell$ squares/leafs. Since the RDP is based on recursive splits into four, the number of leafs in 
every partition in $\mathcal{P}_n$ is of the form $\ell = 3m + 1$, for some integer $0 \le m \le (n - 1)/3$. The 
integer m corresponds to the number of recursive splits. For each RDP having 3m + 1 leafs 
there is a corresponding partially ordered sequence of m split points (at dyadic positions in 
the plane). In general, there are 

$$\binom{n}{m} = \frac{n!}{(n - m)!\, m!}$$

possible selections of m points from n (n corresponding to the vertices of the finest resolution 
partition, $P_J$). This number is an upper bound on the number of partitions in $\mathcal{P}_n$ with 
$\ell = 3m + 1$ leafs (since RDPs can only have dyadic split points). 
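For small grids the RDPs can be counted exactly and compared with the binomial bound. The sketch below assumes the recursive description given earlier: an RDP of depth at most J is either the undivided square or a split whose four quadrants are RDPs of depth at most J - 1. The function name `rdp_leaf_counts` is hypothetical.

```python
from math import comb


def rdp_leaf_counts(J):
    """Map: number of leaves -> number of RDPs of depth at most J."""
    counts = {1: 1}  # depth 0: only the undivided square
    for _ in range(J):
        split = {0: 1}
        for _ in range(4):  # combine the four quadrants one at a time
            nxt = {}
            for a, ca in split.items():
                for b, cb in counts.items():
                    nxt[a + b] = nxt.get(a + b, 0) + ca * cb
            split = nxt
        split[1] = split.get(1, 0) + 1  # add the undivided square
        counts = split
    return counts


J = 2
n = 4 ** J
for leaves, count in sorted(rdp_leaf_counts(J).items()):
    m = (leaves - 1) // 3
    print(leaves, count, comb(n, m))  # leaves, exact count, binomial bound
```

Every leaf count that appears has the form 3m + 1, and the exact count stays below the binomial coefficient for the same m.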



14.7.2 Kraft inequality 

Let $\Theta_n$ denote the set of all possible models of the field. This set contains piecewise constant 
models (constant on the dyadic squares corresponding to one of the partitions in $\mathcal{P}_n$). The 
constant values are in a prescribed range [-R, R], and are quantized to k bits. The range 
corresponds to the upper and lower limits of the amplitude range of the sensors. The set $\Theta_n$ 
consists of a finite number of models derived in the previous section. Here we show that 
with the number of bits k employed per transmission and p(n) properly calibrated, we have 

$$\sum_{\theta \in \Theta_n} e^{-p(n)|\theta|} \le 1 \qquad (14.3)$$

where for simplicity the notation $N_{\theta(P)} = |\theta|$ is used. If $\Theta_n^{(m)}$ denotes the subset of $\Theta_n$ consisting 
of models based on $\ell = 3m + 1$ leaf partitions, then we have 

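$$\sum_{\theta \in \Theta_n} e^{-p(n)|\theta|} = \sum_{m=0}^{(n-1)/3} \sum_{\theta \in \Theta_n^{(m)}} e^{-(3m+1)p(n)} \le \sum_{m=0}^{(n-1)/3} \binom{n}{m} (2^k)^{3m+1} e^{-(3m+1)p(n)}$$

$$\le \sum_{m=0}^{(n-1)/3} \frac{n^m}{m!} (2^k)^{3m+1} e^{-(3m+1)p(n)}$$

$$= \sum_{m=0}^{(n-1)/3} \frac{1}{m!}\, e^{\,m \log n + (3m+1)\log(2^k) - (3m+1)p(n)}$$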

If $A \equiv m \log n + (3m + 1)\log(2^k) - (3m + 1)p(n) < -1$ (then $e^A < e^{-1}$), then we have 

$$\sum_{\theta \in \Theta_n} e^{-p(n)|\theta|} \le \frac{1}{e} \sum_{m=0}^{(n-1)/3} \frac{1}{m!} \le 1 \qquad (14.4)$$

To guarantee A < -1, we must have p(n) growing at least like log n. Therefore, set $p(n) = 
\gamma \log n$, for some $\gamma > 0$. Also, as we will see later in the next section, to guarantee that 
the quantization of our models is sufficiently fine to contribute a negligible amount to the 
overall error we must select $2^k \sim n^{1/4}$. With these calibrations we have $A = [(7/4 - 3\gamma) 
m + (1/4 - \gamma)] \log n$. In order to guarantee that the MSE converges to zero, we will see in 
the next section that m must be a monotonically increasing function of n. Therefore, for 
n sufficiently large, the term involving $(1/4 - \gamma)$ is negligible, and the condition A < -1 is 
satisfied by $\gamma > 7/12$. In References [119-125] $\gamma = 2/3$ is used. 
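The calibration can be checked numerically. The sketch below evaluates the binomial upper bound on the Kraft sum, assuming $p(n) = \gamma \log n$ with natural logarithms and $2^k = n^{1/4}$ exactly; the function name `kraft_upper_bound` is hypothetical, and the binomial coefficient is computed in the log domain to avoid overflow.

```python
import math


def kraft_upper_bound(n, gamma=2 / 3):
    """Sum over m of C(n, m) * (2^k)^(3m+1) * exp(-(3m+1) p(n))."""
    log_n = math.log(n)
    total = 0.0
    for m in range((n - 1) // 3 + 1):
        log_binom = (math.lgamma(n + 1) - math.lgamma(m + 1)
                     - math.lgamma(n - m + 1))
        leaves = 3 * m + 1
        # log(2^k) = (1/4) log n and p(n) = gamma * log n.
        exponent = log_binom + leaves * 0.25 * log_n - leaves * gamma * log_n
        total += math.exp(exponent)
    return total


for n in (16, 256, 1024):
    print(n, kraft_upper_bound(n))
```

With $\gamma = 2/3$ the bound stays below 1 for these sizes, whereas a smaller value such as $\gamma = 0.5$ lets the bound exceed 1, so the argument above no longer goes through.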

14.7.3 Upper bounds on achievable accuracy 

Assume that p(n) satisfies the condition defined by Equation (14.4) where again $|\theta|$ denotes 
the number of squares (alternatively we shall call this the number of leafs in the pruned 
tree description of the boundary) in the partition $\theta$. It is shown in the above section that 
$p(n) = \gamma \log n$ with $\gamma > 7/12$ satisfies Equation (14.4). Let $\hat{\theta}_n$ denote the solution to 

$$\hat{\theta}_n = \arg\min_{\theta \in \Theta_n} \Delta(\theta, x) + 2\sigma^2 p(n)|\theta| \qquad (14.5)$$

where, as before, x denotes the array of measurements at the finest scale $\{x_{i,j}\}$ and $|\theta|$ 
denotes the number of squares in the partition associated with $\theta$. This is essentially the 
same estimator as defined in Equation (14.2) except that the values of the estimate are 
quantized in this case. 

If $\theta^*$ denotes the true value of the field at resolution $1/\sqrt{n}$ [i.e. $\theta^*(i, j) = E[x_{i,j}]$] then, 
applying Theorem 7 in References [119, 124], the MSE of the estimator $\hat{\theta}_n$ is bounded 
above as 

$$\frac{1}{n} \sum_{i,j=1}^{\sqrt{n}} E\{[\hat{\theta}_n(i, j) - \theta^*(i, j)]^2\} \le \min_{\theta \in \Theta_n} \frac{1}{n} \left\{ 2 \sum_{i,j=1}^{\sqrt{n}} [\theta(i, j) - \theta^*(i, j)]^2 + 8\sigma^2 p(n)|\theta| \right\} \qquad (14.6)$$

The upper bound involves two terms. The first term, $2\sum_{i,j=1}^{\sqrt{n}} [\theta(i, j) - \theta^*(i, j)]^2$, is a 
bound on the bias or approximation error. The second term, $8\sigma^2 p(n)|\theta|$, is a bound on the 
variance or estimation error. The bias term, which measures the squared error between the 
best possible model in the class and the true field, is generally unknown. However, if we 
make certain assumptions on the smoothness of the boundary, then the rate at which this 
term decays as a function of the partition size $|\theta|$ can be determined. 

If the field being sensed is composed of homogeneous regions separated by a 
one-dimensional boundary and if the boundary is a Lipschitz function [122, 128], then by 
carefully calibrating quantization and penalization [taking $k \sim (1/4)\log n$ and setting $p(n) = 
(2/3)\log n$] we have [119, 125] 

$$\frac{1}{n} \sum_{i,j=1}^{\sqrt{n}} E\{[\hat{\theta}_n(i, j) - \theta^*(i, j)]^2\} \le O\left(\sqrt{(\log n)/n}\right) \qquad (14.7)$$

This result shows that the MSE decays to zero at a rate of $\sqrt{(\log n)/n}$. 

14.7.4 System optimization 

The system optimization includes an energy-accuracy trade-off. Energy consumption is 
defined by two communication costs: the cost of communication due to the construction 
of the tree (processing cost) and the cost of communicating the final boundary estimate 
(communication cost). We will show that the expected number of leafs produced by the 
algorithm is $O(\sqrt{n})$, and that the processing and communication energy consumption is 
proportional to this number. Having in mind $MSE \sim \sqrt{(\log n)/n}$ and ignoring the 
logarithmic factor, the accuracy-energy trade-off required to achieve this optimal MSE is roughly 
$MSE \sim 1/\text{energy}$. If each of the n sensors transmits its data, directly or by multiple hops, to 
an external point, the processing and communication energy costs are O(n), which leads to 
the trade-off $MSE \sim 1/\sqrt{\text{energy}}$, since we know that no estimator exists that can result in an 
MSE decaying faster than $O(1/\sqrt{n})$. Thus, the hierarchical boundary estimation method 
offers substantial savings over the naive approach while optimizing the tradeoff between 
accuracy and complexity of the estimate. 

Communication cost is proportional to the final description of the boundary, thus it is 
of interest to compute the expected size of the tree, or $E[|\hat{\theta}|]$. We construct an upper bound 
for $E[|\hat{\theta}|]$ under the assumption of a homogeneous field with no boundary. Let P denote 
the tree-structured partition associated with $\hat{\theta}$. Note that, because P is an RDP, it can have 
d + 1 leafs (pieces in the partition), where d = 3m, m = 0, ..., (n - 1)/3. Therefore, the 
expected number of leafs is given by 

$$E[|\hat{\theta}|] = \sum_{m=0}^{(n-1)/3} (3m + 1) \Pr(|\hat{\theta}| = 3m + 1)$$

The probability $\Pr(|\hat{\theta}| = 3m + 1)$ can be bounded from above by the probability that one 
of the possible partitions with 3m + 1 leafs, m > 0, is chosen in favor of the trivial partition 
with just a single leaf. That is, the event that one of the partitions with 3m + 1 leafs is 
selected implies that partitions of all other sizes were not selected, including the trivial 
partition, from which the upper bound follows. This upper bound allows us to bound the 
expected number of leafs as follows: 

$$E[|\hat{\theta}|] \le \sum_{m=0}^{(n-1)/3} (3m + 1) N_m p_m$$

where $N_m$ denotes the number of different (3m + 1)-leaf partitions, and $p_m$ denotes the 
probability that a particular (3m + 1)-leaf partition is chosen in favor of the trivial partition 
(under the homogeneous assumption). The number $N_m$ can be bounded above by $\binom{n}{m}$, 
just as in the verification of the Kraft inequality. The probability $p_m$ can be bounded as 
follows. Note this is the probability of a particular outcome of a comparison of two models. 
The comparison is made between their respective sum-of-squared errors plus complexity 
penalty, as given by Equation (14.2). The single leaf model has a single degree of freedom 
(mean value of the entire region), and the alternate model, based on the (3m + 1)-leaf partition, has 
3m + 1 degrees of freedom. Thus, under the assumption that the data are i.i.d. zero-mean 
Gaussian distributed with variance $\sigma^2$, it is easy to verify that the difference between the 
sum-of-squared errors of the models [single-leaf model sum-of-squares minus (3m + 1)-leaf 
model sum-of-squares] is distributed as $\sigma^2 W_{3m}$, where $W_{3m}$ is a chi-square distributed 
random variable with 3m degrees of freedom (precisely the difference between the degrees 
of freedom in the two models). This follows from the fact that the difference of the 
sum-of-squared errors is equal to the sum-of-squares of an orthogonal projection of the data onto 
a 3m-dimensional subspace. 

The single-leaf model is rejected if $\sigma^2 W_{3m}$ is greater than the difference between the 
complexity penalties associated with the two models; that is, if $\sigma^2 W_{3m} > (3m + 1)2\sigma^2 
p(n) - 2\sigma^2 p(n) = 6m\sigma^2 p(n)$, where $2\sigma^2 p(n)$ is the penalty associated with each additional 
leaf in P. According to the MSE analysis in the previous section, we require $p(n) = 
\gamma \log n$, with $\gamma > 7/12$. In References [119-125] $\gamma = 2/3$, in which case the rejection of 
the single-leaf model is equivalent to $W_{3m} > 4m \log n$. The probability of this condition, 
$p_m = \Pr(W_{3m} > 4m \log n)$, is bounded from above using Lemma 1 of Laurent and Massart 
[130]: 'If $W_d$ is chi-square distributed with d degrees of freedom, then for s > 0, $\Pr(W_d \ge 
d + s\sqrt{2d} + s^2) \le e^{-s^2/2}$'. Making the identification $d + s\sqrt{2d} + s^2 = 4m \log n$ produces 
the bound 

$$p_m = \Pr(W_{3m} > 4m \log n) \le e^{-2m \log n + m\sqrt{\frac{3}{2}\left(4 \log n - \frac{3}{2}\right)}}$$
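The identification step can be verified numerically: solve the quadratic $s^2 + s\sqrt{2d} + d = 4m \log n$ for its positive root with d = 3m, then compare $e^{-s^2/2}$ with the closed form. This is a small check, assuming natural logarithms; both function names are hypothetical.

```python
import math


def tail_bound(m, n):
    """exp(-s^2/2), with s the positive root of s^2 + s*sqrt(2d) + d = 4m log n."""
    d = 3 * m
    target = 4 * m * math.log(n)
    s = (-math.sqrt(2 * d) + math.sqrt(2 * d - 4 * (d - target))) / 2
    return math.exp(-s * s / 2)


def closed_form(m, n):
    """exp(-2m log n + m sqrt(3/2 (4 log n - 3/2)))."""
    log_n = math.log(n)
    return math.exp(-2 * m * log_n + m * math.sqrt(1.5 * (4 * log_n - 1.5)))


print(tail_bound(3, 1024), closed_form(3, 1024))
```

The two values agree up to rounding, which confirms the algebra behind the bound on $p_m$.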

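Combining the upper bounds above, we have 

$$E[|\hat{\theta}|] \le \sum_{m=0}^{(n-1)/3} (3m + 1) \binom{n}{m} e^{-2m \log n + m\sqrt{\frac{3}{2}\left(4 \log n - \frac{3}{2}\right)}} \qquad (14.8)$$

$$= \sum_{m=0}^{(n-1)/3} (3m + 1) \binom{n}{m} n^{-m} e^{-m \log n + m\sqrt{\frac{3}{2}\left(4 \log n - \frac{3}{2}\right)}}$$

For n > 270 the exponent $-\log n + \sqrt{\frac{3}{2}\left(4 \log n - \frac{3}{2}\right)} < 0$ and therefore 

$$E[|\hat{\theta}|] \le \sum_{m=0}^{(n-1)/3} (3m + 1) \binom{n}{m} n^{-m} \le \sum_{m=0}^{(n-1)/3} (3m + 1) \frac{n^m}{m!} n^{-m} \le \sum_{m=0}^{(n-1)/3} \frac{3m + 1}{m!} < 11$$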

Furthermore, note that, as $n \to \infty$, the exponent $-\log n + \sqrt{\frac{3}{2}\left(4 \log n - \frac{3}{2}\right)} \to -\infty$. 
This fact implies that the factor $e^{-m \log n + m\sqrt{\frac{3}{2}\left(4 \log n - \frac{3}{2}\right)}}$ tends to zero when m > 0. 
Therefore, the expected number of leafs $E[|\hat{\theta}|] \to 1$ as $n \to \infty$. 
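The threshold near n = 270, the constant 11 and the limiting value 1 can all be checked numerically. The following sketch evaluates the right-hand side of (14.8) in the log domain, assuming natural logarithms; the function names are hypothetical.

```python
import math


def exponent(n):
    """-log n + sqrt(3/2 (4 log n - 3/2)), the per-split exponent."""
    log_n = math.log(n)
    return -log_n + math.sqrt(1.5 * (4 * log_n - 1.5))


def expected_leaves_bound(n):
    """Right-hand side of (14.8): sum of (3m+1) C(n,m) n^-m e^(m*exponent)."""
    log_n = math.log(n)
    total = 0.0
    for m in range((n - 1) // 3 + 1):
        log_binom = (math.lgamma(n + 1) - math.lgamma(m + 1)
                     - math.lgamma(n - m + 1))
        total += (3 * m + 1) * math.exp(log_binom - m * log_n + m * exponent(n))
    return total


print(exponent(269), exponent(270))
print(expected_leaves_bound(270), expected_leaves_bound(65536))
```

The exponent changes sign between n = 269 and n = 270, and the bound shrinks toward 1 as n grows, in line with the limit stated above.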

Thus, for large sensor networks, the expected number of leafs (partition pieces) in the 
case where there is no boundary (simply a homogeneous field) is one. To consider the 
inhomogeneous case where a boundary does exist, if the boundary is a Lipschitz function 
or has a box counting dimension of 1, there exists a pruned RDP with at most $C'\sqrt{n}$ 
squares (leafs) that includes the $O(\sqrt{n})$ squares of sidelength $1/\sqrt{n}$ that the boundary 
passes through. Thus an upper bound on the number of leafs required to describe the 
boundary in the noiseless case is given by $C'\sqrt{n}$. 

In the presence of noise, we can use the results above for the homogeneous case to bound 
the number of spurious leafs due to noise (zero as n grows); as a result, for large sensor 
networks, we can expect at most $C'\sqrt{n}$ leafs in total. Thus, the expected energy required to 
transmit the final boundary description is energy = $O(\sqrt{n})$. 

The processing cost is intimately tied to the expected size of the final tree, as this value 
determines how much pruning will occur. We have seen above that the communication cost 
is proportional to $\sqrt{n}$ and herein we shall show that the processing cost is also $O(\sqrt{n})$. 
At each scale $2^j/\sqrt{n}$, $j = 0, \ldots, (1/2)\log_2 n - 1$, the hierarchical algorithm passes a certain 
number of data or averages, $n_j$, corresponding to the number of squares in the best partition 
(up to that scale), up the tree to the next scale. We assume that a constant number of 
bits k is transmitted per measurement. These $k n_j$ bits must be transmitted approximately 
$2^j/\sqrt{n}$ meters (assuming the sensor domain is normalized to 1 square meter). Thus, the 
total in-network communication energy in bit-meters is: 

$$\varepsilon = k \sum_{j=0}^{(1/2)\log_2 n - 1} n_j \frac{2^j}{\sqrt{n}}$$
[Figure 14.14 panels, one row per network size: 65536 observations, Est. p = 2/3 log(65536), 
Partition |θ| = 1111; 1024 observations, Est. p = 2/3 log(1024), Partition |θ| = 172; 
256 observations, Est. p = 2/3 log(256), Partition |θ| = 70.] 

Figure 14.14 Effect of sensor network density (resolution) on boundary estimation. Column 
1, noisy set of measurements; column 2, estimated boundary; and column 3, 
associated partition. (Reproduced by permission of IEEE [121].) 

In the naive approach, $n_j = n$ for all j, and therefore $\varepsilon \approx kn$. In the hierarchical approach, 
first consider the case when there is no boundary. We have already seen that in such cases 
the tree will be pruned at each stage with high probability. Therefore, $n_j = n/4^j$ and $\varepsilon \approx 
2k\sqrt{n}$. Now if a boundary of length $C\sqrt{n}$ is present, then $n_j \le n/4^j + C\sqrt{n}$. This produces 
$\varepsilon \le k(C + 2)\sqrt{n}$. Thus, we see that the hierarchical algorithm results in $\varepsilon = O(\sqrt{n})$. 
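The three cases can be compared by evaluating the in-network energy sum directly. This is a minimal sketch, assuming n is a power of four so that there are exactly $(1/2)\log_2 n$ scales; the values k = 8 bits and C = 1 are hypothetical and chosen only for illustration.

```python
import math


def in_network_energy(n, k, counts):
    """k * sum_j n_j * 2^j / sqrt(n), in bit-meters; counts[j] is n_j."""
    root = math.sqrt(n)
    return k * sum(n_j * 2 ** j / root for j, n_j in enumerate(counts))


def compare(n, k=8, C=1.0):
    """Energy for the naive, no-boundary and boundary cases."""
    J = round(math.log2(n) / 2)  # scales j = 0, ..., J - 1
    naive = in_network_energy(n, k, [n] * J)
    no_boundary = in_network_energy(n, k, [n / 4 ** j for j in range(J)])
    boundary = in_network_energy(
        n, k, [n / 4 ** j + C * math.sqrt(n) for j in range(J)])
    return naive, no_boundary, boundary


print(compare(1024))
```

For n = 1024 the naive cost is close to kn, while both hierarchical cases stay below $k(C + 2)\sqrt{n}$.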

Finally, a performance example is shown in Figure 14.14 [121]. 

14.8 OPTIMAL TRANSMISSION RADIUS IN SENSOR NETWORKS 

In this section we discuss the problem of finding an optimal transmission radius for flooding 
in sensor networks. On one hand, a large transmission radius implies that fewer retransmis- 
sions will be needed to reach the outlying nodes in the network; therefore, the message will 
be heard by all nodes in less time. On the other hand, a larger transmission radius involves a 
higher number of neighbors competing to access the medium, and therefore each node has 
a longer contention delay for packet transmissions. In this section we discuss this tradeoff 
in CSMA/CA wireless MAC protocols. 

Even though flooding has some unique advantages - it maximizes the probability that 
all reachable nodes inside a network will receive the packet - it has several disadvantages 
as well. Several works have proposed mechanisms to improve flooding efficiency. The 
broadcast storm paper by Ni et al. [131] suggests a way to improve flooding by trading ro- 
bustness. The authors propose to limit the number of nodes that transmit the flooded packet. 
The main idea is to have some nodes refrain from forwarding their packet if its transmission 
will not contribute to a larger coverage. Nevertheless, the basic flooding technique is in wide 
use for a number of querying techniques for sensor networks (in large part because of its 
guarantee of maximal robustness), and in this section we focus on analyzing its MAC-layer 
effects and improving its performance by minimizing the settling time of flooding. 
Other studies have looked at the impact of the transmission radius in wireless networks. 
In Gupta and Kumar [132] the authors analyze the critical transmission range to maintain 
connectivity in wireless networks and present a statistical analysis of the probability of 
connectivity. Along the same line of work, Kleinrock and Silvester [133] analyze the minimum 
number of neighbors that a node should have to keep the network connected. 

In Takagi and Kleinrock [134], the authors describe a similar tradeoff for increasing 
the transmission radius: a shorter range implies fewer collisions and a longer range im- 
plies moving a packet further ahead in one hop. However, in that work the authors want 
to maximize a parameter called the expected one-hop progress in the desired direction, 
which essentially measures how fast a packet can reach its destination in point-to-point 
transmissions. 

None of these studies analyzed a protocol like flooding; instead they tried to obtain 
an optimal transmission radius for other metrics such as connectivity, throughput or energy. 
In Ganesan et al. [135] an experimental testbed of 150 Berkeley motes [136] ran flooding 
as the routing protocol. The study showed empirical relations between the reception and 
settling times - parameters used in this section - for different transmission ranges. 

In this section we discuss an optimal transmission radius. However, in this case the 
important metric is the amount of time that a flooded packet captures the transmission 
medium. To accomplish the goal of minimizing the settling time, the tradeoff between 
reception and contention times is studied including the interaction between the MAC-layer 
and network-level behavior of an information dissemination scheme in wireless networks. 

The network model is based on the following assumptions: 

(1) The MAC protocol is based on a CSMA/CA scheme. 

(2) All the nodes have the same transmission radius R. 

(3) The area of the network can be approximated as a square. 

(4) No mobility is considered. 

(5) The nodes are deployed in either a grid or uniform topology. In a uniform topology, 
the physical terrain is divided into a number of cells based on the number of nodes 
in the network, and each node is placed randomly within each cell. 

The analytical model is described by the following terms: 

(1) Reception time ($T_R$) - average time when all the nodes in the network have received 
the flooded packet. 

(2) Contention time ($T_C$) - average time between reception and transmission of a packet 
by all the nodes in the network. 

(3) Settling time ($T_S$) - average time when all the nodes in the network have transmitted 
the flooded packet and signals the end of the flooding event. 

From these definitions we observe that $T_S = T_R + T_C$. If the transmission radius of the 
nodes is not carefully chosen, the flooded packet may take too long to be transmitted by 
all the nodes in the network, impacting overall network throughput. The more time the 
channel is captured by a flooding event, the fewer queries can be disseminated, and the 
less time the channel is available for other packet transmissions. We can state the relation 
between settling time and throughput Th in sensor networks as $Th \propto 1/T_S$. So, the goal 
is to minimize the settling time $T_S$. Since the settling time is the sum of the reception and 
contention times, the remainder of this section will analyze the relationships between $T_R$ 
and $T_C$ with respect to the range of the transmission radius. 

The reception time $T_R$ represents the average time at which nodes received the packet. 
If the transmission radius of each node is increased, the reception time in the network 
will decrease, because there are fewer hops needed to reach outlying nodes. Therefore, the 
reception time $T_R$ is directly proportional to the maximum distance between any two nodes 
in the network, and inversely proportional to the transmission radius. Owing to the kind of 
topologies considered here (grid or uniform), the maximum distance between the nodes is 
the diagonal of the network area. If R is the transmission radius (m) and S the length of the 
side of the square area (m) then $T_R = cS/R$, where c is a constant. 

If a node increases its transmission radius, it will increase its number of neighbors, which 
will cause an increase in the contention time. If we consider the area covered by the network 
as $S^2$ then the expected number of neighbors of a given node is described by $m = \pi R^2 n/S^2$, 
where n is the total number of nodes in the network. However, the contention time is not 
directly proportional to the above expression. There are two phenomena that influence $T_C$: the 
edge phenomenon and the back-off phenomenon. 

The edge phenomenon can be described as follows: nodes close to the edges of the 
network area will not increase their number of neighbors proportionally to the square of 
the radius. The reason is that only a fraction of the area covered by its transmission radius 
intersects the area of the network. This phenomenon is illustrated in Figure 14.15, which 
shows a square topology with a given node (black point). In this figure, we can observe 
three regions as the transmission radius is increased: 

• Region 1 - when R ranges from 0 to the edge of the network ($R_e$). 

• Region 2 - when R ranges from $R_e$ until it covers the entire network ($R_w$). 

• Region 3 - when R is greater than $R_w$. 

Each of these regions will have a different expression for the number of neighbors. For the 
first region, the number of nodes inside the transmission radius is directly proportional to 
the square of the radius. In the second region, the number of neighbors increases 
proportionally to the overlapping area between the transmission range and the network area. The 
overlapping area ($A_O$) is shown in Figure 14.16. 

Figure 14.15 Different regions to calculate the number of neighbors versus the transmission 
radius of the node. 

Figure 14.16 The overlapping area between the transmission radius coverage and the 
effective area of the network. 

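Defining $A_r$ as the residual area beyond S/2, because of symmetry, the total overlapping 
area is given by 

$$A_O = 8\left(\frac{\pi R^2}{8} - A_r\right)$$

where $A_\theta = A_r + A_t$. Since $\theta = \arccos(S/2R)$, we have $A_\theta = \theta(R^2/2)$ and $A_t = 
R^2 \sin(\theta)\cos(\theta)/2$. 

As a consequence $A_r = \theta(R^2/2) - R^2 \sin(\theta)\cos(\theta)/2$ and we get for the center (+) and 
corner position (∠) 

$$A_{O+} = 8\left[\frac{\pi R^2}{8} - \theta\frac{R^2}{2} + \frac{R^2 \sin(\theta)\cos(\theta)}{2}\right]$$

$$A_{O+} = R^2[\pi - 4\theta + 4\sin(\theta)\cos(\theta)]$$

In the case of the lower bound, we have one quarter of a circle and $\phi$ is given by $\phi = 
\arccos(S/R)$ and 

$$A_{O\angle} = 2\left[\frac{\pi R^2}{8} - \phi\frac{R^2}{2} + \frac{R^2 \sin(\phi)\cos(\phi)}{2}\right]$$

$$A_{O\angle} = R^2\left[\frac{\pi}{4} - \phi + \sin(\phi)\cos(\phi)\right]$$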

In the third region, the number of neighbors remains constant and is equal to the total number 
of nodes in the network. Since the values of R for the three different regions depend on 
the position of the node in the network, only the bounds of the edge phenomenon will be 
analyzed. 

The node closest to the center of the network is the one increasing its number of neighbors 
most aggressively, hence it represents the upper bound. For this node, the second region 
begins when R is greater than S/2 and the third region begins when R is greater than 
$(S/2)\sqrt{2}$. The following equation shows the upper bound $\lceil m \rceil$ for the number of neighbors 
of this node: 

$$\lceil m \rceil = \begin{cases} \pi R^2 \dfrac{n}{S^2}, & 0 < R < \dfrac{S}{2} \\ R^2[\pi - 4\theta + 4\cos(\theta)\sin(\theta)] \dfrac{n}{S^2}, & \dfrac{S}{2} < R < \dfrac{S}{2}\sqrt{2} \\ n, & \dfrac{S}{2}\sqrt{2} < R \end{cases}$$

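where $\theta = \arccos(S/(2R))$. The lower bound is given by nodes located on the corners of the
network. In this case, there is no region 1 as such but rather the number of nodes increases
as $\pi(R^2/4)$ and finishes when R equals S. The second region finishes when R equals $S\sqrt{2}$.
The next equation represents the lower bound for the number of neighbors $\lfloor m \rfloor$, again
expressed through the covered area:

\[ \lfloor m \rfloor \propto \begin{cases}
  \frac{\pi R^2}{4}, & 0 < R < S \\
  R^2\left[\frac{\pi}{4} - \theta + \cos(\theta)\sin(\theta)\right], & S < R < S\sqrt{2} \\
  S^2, & S\sqrt{2} < R
\end{cases} \]

where $\theta = \arccos(S/R)$.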
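
The two area expressions can be checked numerically. The following is a minimal sketch, assuming a square network of side S; it returns the covered area only, so the number of neighbors would follow by multiplying with the node density under an assumed uniform deployment. The function names are illustrative and do not come from the text.

```python
import math

def covered_area_center(R: float, S: float) -> float:
    """Area of a disc of radius R, centred in an S x S square, that lies inside the square."""
    if R <= S / 2:                        # region 1: disc entirely inside the network
        return math.pi * R ** 2
    if R >= (S / 2) * math.sqrt(2):       # region 3: disc covers the whole network
        return S ** 2
    theta = math.acos(S / (2 * R))        # region 2: four circular segments are cut off
    return R ** 2 * (math.pi - 4 * theta + 4 * math.cos(theta) * math.sin(theta))

def covered_area_corner(R: float, S: float) -> float:
    """Same quantity for a node sitting on a corner of the square."""
    if R <= S:                            # quarter circle inside the network
        return math.pi * R ** 2 / 4
    if R >= S * math.sqrt(2):             # whole network covered
        return S ** 2
    theta = math.acos(S / R)
    return R ** 2 * (math.pi / 4 - theta + math.cos(theta) * math.sin(theta))
```

Both functions are continuous at the region boundaries, which is a quick sanity check on the signs inside the brackets.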



14.8.1 Back-off phenomenon 



In CSMA/CA protocols, a node checks if the medium is clear before sending a packet; when 
the medium is clear for a small period of time, the node transmits the packet. If the channel 
becomes busy during this waiting period, it chooses a random time in the future to transmit 
the packet. This mechanism leads to a nonlinear relationship between the contention time
and the number of neighbors, which is referred to as the back-off phenomenon.

By simulating a one-hop network with a varying number of nodes, the nonlinear
relationship between the number of neighbors and the contention time can be numerically
approximated by [137] $f(m) = J\log^3(m)$, where m is the number of neighbors and J is a
constant. If we incorporate the edge and back-off phenomena explained above, we obtain
that the upper bound for the contention time $T_c$ is given by:

\[ \lceil T_c \rceil = \begin{cases}
  K f(R^2), & 0 < R < \frac{S}{2} \\
  L f\{R^2[\pi - 4\theta + 4\cos(\theta)\sin(\theta)]\}, & \frac{S}{2} < R < \frac{S}{2}\sqrt{2} \\
  M, & \frac{S}{2}\sqrt{2} < R
\end{cases} \]

and the lower bound is obtained in the same way from the lower bound on the number of
neighbors. Here K, L and M are constants and f(.) is the function described above. The settling
time is the sum of the reception and contention times. Figure 14.17 shows the analytical
settling time for a network with four hundred nodes. For illustration purposes, the analytical
reception and contention times are also plotted. The settling time curve shows a minimum,
as expected.

Figure 14.17 Analytical curve for the settling time, which is the sum of the reception and
contention times.

14.9 DATA FUNNELING 

In this section, a combination of two methods is discussed which improve the lifetime of 
energy constrained networks by reducing the amount of communication needed to send 
readings from a set of sensors to a controller. The first scheme is a packet aggregation 
technique, which has been already discussed, while the second scheme performs data com- 
pression. While either of the two schemes can be used separately, using them together 
provides maximum gain. 

The main idea behind the algorithm, called data funneling [139], is the following. The 
controller breaks up the space into different regions (e.g. cuboids) and sends interest packets 
to each region, as shown in Figure 14.8(a). Upon receiving the interest packet, each node 
in the region will start periodically sending its readings back to the controller at an interval 
specified in the interest packet, usually every few minutes. Since many or all of the nodes 
within the region will be sending their readings back to the controller at the same time, it 
would be much more efficient to combine these readings into a single packet, so that only 
one packet with only one header travels from the region to the controller. The question
is, how can all these readings be collected at a single point and combined into a single
packet?

As already discussed in Section 14.3.3, the data funneling algorithm works as follows. 
The interest packets are sent toward the region using directional flooding. Each node that 
receives the interest packet checks if it is in the target region. If it is not, it computes its cost 
for communicating back to the controller, updates the cost field within the interest packet, 
and sends it on toward the specified region. This is the directional flooding phase.

When a node that is in the target region receives the interest packet from a neighbor node
that lies outside the target region, the directional flooding phase concludes. The node realizes
that it is on the border of the region and designates itself to be a border node, as shown in
Figure 14.8(b). Each border node computes its cost for communicating with the controller in
the same manner as was done by the nodes outside the region during the directional flooding
phase. It then floods the entire region with a modified version of the interest packet. The
'cost to reach the controller' field is reset to zero and becomes the 'cost to reach the border
node' field. Within the region, each node only keeps track of its cost for communicating
with the border node, not its cost for communicating with the controller. Intuitively, it is as 
if the border node becomes the controller of the specified region. It is at one of the border 
nodes that all the readings from within the region will be collated into a single packet. 

In addition, two new fields are added to the modified interest packet. One field keeps 
track of the number of hops that have been traversed between the border node and the 
node currently processing the packet. The other field specifies the border node's cost for
communicating with the controller, and this field, once defined by the border node, does
not change as the packet travels from one node to another.

Once the nodes within the region receive the modified interest packet from the border 
nodes, they will then route their readings to the controller via each of the border nodes 
in turn. Since there are several border nodes within the region, maximizing aggregation 
of sensor readings requires all the nodes within the region to agree to route their data via 
the same border node during every given round of reporting back to the controller. This is 
accomplished by having every node compute an identical schedule of which border node 
to use during each round of reporting. This is achieved by each node in the region applying 
the same deterministic function to the vector of costs to reach the controller seen by each 
border node. Since all the nodes apply the same function to the same inputs, they will all 
compute the same schedule, allowing them to collect all of their data at one border node 
during each round of reporting. The function used to compute the schedule can be similar to 
the function used to compute the probabilities for selecting different paths in probabilistic 
routing. This allows border nodes with a low cost for communicating to the controller to be 
used more frequently than the ones with a high cost. 
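
The agreement on a common schedule can be illustrated with a short sketch. The text only requires that every node applies the same deterministic function to the same cost vector; the particular weighting used here (weight equal to the inverse of the cost, with ties broken by node ID) is an assumption made for illustration, as are the names `border_schedule`, `costs` and `rounds`.

```python
from typing import Dict, List

def border_schedule(costs: Dict[str, float], rounds: int) -> List[str]:
    """Deterministic choice of one border node per reporting round.

    Every node holding the same cost vector obtains the same list.
    Border nodes with a lower cost to the controller are used more often.
    Costs must be positive. The 1/cost weighting is an assumption of this sketch.
    """
    weights = {node: 1.0 / cost for node, cost in costs.items()}
    used = {node: 0 for node in costs}
    schedule: List[str] = []
    for _ in range(rounds):
        # Choose the node whose next use keeps it closest to its weighted share;
        # iterating over sorted IDs makes the tie-break identical on every node.
        choice = min(sorted(costs), key=lambda n: (used[n] + 1) / weights[n])
        used[choice] += 1
        schedule.append(choice)
    return schedule

# Hypothetical cost vector seen by three border nodes.
example = border_schedule({"b1": 1.0, "b2": 2.0, "b3": 4.0}, rounds=7)
```

Because the function depends only on its inputs, no coordination messages are needed once the modified interest packets have delivered the cost vector to every node in the region.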

As data flows within the region from the sensors to the border nodes it can be aggregated 
along the way, as shown in Figure 14.8(b). When the time comes to send a new round 
of observations back to the controller, the sensor nodes do not immediately start sending 
their packets. Instead, they wait an amount of time inversely proportional to their distance 
(in number of hops) to the border node that will be used in that round of reporting before 
sending their readings toward that border node. This allows the nodes that are far away from 
the border node to send their data earlier than the nodes that are closer to the border node. 
This way, nodes close to the border will first receive the readings from upstream nodes and 
bundle those readings with their own. In the end, all of the data to be sent out by all the 
nodes within the region will be collated at one border node and sent back to the controller 
in a single packet, as shown in Figure 14.8(b). 

If $\alpha$ is the ratio of bits in a packet header to the total number of bits in a packet containing
the header and a single sensor reading for a particular application, and m is the average
number of sensor readings per transmitted packet when data funneling is employed, then the total
energy expended by the network on communication is reduced by $\alpha \times ((m - 1)/m) \times 100\%$
due to data funneling if no compression of the sensor readings is done at the aggregation
points. Performing compression on the sensor readings at the aggregation points within a
region, as discussed in the sequel, would result in even greater energy savings. For this
purpose coding by ordering is used.
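
The saving quoted above can be verified by counting bits directly. The sketch below assumes that communication energy is proportional to the number of transmitted bits, and the header and reading sizes are hypothetical values chosen only for illustration.

```python
def funneling_saving(header_bits: int, reading_bits: int, m: float) -> float:
    """Fractional saving alpha * (m - 1) / m when m readings share one header."""
    alpha = header_bits / (header_bits + reading_bits)
    return alpha * (m - 1) / m

def direct_saving(header_bits: int, reading_bits: int, m: int) -> float:
    """Same saving obtained by counting the bits sent with and without funneling."""
    without = m * (header_bits + reading_bits)      # m separate packets
    with_funnel = header_bits + m * reading_bits    # one aggregated packet
    return 1 - with_funnel / without

# Hypothetical sizes: 64-bit header, 64-bit reading, four readings per packet.
saving = funneling_saving(64, 64, 4)   # alpha = 0.5, so the saving is 0.5 * 3/4
```

With these numbers the header accounts for half of each unaggregated packet, and removing three of every four headers saves 37.5% of the transmitted bits.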

The main idea behind 'coding by ordering' is that, when transmitting many unique pieces 
of data, and the order in which the data is sent is not important to the application (i.e. the 
transmitter may choose the order in which to send those pieces of data), then the choice of 
the order in which those pieces of data are sent can be used to convey additional information 
to the receiver. In fact it is possible to avoid explicitly transmitting some of those pieces of 
data, and use the ordering of the other information to convey the information contained in 
the pieces of data that were not sent. 

Consider the case of the data funneling algorithm. In each round of reporting, the border 
node receives the packets containing sensor readings from n sensors in its region. It then 
places each node's packet (containing the node ID, which may be just the node's position, 
and payload) into a large superpacket containing the data of all the nodes and sends the 
superpacket to the controller. The border node has to include the ID of each node, which 
is unique, along with the node's sensor reading so as to make it clear which payload 
corresponds to which node. Since all of the sensor readings from the region will reach the 
controller at the same time and the ordering of the packets within the superpacket does 
not affect the application, the border node has the freedom to choose the ordering of the 
packets within the superpacket. This allows the border node to choose to 'suppress' some 
of the packets (i.e. choose not to include them in the superpacket), and order the other 
packets within the super-packet in such a way as to indicate the values contained within the 
suppressed packets. 

For example, consider the case when there are four nodes with IDs 1, 2, 3 and 4 in the
region. Each of the four sensors generates an independent reading, which is a value from
the set {0, . . . , 5}. The border node can choose to suppress the packet from node 4 and,
instead, choose the appropriate ordering among the 3! = 6 possible orderings of the packets
from nodes 1, 2 and 3 to indicate the value generated by node 4. Note that in this case the
border node need not encode the ID of the suppressed node because that information can be
recovered from the fact that there are only four nodes and the packets of three of them were
explicitly given in the superpacket. The question is, how many packets can be suppressed?
Let n be the number of packets present at the encoder, k be the range of possible values
generated by each sensor (e.g. if each sensor generates a 4-bit value, then $k = 2^4$), and d be
the range of node IDs of the sensor nodes. Given n, k and d, what is the largest number of
packets, l, that can be suppressed?
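
The four-node example can be made concrete as follows. This is a minimal sketch, assuming that the border node and the controller agree on the lexicographic order of permutations as the mapping between orderings and values; the text does not prescribe a particular mapping, and the function names are illustrative.

```python
from itertools import permutations
from typing import Dict, List, Tuple

Packet = Tuple[int, int]  # (node ID, reading)

def encode(readings: Dict[int, int], suppressed_id: int) -> List[Packet]:
    """Drop one packet and convey its reading through the order of the others."""
    kept_ids = sorted(i for i in readings if i != suppressed_id)
    orderings = list(permutations(kept_ids))      # lexicographic; 3! = 6 for three IDs
    value = readings[suppressed_id]
    if value >= len(orderings):
        raise ValueError("not enough orderings to encode this value")
    return [(i, readings[i]) for i in orderings[value]]

def decode(superpacket: List[Packet], all_ids: List[int]) -> Dict[int, int]:
    """Recover every reading, including the suppressed one, at the controller."""
    order = tuple(i for i, _ in superpacket)
    orderings = list(permutations(sorted(order)))
    readings = dict(superpacket)
    (missing_id,) = set(all_ids) - set(order)     # exactly one ID was suppressed
    readings[missing_id] = orderings.index(order)
    return readings

original = {1: 3, 2: 0, 3: 5, 4: 4}
superpacket = encode(original, suppressed_id=4)
recovered = decode(superpacket, all_ids=[1, 2, 3, 4])
```

The superpacket carries three packets instead of four, and the controller reads the fourth value from the position of the received ordering in the agreed list.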

One strategy is to have the encoder (located at the border node) throw away any l packets
and appropriately order the remaining n - l packets to indicate what values were contained
in the suppressed packets. A total of (n - l)! values can be indexed by ordering n - l distinct
objects. Each of the suppressed packets contains a payload that can take on any of the k
possible values and an ID, which can be any value from the set of d valid IDs except for the
ones that belong to the packets included in the superpacket. The values contained within
the suppressed packets can be regarded as symbols from a $(d - n + l) \times k$-ary alphabet,
giving $(d - n + l)^l \times k^l$ possible values for the suppressed packets. In order for it to be
possible to suppress l out of n packets in this manner, the following relationship must be
satisfied: $(n - l)! \ge (d - n + l)^l k^l$ or, using Stirling's approximation
$n! \approx \sqrt{2\pi n}\,(n/e)^n$, we have

\[ (n - l)[\ln(n - l) - 1] + 0.5\ln[2\pi(n - l)] - l\ln k - l\ln(d - n + l) \ge 0 \]

If this inequality is satisfied, then it is possible to suppress l packets. The suppressed
packets can contain identical values. While their payloads may be identical, each packet
has to have a unique ID. Since each packet has to be identified with a unique ID from
among the d possible IDs, and n - l of the possible IDs are taken up by the transmitted
packets, there are $\binom{d-n+l}{l}$ possible combinations of IDs that the l suppressed packets can
take on; therefore, when enumerating the possible values contained within the suppressed
packets, the $(d - n + l)^l$ term should be replaced by $\binom{d-n+l}{l}$, giving
$(n - l)! \ge \binom{d-n+l}{l} k^l$ as the relationship that must be satisfied in order for it to be
possible to suppress l out of n packets. Again, approximation can be used to convert the
inequality to the following equivalent relationship with more manageable terms:

\[ \ln(2\pi) + l + (l + 0.5)\ln l + (n - l + 0.5)\ln(n - l) + (d - n + 0.5)\ln(d - n) - l\ln k
   - (d - n + l + 0.5)\ln(d - n + l) - n \ge 0 \]

The two schemes presented above assume that the encoder will suppress l packets without
giving much consideration to which l packets are suppressed; however, since the encoder
has the freedom to choose which l packets to suppress, the number of values that may be
indexed by dropping l out of n packets and ordering the remaining n - l packets increases by
a factor of $\binom{n}{l}$. Combining this with the previous condition gives the following relationship,
which must be satisfied if it is to be possible to suppress l out of n packets:

\[ \binom{n}{l}(n - l)! \ge \binom{d-n+l}{l} k^l \]

As before, applying approximation and some manipulation can reduce the inequality to an
equivalent one:

\[ (n + 0.5)\ln n + (d - n + 0.5)\ln(d - n) + 0.5\ln(2\pi) - d - 0.5\ln(d - n + l)
   - (d - n + l)[\ln(d - n + l) - 1] - l\ln k \ge 0 \]

For example, when n = 30, using the low-complexity scheme allows the encoder to suppress
l = 6 packets, a 20% saving in energy spent on transmitting sensor data. The bound on
the number of packets that can be suppressed at n = 30 is 10. As n grows, the savings also
increase. When n = 100, the low-complexity scheme provides 32% savings, the
higher-complexity scheme guarantees 44% savings, while the bound is 53%.
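
The three counting conditions can be evaluated exactly with integer arithmetic instead of the logarithmic approximations. The following is a minimal sketch; the scheme labels `"low"`, `"unique"` and `"choose"` are names introduced here for the three conditions in the order in which they were derived, and the values of k and d in the second example are assumptions, so the outputs are not expected to reproduce the percentages quoted in the text.

```python
from math import comb, factorial

def can_suppress(n: int, k: int, d: int, l: int, scheme: str) -> bool:
    """Check whether l of n packets can be suppressed (requires d >= n)."""
    orderings = factorial(n - l)                 # orderings of the transmitted packets
    if scheme == "low":                          # IDs counted as (d - n + l)^l
        values = (d - n + l) ** l * k ** l
    elif scheme == "unique":                     # suppressed IDs must be distinct
        values = comb(d - n + l, l) * k ** l
    elif scheme == "choose":                     # encoder also picks which l to drop
        orderings *= comb(n, l)
        values = comb(d - n + l, l) * k ** l
    else:
        raise ValueError(f"unknown scheme: {scheme}")
    return orderings >= values

def max_suppressed(n: int, k: int, d: int, scheme: str) -> int:
    """Largest l for which the counting condition holds (l = 0 always does)."""
    return max(l for l in range(n) if can_suppress(n, k, d, l, scheme))

# Four nodes, readings in {0, ..., 5}, four valid IDs: one packet can be dropped.
four_node = max_suppressed(4, 6, 4, "low")

# Hypothetical parameters for a larger region.
results = [max_suppressed(30, 16, 64, s) for s in ("low", "unique", "choose")]
```

Each refinement either enlarges the number of available orderings or reduces the number of values to be indexed, so the number of suppressible packets cannot decrease from one scheme to the next.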



14.10 EQUIVALENT TRANSPORT CONTROL PROTOCOL 
IN SENSOR NETWORKS 

The need for a transport layer for data delivery in WSN can be questioned under the premise 
that data flows from source to sink are generally loss tolerant. While the need for end-to-end 
reliability may not exist due to the sheer amount of correlated data flows, an event in the 
sensor field still needs to be tracked with a certain accuracy at the sink. So, instead of a 
traditional TCP layer, the sensor network paradigm necessitates an event-to-sink reliability 
notion at the transport layer that will be referred to as equivalent TCP (ETCP) [143-148]. 
Such a notion of collective identification of data flows from the event to the sink was 
illustrated earlier in Figure 14.10. An example of ETCP is the event-to-sink reliable transport
(ESRT) protocol for WSN, discussed in Sankarasubramaniam et al. [123]. Some of its
features are:

(1) Self-configuration - ESRT is self-configuring and achieves flexibility under dynamic 
topologies by self-adjusting the operating point. 

(2) Energy awareness - if reliability levels at the sink are found to be in excess of that 
required, the source nodes can conserve energy by reducing their reporting rate. 

(3) Congestion control - required event detection accuracy may be attained even in the 
presence of packet loss due to network congestion. In such cases, however, a suitable 
congestion control mechanism can help conserve energy while maintaining desired 
accuracy levels at the sink. This is done by conservatively reducing the reporting 
rate. 

(4) Collective identification - ESRT does not require individual node IDs for operation. 
This is also in tune with ESRT model rather than the traditional end-to-end model. 
More importantly, this can ease implementation costs and reduce overhead. 

(5) Biased Implementation - the algorithms of ESRT mainly run on the sink with min- 
imum functionalities required at sensor nodes. 

In another example [141, 142], the PSFQ mechanism is used for reliable retask- 
ing/reprogramming in WSN. PSFQ is based on slowly injecting packets into the network, 
but performing aggressive hop-by-hop recovery in case of packet loss. The pump opera- 
tion in PSFQ simply performs controlled flooding and requires each intermediate node to 
create and maintain a data cache to be used for local loss recovery and in-sequence data 
delivery. Although this is an important transport layer solution for WSN, it is applicable 
only for strict sensor-to-sensor reliability and for purposes of control and management in 
the reverse direction from the sink to sensor nodes. Event detection/tracking in the for- 
ward direction does not require guaranteed end-to-end data delivery as in PSFQ. Individual 
data flows are correlated and loss-tolerant to the extent that desired event features are 
collectively and reliably informed to the sink. Hence, the use of PSFQ for the forward 
direction can lead to a waste of valuable resources. In addition to this, PSFQ does not
address packet loss due to congestion. For this reason, in the sequel we elaborate further on
the ESRT algorithm.

The operation of the algorithm is based on the notion of observed event reliability, $r_i$
(the number of received data packets in decision interval i at the sink), and desired event
reliability, R (the number of data packets required for reliable event detection). R depends
on the application. If the observed event reliability, $r_i$, is greater than the desired reliability,
R, then the event is deemed to be reliably detected. Otherwise, appropriate action needs to be
taken to achieve the desired reliability, R. With the above definition, $r_i$ can be computed
by stamping source data packets with an event ID and incrementing the received packet
count at the sink each time the ID is detected in decision interval i. Note that this does not
require individual identification of sensor nodes. Further, we model any increase in source
information about the event features as a corresponding increase in the reporting rate, f, of
sensor nodes. The reporting rate of a sensor node is defined as the number of packets sent
out per unit time by that node. The transport problem in WSN is to configure the reporting
rate, f, of source nodes so as to achieve the required event detection reliability, R, at the
sink with minimum resource utilization.

Figure 14.18 The five characteristic regions in the normalized reliability, $\eta$, vs reporting
frequency, f, behavior. Number of source nodes = 81. (Reproduced by permission
of IEEE [140].)

In order to study the relationship between the observed reliability at the sink, r, and the
reporting frequency, f, of sensor nodes, Sankarasubramaniam et al. [140] developed an
evaluation environment using ns-2. The parameters used in the study are:

Area of sensor field: 100 × 100 m²
Number of sensor nodes: 200
Radio range of a sensor node: 40 m
Packet length: 30 bytes
IFQ length: 65 packets
Transmit power: 0.660 W
Receive power: 0.395 W
Decision interval ($\tau$): 10 s

Event centers $(X_{ev}, Y_{ev})$ were randomly chosen and all sensor nodes within the event radius
behave as sources for that event. Let the desired reliability as laid down by the application
be R. Hence, a normalized measure of reliability is $\eta = r/R$. Parameter $\eta_i$ denotes the
normalized reliability at the end of decision interval i. The results of the above experiment
are shown in Figure 14.18.

The aim is to operate as close to $\eta = 1$ as possible, while utilizing minimum network
resources (f close to $f^*$ in Figure 14.18). We call this the optimal operating point, marked
as $P_1$ in Figure 14.18. For practical purposes, we define a tolerance zone of width $2\varepsilon$ around
$P_1$. Here, $\varepsilon$ is a protocol parameter to be optimized. Although the event is reliably detected
at $P_2$ too, the network is congested and some source data packets are lost. Event reliability
is achieved only because the high reporting frequency of source nodes compensates for this
congestion loss. However, this is a waste of limited energy reserves and hence is not the
operating point of interest. Similar reasoning holds for $\eta > 1 + \varepsilon$.

From Figure 14.18, we identify five characteristic regions (bounded by dotted lines)
using the following decision boundaries:

(NC,LR): $f < f_{\max}$ and $\eta < 1 - \varepsilon$ (no congestion, low reliability)
(NC,HR): $f < f_{\max}$ and $\eta > 1 + \varepsilon$ (no congestion, high reliability)
(C,HR): $f > f_{\max}$ and $\eta > 1$ (congestion, high reliability)
(C,LR): $f > f_{\max}$ and $\eta \le 1$ (congestion, low reliability)
OOR: $f < f_{\max}$ and $1 - \varepsilon \le \eta \le 1 + \varepsilon$ (optimal operating region)
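
The decision boundaries translate directly into a small classification routine at the sink. The sketch below assumes that the comparison of f with $f_{\max}$ is available as a boolean congestion indication, and it assigns the boundary case of a congested network with normalized reliability exactly 1 to (C,LR); both choices, and the function name, are assumptions made for illustration.

```python
def esrt_state(eta: float, congested: bool, eps: float) -> str:
    """Map the normalized reliability and a congestion flag to a network state."""
    if congested:                       # corresponds to f > f_max
        # Boundary eta == 1 is treated as low reliability (assumption).
        return "(C,HR)" if eta > 1 else "(C,LR)"
    if eta < 1 - eps:
        return "(NC,LR)"
    if eta > 1 + eps:
        return "(NC,HR)"
    return "OOR"                        # within the tolerance zone around eta = 1
```

Note that the tolerance zone applies only to the uncongested case; under congestion the state is decided by comparing the normalized reliability with 1 alone.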

As seen earlier, the sink derives a reliability indicator $\eta_i$ at the end of decision interval
i. Coupled with a congestion detection mechanism (to determine $f_{\max}$), this can help
the sink determine in which of the above regions the network currently resides. Hence,
these characteristic regions identify the state of the network. Let $S_i$ denote the network state
variable at the end of decision interval i. Then,

\[ S_i \in \{(NC,LR),\ (NC,HR),\ (C,HR),\ (C,LR),\ OOR\} \]

The operation of ESRT is closely tied to the current network state $S_i$. The ESRT protocol
state model and transitions are shown in Figure 14.19. ESRT identifies the current state $S_i$
from: (1) the reliability indicator $\eta_i$ computed by the sink for decision interval i; (2) a congestion
detection mechanism, using the decision boundaries as defined above. Depending on the
current state and the values of $f_i$ and $\eta_i$, ESRT then calculates the updated reporting
frequency $f_{i+1}$ to be broadcast to the source nodes. At the end of the next decision interval,
the sink derives a new reliability indicator $\eta_{i+1}$ corresponding to the updated reporting
frequency $f_{i+1}$ of source nodes. In conjunction with any congestion reports, ESRT then
determines the new network state $S_{i+1}$. This process is repeated until the optimal operating
region (state OOR) is reached.

Figure 14.19 ESRT protocol state model and transitions. (Reproduced by permission of
IEEE [27].)

The following reporting rate updating rules are used [140]:



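(NC,LR)   $f_{i+1} = \dfrac{f_i}{\eta_i}$

(NC,HR)   $f_{i+1} = \dfrac{f_i}{2}\left(1 + \dfrac{1}{\eta_i}\right)$

(C,HR)    $f_{i+1} = \dfrac{f_i}{\eta_i}$

(C,LR)    $f_{i+1} = f_i^{(\eta_i/k)}$

where k denotes the number of successive decision intervals for which the network has
remained in state (C,LR), including the current decision interval.

OOR       $f_{i+1} = f_i$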
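
One step of the rate adaptation at the sink can be sketched as follows. The rule forms used here are those commonly stated for ESRT and are taken as assumptions of this sketch rather than as a verified transcription of [140]; the function name and the string labels for the states are illustrative.

```python
def esrt_next_rate(state: str, f: float, eta: float, k: int = 1) -> float:
    """Return the reporting rate f_{i+1} to broadcast for the next decision interval.

    f   : current reporting rate f_i
    eta : normalized reliability eta_i observed in the current interval
    k   : number of successive intervals spent in (C,LR), including this one
    """
    if state == "(NC,LR)":
        return f / eta                   # reliability too low: raise the rate
    if state == "(NC,HR)":
        return (f / 2) * (1 + 1 / eta)   # reliability too high: lower the rate cautiously
    if state == "(C,HR)":
        return f / eta                   # congested but reliable: lower the rate
    if state == "(C,LR)":
        return f ** (eta / k)            # congested and unreliable: cut the rate sharply
    if state == "OOR":
        return f                         # optimal operating region: keep the rate
    raise ValueError(f"unknown state: {state}")
```

In state (NC,HR) the new rate is the average of the current rate and the rate that would bring the normalized reliability to 1, which explains why the decrease is described as conservative.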

In order to determine the current network state $S_i$ in ESRT, the sink must be able to detect
congestion in the network. ESRT uses a congestion detection mechanism based on local
buffer level monitoring in sensor nodes. Any sensor node whose routing buffer overflows
due to excessive incoming packets is said to be congested and it informs the sink of the
same. For more on system performance see Sankarasubramaniam et al. [140].



Security



A necessary component of network security is the ability to reliably authenticate commu- 
nication partners and other network entities. For this reason we will start this chapter by 
discussing authentication protocols [1-19]. The focus will be on a systematic approach to 
the design of the protocols rather than describing a specific protocol used in practice. The
following section will discuss the security architectures. Principles of key distribution will
be covered in the third section followed up by some specific solutions in ad hoc networks, 
sensor networks, GSM and UMTS systems. 

15.1 AUTHENTICATION 

Many designs dealing with authentication in networks or distributed systems combine the 
issues of authentication with those of key distribution. These designs typically assume that 
all network parties share a key with a common trusted entity, a key distribution center 
(KDC), from which they can get pair-wise shared keys to carry out mutual authentication 
protocols. These protocols are called three-party authentication protocols, and have been 
studied extensively [5, 6, 10-12, 15, 16]. Most of the corresponding implementations [10] 
require the exchange of long messages, which is possible for application layer protocols, but 
makes them unsuitable for use in lower-layer networking protocols where limited packet 
sizes are an important consideration. Some require synchronized clocks or counters [10] 
that pose system management and initialization issues, as will be discussed below. 

Two-party authentication protocols are used in many networks. Some of these use public 
key cryptography [1-3, 18]. With a public key cryptographic system, each party only has 
to know and verify the public key of the other party, and there is no need to share secret 
keys. The authentication method most widely used in network environments today consists 
of asking users to prove their identity by demonstrating knowledge of a secret they know, 
a password. This widespread but old technique has several weaknesses.

In most password systems, the password typed by a user is sent in cleartext over the
network to the computer that requested it. This means that an intruder can spy on passing 
traffic to collect passwords. Since users need to memorize their passwords, they typically 
select passwords that they can easily remember. So, such passwords are selected from within 
a relatively small vocabulary, and are thus easily guessed by potential intruders [4]. 

Of all authentication techniques that circumvent the drawbacks of passwords, the most 
promising ones are those using cryptographic means. With such techniques, users are typ- 
ically provided with smart cards or chip cards equipped with a processor capable of cryp- 
tographic operations. Authentication is based upon using the card to compute or verify 
cryptographic messages exchanged with the computer. 

Cryptographic authentication consists of challenging the user or communicating party 
being authenticated to prove its identity by demonstrating its ability to encipher or decipher 
some item with a secret or private key known to be stored inside the smart card. Of course,
since the secret or private key stored on the card or secure device changes infrequently,
the item to be enciphered or deciphered with it must change for every execution of the
authentication protocol. Otherwise, even though the secret never flows in cleartext over the
network, an intruder could still tap a line, record the cryptic message that flows over it, and
play that recording back at a later time without even knowing what it means.

To guarantee that the item that gets enciphered (or deciphered), called the challenge, is 
different for every execution of the authentication protocol, three techniques are used. The 
challenge may be derived either from a real-time clock reading, in which case it is called 
a timestamp, from a counter that is incremented for every protocol execution, or from a 
random number generator, in which case it is called a nonce. In any case, a new challenge 
(timestamp, counter value or nonce) is generated for each protocol run. 

With timestamps, the user being authenticated, A, enciphers the current reading of its
clock and sends the result to the party requesting the authentication, B. B then deciphers the
received message, and verifies that the timestamp corresponds to the current real time. The
drawback of timestamps is thus immediately apparent. A and B must have synchronized 
real-time clocks for the verification to be possible. However, since clocks can never be 
perfectly synchronized and messages take time to travel across networks anyway, B cannot 
expect that the deciphered timestamp received from A will ever be equal to its own real-time 
clock reading. A's time-stamp at best can (and normally should) be within some limited 
time window of B's real-time clock. However, as soon as a time window of tolerance is 
defined, a potential intruder could exploit it to impersonate A by replaying one of A's 
recent authentication messages within that time window. Preventing this requires putting a 
limit to the number of authentication protocol runs allowable during the time window, and 
having B save all the authentication messages within the window. Thus, both efficiency 
and security require pretty good clock synchronization. Achieving such synchronization is 
often difficult, and making it secure would itself require using some other authentication 
method not depending on time. 
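
The bookkeeping that B has to perform with timestamps can be sketched as follows. Since the Python standard library provides no cipher, this sketch replaces the encipherment of the timestamp by an HMAC-SHA256 tag sent together with the timestamp; this is a substitution made for illustration, not the scheme described above. The window length and all names are assumptions.

```python
import hashlib
import hmac

WINDOW = 5.0  # tolerated clock skew plus network delay, in seconds (assumed value)

def make_proof(key: bytes, now: float) -> tuple:
    """A's message: its clock reading and a keyed tag over it."""
    ts = repr(now).encode()
    return ts, hmac.new(key, ts, hashlib.sha256).digest()

class Verifier:
    """B's side: checks freshness and remembers messages seen inside the window."""

    def __init__(self, key: bytes):
        self.key = key
        self.seen = {}  # tag -> timestamp of messages accepted within the window

    def accept(self, ts: bytes, tag: bytes, now: float) -> bool:
        expected = hmac.new(self.key, ts, hashlib.sha256).digest()
        if not hmac.compare_digest(tag, expected):
            return False                              # not produced with the shared key
        t = float(ts.decode())
        if abs(now - t) > WINDOW:
            return False                              # outside the tolerance window
        # Messages older than the window would be rejected anyway, so forget them.
        self.seen = {m: s for m, s in self.seen.items() if abs(now - s) <= WINDOW}
        if tag in self.seen:
            return False                              # replay inside the window
        self.seen[tag] = t
        return True
```

The `seen` table is exactly the state the text refers to: the wider the window, the more messages B must store, which is why loose clock synchronization costs both memory and security.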

With counting challenges, A and B maintain synchronized counters of the number of 
times they have authenticated one another. Every time A wants to communicate with B, it 
enciphers its counter and sends the result to B, who deciphers it and verifies that it matches 
its own counter value, whereupon both parties increment their respective counters for the 
next time. The drawback of such counting challenges is that both parties must maintain 
synchronized counters, which poses stable storage and counter management problems in 
the long run. The counters must be long enough to prevent an attacker from waiting for
a (deterministic) counter wraparound; using counters complicates resolution of conflicts
when both parties want to initiate protocol executions at the same time; after detecting loss 
of synchronization between two counters, some other method of authentication must be 
used to securely resynchronize them; and an undetected error in the counter values may 
have catastrophic effects. 

Therefore, both timestamps and counter techniques, while useful, are not a complete 
solution, especially for protocols in the lower layers of a network architecture. The best 
technique for this purpose, and also the simplest one to implement, consists of using nonces. 
The price for this simplicity is an extra network message. While A can authenticate itself 
to B with a single message if timestamps or counters are used, two messages are necessary 
with nonces. Indeed, it is B and not A who needs to be sure that the nonce has never been 
used before. Thus, it is B and not A who must generate it. B must encipher it and send it 
to A, then A must decipher it and send it back in cleartext for B to authenticate it, which 
costs a total of two messages. The cost of one more message is usually tolerable, especially 
since the extra message can often be piggy-backed onto regular traffic or its cost can be 
amortized over many subsequent messages authenticated with sequence counters. Given 
these advantages of nonces over timestamps and counters, the rest of this section focuses 
on nonce-based authentication protocols. 

Figure 15.1 is a simple representation of the nonce-based one-way authentication proto- 
col, where N is a nonce generated by B and Ea(N) is its value enciphered under some key 
Ka that B knows is associated with A. It may be a secret key shared between A and B or 
the public key of A. The protocol allows B to authenticate A. A number of modifications 
are shown in the same figure. 




Figure 15.1 Authentication protocols: (a) one-way authentication of A; (b) one-way au- 
thentication of B; (c) two-way authentication; and (d) two-way authentication 
with three messages. 
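
The one-way exchange of Figure 15.1(a) can be sketched in a few lines of Python. This is only an illustrative sketch: the toy 64 b Feistel cipher built from HMAC-SHA256 merely stands in for whatever cipher E is actually used, and the names encipher, decipher, Verifier and Claimant are assumptions made for the example, not part of the protocol description.

```python
import hashlib
import hmac
import secrets


def _f(key: bytes, rnd: int, half: bytes) -> bytes:
    """Round function of a toy Feistel cipher (illustration only, not a vetted cipher)."""
    return hmac.new(key, bytes([rnd]) + half, hashlib.sha256).digest()[:4]


def _xor(a: bytes, b: bytes) -> bytes:
    return bytes(x ^ y for x, y in zip(a, b))


def encipher(key: bytes, block: bytes) -> bytes:
    """Stand-in for E: maps one 8-byte block to one 8-byte block."""
    left, right = block[:4], block[4:]
    for rnd in range(4):
        left, right = right, _xor(left, _f(key, rnd, right))
    return left + right


def decipher(key: bytes, block: bytes) -> bytes:
    """Inverse of encipher under the same key."""
    left, right = block[:4], block[4:]
    for rnd in reversed(range(4)):
        left, right = _xor(right, _f(key, rnd, left)), left
    return left + right


class Verifier:
    """Plays B: it generates the nonce, so it knows the nonce is fresh."""

    def __init__(self, key_a: bytes):
        self.key_a = key_a
        self.pending = None

    def challenge(self) -> bytes:
        self.pending = secrets.token_bytes(8)
        return encipher(self.key_a, self.pending)  # message 1: Ea(N)

    def verify(self, response: bytes) -> bool:
        ok = self.pending is not None and hmac.compare_digest(response, self.pending)
        self.pending = None  # each nonce is accepted at most once
        return ok


class Claimant:
    """Plays A: it proves knowledge of the key by returning N in cleartext."""

    def __init__(self, key_a: bytes):
        self.key_a = key_a

    def respond(self, challenge: bytes) -> bytes:
        return decipher(self.key_a, challenge)  # message 2: N
```

Because the verifier discards the nonce after one check, a recorded response cannot be replayed against a later challenge.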

The simple and elegant two-way authentication protocol, which was derived most naturally
by combining two instances of the simplest possible one-way protocol, presents a
number of undesirable characteristics. In particular, we will show that an intruder may break 
this protocol by interleaving messages from different executions of it, if the same shared 
key is used by both parties or if the same key is always used by the party who starts the 
protocol. One could still argue that the protocol is safe if used with four different keys; 
however, this creates additional overhead and key management issues, as well as migration 
problems for existing designs. 

15.1.1 Attacks on simple cryptographic authentication 

The simple two-way authentication protocol illustrated in Figure 15.1(d) suffers from a 
number of defects. 

15.1.1.1 Known plaintext attacks

A first weakness of the protocol is its openness to known plaintext attacks. Every enciphered 
message flowing between A and B is the ciphertext of a bit string (the nonce) that flows 
in plaintext in a subsequent message between A and B. This enables a passive wiretapping 
intruder to collect two cleartext-ciphertext pairs every time the protocol is run, which at 
the very least helps it accumulate encryption tables in the long run, and may even help it 
break the scheme and discover the encryption key, depending on the quality of the encryption 
algorithm used. It is in general desirable that the plaintext of exchanged enciphered messages 
not be known or derivable by intruders. 

15.1.1.2 Chosen ciphertext attacks

A potential intruder may even turn known plaintext attacks into selected text attacks by 
playing an active instead of passive role. Pretending that it is A or B, an intruder may 
send the other party (B or A) a ciphertext message that it selected itself and wait for that 
other party to reply with the deciphered value of that text. Of course, the intruder, not 
knowing the right key, may not be able to complete the third protocol flow. However, it can 
accumulate knowledge about cleartext-ciphertext pairs of which it selected the ciphertext 
itself (or the cleartext, if the challenge was enciphered instead of deciphered). So it may try
specific ciphertext strings such as all zeros, all ones, or whatever else might help it break 
the key faster, depending on the cryptographic method used. It is in general desirable that 
an intruder not be able to trick legitimate parties into generating deciphered versions of 
selected ciphertext messages (or enciphered versions of selected cleartext). 

15.1.1.3 Oracle session attacks

If A and B use the same key in the simple protocol suggested earlier, the intruder actually
can break the authentication without even breaking the key. This is illustrated in Figure
15.2, where the intruder, noted X and posing as A, starts a session with B by sending
B some enciphered nonce E(N1). B replies with the deciphered value N1 and its own
enciphered challenge E(N2). X cannot decipher N2, but it can take advantage of a selected
ciphertext attack on A, using A as an oracle who will provide the necessary deciphered
value N2. X accomplishes this by posing now as B to start a separate 'oracle' session with
A, sending E(N2) as the initial message on that session. A will reply with the needed value
N2 and its own enciphered nonce E(N3). X then drops the oracle session with A, since
it cannot decipher N3 and has no need to, but it then turns around and successfully
assumes its faked identity A with respect to B by sending N2 to B. This may be prevented
by modifying the protocol as shown in Figure 15.3. This is an ISO SC27 protocol [19].

    X -> B :  E(N1)
    B -> X :  N1, E(N2)
    X -> A :  E(N2)
    A -> X :  N2, E(N3)
    X -> B :  N2
    X abandons the session with A

Figure 15.2 An oracle session attack.

Figure 15.3 Improved two-way protocol.
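
The oracle session attack can be reproduced in a short simulation. The sketch below is a minimal illustration under stated assumptions: the toy Feistel cipher is only a stand-in for E, and the Party class and its method names are hypothetical. The intruder function never touches a key; it only relays messages between two sessions.

```python
import hashlib
import hmac
import secrets


def _f(key, rnd, half):
    """Round function of a toy Feistel cipher (illustration only)."""
    return hmac.new(key, bytes([rnd]) + half, hashlib.sha256).digest()[:4]


def _xor(a, b):
    return bytes(x ^ y for x, y in zip(a, b))


def encipher(key, block):
    left, right = block[:4], block[4:]
    for rnd in range(4):
        left, right = right, _xor(left, _f(key, rnd, right))
    return left + right


def decipher(key, block):
    left, right = block[:4], block[4:]
    for rnd in reversed(range(4)):
        left, right = _xor(right, _f(key, rnd, left)), left
    return left + right


class Party:
    """Responder side of the simple two-way protocol of Figure 15.1(d)."""

    def __init__(self, key):
        self.key = key
        self.expected = None

    def on_first_flow(self, enc_challenge):
        # Decipher the peer's challenge and issue an enciphered challenge in return.
        answer = decipher(self.key, enc_challenge)
        self.expected = secrets.token_bytes(8)
        return answer, encipher(self.key, self.expected)

    def on_third_flow(self, answer):
        # The peer is accepted if it returns our nonce in cleartext.
        return self.expected is not None and hmac.compare_digest(answer, self.expected)


def oracle_session_attack(a, b):
    """Intruder X, knowing no key, tries to be accepted by B as if it were A."""
    fake = secrets.token_bytes(8)              # any block will do as "E(N1)"
    _n1, enc_n2 = b.on_first_flow(fake)        # X (as A) -> B, B -> X
    n2, _enc_n3 = a.on_first_flow(enc_n2)      # oracle session: X (as B) -> A, A -> X
    return b.on_third_flow(n2)                 # X drops the oracle session and answers B
```

With one shared key the function returns True, i.e. B accepts the intruder. If A and B decipher under different keys, the value A returns is no longer N2 and the attack fails.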

15.1.1.4 Parallel session attacks

Another defect commonly found in simple protocols, such as those seen above, is depicted in
Figure 15.4, where the intruder assumes a passive role instead of an active one. It intercepts
a call from A to B with challenge N1 and, since it cannot answer the challenge N1 by
replying E(N1), it simply pretends that it is B trying to start a parallel session with A.
Of course, it chooses as the first challenge of the parallel session the same N1 that A used
on the original session, thus causing A to provide it with precisely the answer E(N1) needed
to complete the first authentication. The remaining communication shown in the figure will
complete the second authentication as well.

Figure 15.4 A parallel session attack.

Figure 15.5 An asymmetric two-way authentication.
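
The parallel session attack can also be simulated. In this sketch, offered only as an illustration, E is modelled by a truncated HMAC (an assumption: the attack only compares values of E and never deciphers them), and the class and method names are hypothetical. The intruder holds no key and simply reflects A's own messages back to it.

```python
import hashlib
import hmac
import secrets


def E(key: bytes, nonce: bytes) -> bytes:
    """Stand-in for the cryptographic expression E(N); 64 b keyed function."""
    return hmac.new(key, nonce, hashlib.sha256).digest()[:8]


class Party:
    """One party able to act as initiator and as responder at the same time."""

    def __init__(self, key: bytes):
        self.key = key
        self.sent = None     # challenge issued as initiator
        self.issued = None   # challenge issued as responder

    def initiate(self) -> bytes:
        self.sent = secrets.token_bytes(8)
        return self.sent                                  # flow 1: N1

    def respond(self, n1: bytes):
        self.issued = secrets.token_bytes(8)
        return E(self.key, n1), self.issued               # flow 2: E(N1), N2

    def finish_as_initiator(self, enc_n1: bytes, n2: bytes):
        if not hmac.compare_digest(enc_n1, E(self.key, self.sent)):
            return None                                   # peer not authenticated
        return E(self.key, n2)                            # flow 3: E(N2)

    def finish_as_responder(self, enc_n2: bytes) -> bool:
        return hmac.compare_digest(enc_n2, E(self.key, self.issued))


def parallel_session_attack(a: Party):
    """Intruder X intercepts A's call to B and reflects it in a parallel session."""
    n1 = a.initiate()                                # original session: A -> X (as B)
    enc_n1, n2 = a.respond(n1)                       # parallel session: X (as B) -> A, same N1
    flow3 = a.finish_as_initiator(enc_n1, n2)        # X answers the original challenge
    first_ok = flow3 is not None
    second_ok = first_ok and a.finish_as_responder(flow3)  # and closes the parallel session
    return first_ok, second_ok
```

Both authentications succeed from A's point of view although B never took part, which is why the expression in the second flow has to depend on the direction of the run.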

Parallel session attacks illustrate another fundamental flaw of many simple authentication 
protocols: the cryptographic expression used in the second message must be asymmetric 
(i.e. direction-dependent) so that its value in a protocol run initiated by A cannot be used in 
a protocol run initiated by B. Based on the above observations, i.e. that the cryptographic 
message in the second flow must be asymmetric (direction-dependent) and different from 
the one in the third flow, one may think that a protocol such as the one in Figure 15.5 should 
be secure. Unfortunately, in addition to the fact that known- and selected-text attacks
are still possible, the problem now is that the simple function XOR has not fixed anything 
since an intruder can still resort to a parallel session attack. It merely needs to use proper 
offsetting in addition, as illustrated in Figure 15.6. 

Figure 15.6 An offset attack (through a parallel session).

The simple and apparently sound protocol of Figure 15.1(d) is thus broken in many ways.
In the following, oracle-session attacks, parallel-session attacks, offset attacks and/or other
types or combinations of attacks that involve replaying messages and functions of challenges
observed in other runs of the protocol will be collectively referred to as interleaving attacks.
Based on the above observations we can specify the following design requirements for
authentication protocols:

(1) nonce-based (avoid synchronization and stable counter storage); 

(2) resistant to common attacks; 

(3) usable at any layer of any network architecture (small messages); 

(4) usable on any processing base (few computations); 

(5) using any cryptographic algorithm, either symmetric ones, such as DES [20] or 
asymmetric ones such as RSA [7]; 

(6) exportable - the chance to receive proper licensing for a technology is larger if it 
provides only message authentication code (MAC) functions and does not require 
full-fledged encryption and decryption of large messages; 

(7) extendable - it should support the sharing of secret keys between multiple users and 
should allow additional fields to be carried in the messages, which would thus be 
authenticated as part of the protocol. 



15.1.2 Canonical authentication protocol 

A number of protocols meeting the above requirements have been developed [13,14]. Some
of these protocols are summarized in Figure 15.7. Given that the focus is on authentication
protocols using nonces, the most general canonical form for all such protocols is depicted
in Figure 15.7 (P1). Here, A sends some nonce N1 to B. B authenticates itself by sending
back a function u( ) of several parameters, including at least its secret K1 and the challenge
N1 it received from A. It also sends a challenge N2 to A. A finally completes the protocol
by authenticating itself with a function v( ) of several parameters including at least its secret
K2 and the challenge N2 from B. As indicated in the earlier section on attacks, in order for
this protocol to resist oracle attacks, the functions u( ) and v( ) must be different from one
another, meaning that a cryptographic message from flow (2) can never be used to derive
the necessary message for flow (3).

Figure 15.7 P1, resistance to replay attacks through use of nonces; P2, use of symmetric
cryptography; P3, resistance to parallel session attacks; P4, resistance to
selected-text and offset attacks; P5, minimal number of encryptions.

The protocols must work with either symmetric or asymmetric cryptographic systems. 
With the latter, the secrets K1 and K2 would be the private keys of B and A, respectively.
However, with symmetric systems, K1 and K2 would be shared secret keys. In fact, in
many typical scenarios, they would be the same shared secret key. In the sequel we will 
continue under that assumption because it opens the door to attacks that would not be 
possible otherwise, yet it has advantages of efficiency and ease of migration of existing 
systems over solutions which use different keys for A and B. Any protocol that resists 
interleaving attacks using equal symmetric keys is also safe with asymmetric ones while 
the opposite is not true. 

Under the assumption of symmetric cryptography with a single shared secret key K,
protocol P1 becomes as depicted in Figure 15.7 (P2), where the functions u( ) and v( )
are represented by operations E( ) under the same key K of two different functions p( )
and q( ) of the remaining parameters. As indicated earlier in the section on attacks, the
prevention of parallel session attacks suggests that function p( ) must be asymmetric, i.e. 
direction-dependent. In other words, the arguments to p( ) must be different depending on 
whether A or B starts the communication session. This is depicted in Figure 15.7 (P3) by 
the addition of a parameter D in p( ), which stands for anything indicating or tied to the 
direction of the flow (e.g. the name or address of one of the parties). 

As shown in Figure 15.7 (P3), the complete protocol P3 requires three messages. In 
any networking environment, these messages need not travel in packets of their own. In 
practice, they can and typically would be piggy-backed onto other network packets, such
as, for instance, connection requests and confirmations at whatever layer entities are being 
authenticated (e.g. link, network, transport, application, etc.). Within packets at any such 
layer, the authentication fields should take as little space as possible. As it stands presently, 
the canonical protocol requires a nonce to be carried on the first flow, a nonce and a 
cryptographic expression on the second one, and a cryptographic expression on the third 
one. The cleartext of the directional parameter, e.g. party name or address, is implicitly 
known or already present in the enclosing packet headers. The required size of nonces is 
given by the amount of security desired: the larger they are, the lower the probability will 
be that a given nonce gets reused. The size of the cryptographic expressions depends on the 
encryption algorithm used and on the amount of information being encrypted. 

As an example, if the DES is used, the smallest cryptographic expression will be 64 b
(key 56 b) long. Since the security of the whole authentication protocol rests on the
unforgeability of the cryptographic expressions, if expressions of 64 b are used, the statistical
security, ss, of the expressions is ss = 1/P(expression) = 2^64, the inverse of the probability
that an intruder can successfully guess a cryptographic expression. For many environments,
such a degree of security is already quite satisfactory, although longer cryptographic
expressions are already being used. Triple DEA (ANSI X9.17, DEA 1999) uses three keys
and three executions of DEA with effective key length 112 and 168 b. The advanced encryption
standard (AES, 1997) uses blocks of length 128 b and key lengths 128, 192 and 256 b. If
64 b nonces are used, p( ) and q( ) may be restricted to 64 b functions of their parameters
without compromising the degree of security. In what follows we make this simplifying assumption.

Restricting p( ) and q( ) to 64 b functions to limit message size has its own implications.
Many simple and attractive 64 b functions of only N1 and D (e.g. XOR, rotations,
conditional permutations, etc.) could be subjected to selected-plaintext and offset attacks
by an intruder. By including inside function p( ) an additional internal encrypted function
of the same parameters N1 and D, one can separate these parameters cryptographically
and thus bar the way to offset attacks. By including N2 inside function p( ), one bars the
way to a potential intruder using B as an oracle to get some selected plaintext enciphered
in the second flow because the cleartext of that flow then depends on nonce N2 which is
not under the control of the intruder initiating a protocol run. These further conditions on
p( ) are represented in Figure 15.7 (P4), where p( ) would thus be a 64 b function (noted #)
of two operands, which themselves include functions f( ) and g( ) of N1, N2 and D.

The different fields in the message should be cryptographically separated, so that an
attacker cannot control one field through another field. This separation should
be ensured by an appropriate selection of the functions f( ) and g( ). We will later give
exact conditions on f( ) and g( ) to prevent interleaving attacks, and give specific examples
for these functions.

Any peer authentication protocol requires at least two cryptographic expressions. In
Figure 15.7 (P4) a third cryptographic operation is used to protect against offset attacks.
To keep the protocol simple, by putting additional conditions on q( ), we can return to
a protocol requiring only two cryptographic block operations. This can be achieved by
imposing that the functions g( ) and q( ) actually be the same, as represented in Figure 15.7
(P5). Indeed, in this case, the innermost cryptographic expression required to produce or
verify the second flow is the same as the cryptographic expression of the third flow, and can
thus be computed only once and saved to minimize computation.
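
The structure of P5 can be made concrete with a short sketch. Everything specific in it is an assumption chosen for illustration: E is modelled by a truncated HMAC (a MAC-style function in the spirit of requirement (6)), the combining operation # is taken to be XOR, and f( ) and g( ) are placeholder choices rather than the functions specified later in the text. No claim of attack resistance is made for these placeholders; the sketch only shows how the inner expression is computed once and reused as the third flow.

```python
import hashlib
import hmac
import secrets


def E(key: bytes, data: bytes) -> bytes:
    """Stand-in for the 64 b cryptographic operation E( ) under key K."""
    return hmac.new(key, data, hashlib.sha256).digest()[:8]


def xor(a: bytes, b: bytes) -> bytes:
    return bytes(x ^ y for x, y in zip(a, b))


def f(n1: bytes, n2: bytes, d: bytes) -> bytes:
    return n1                      # placeholder choice for f( )


def g(n1: bytes, n2: bytes, d: bytes) -> bytes:
    return n2 + d                  # placeholder choice for g( ), same as q( ) in P5


def inner(key: bytes, n1: bytes, n2: bytes, d: bytes) -> bytes:
    """Innermost expression E(g(...)); it is also the whole third flow."""
    return E(key, g(n1, n2, d))


def second_flow(key: bytes, n1: bytes, n2: bytes, d: bytes) -> bytes:
    """E(f(...) # E(g(...))), with # taken to be XOR in this sketch."""
    return E(key, xor(f(n1, n2, d), inner(key, n1, n2, d)))


def run_p5(key_a: bytes, key_b: bytes, d: bytes = b"B"):
    """One protocol run; returns (A accepts B, B accepts A)."""
    n1 = secrets.token_bytes(8)                  # flow 1, A -> B: N1
    n2 = secrets.token_bytes(8)                  # nonce chosen by B
    msg2 = second_flow(key_b, n1, n2, d)         # flow 2, B -> A: N2, msg2
    saved = inner(key_a, n1, n2, d)              # A computes the inner expression once
    a_accepts_b = hmac.compare_digest(msg2, E(key_a, xor(f(n1, n2, d), saved)))
    msg3 = saved                                 # flow 3, A -> B: reuse the saved value
    b_accepts_a = hmac.compare_digest(msg3, inner(key_b, n1, n2, d))
    return a_accepts_b, b_accepts_a
```

Each party performs only two invocations of E per run, which is the saving that P5 is meant to achieve over P4.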

(a)
Key generation
    Select p, q (p, q prime)
    Calculate n = pq
    Calculate φ(n) = (p - 1)(q - 1)
    Select integer e (gcd(φ(n), e) = 1, 1 < e < φ(n))
    Calculate d (d = e^-1 mod φ(n))
    Public key KU = [e, n]
    Private key KR = [d, n]
Encryption
    Plaintext M < n
    Ciphertext C = M^e (mod n)
Decryption
    Ciphertext C
    Plaintext M = C^d (mod n)

(b)
    p = 17
    q = 11
    n = pq = 187
    φ(n) = (p - 1)(q - 1) = 160
    e = 7
    d = 23 (de mod 160 = 1)
    KU = [7, 187]
    KR = [23, 187]
    M = 88
    Ciphertext C = 88^7 mod 187 = 11
    Plaintext M = 11^23 mod 187 = 88

Figure 15.8 (a) Operation of a public key encryption system; (b) numerical example of
operation of a public key encryption system.



Finally, it may be instructive at this point to describe the operation of a public key
encryption system used for authentication. In this case a user generates a pair of keys and
places one key in the public domain. To send a message to that user, a sender encrypts it
using the public key, and the user decrypts it using the private key, as illustrated in
Figure 15.8(a); a numerical example is given in Figure 15.8(b).
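
The numerical example of Figure 15.8(b) can be checked directly. The following sketch uses only Python built-ins (the three-argument pow with a negative exponent requires Python 3.8 or later); the function names rsa_keypair and rsa_apply are chosen for this example, and the tiny primes are for illustration only.

```python
from math import gcd


def rsa_keypair(p: int, q: int, e: int):
    """Return (public key, private key) as ([e, n], [d, n]) pairs of integers."""
    n = p * q
    phi = (p - 1) * (q - 1)
    if not 1 < e < phi or gcd(phi, e) != 1:
        raise ValueError("e must satisfy 1 < e < phi(n) and gcd(phi(n), e) = 1")
    d = pow(e, -1, phi)            # d = e^-1 mod phi(n)
    return (e, n), (d, n)


def rsa_apply(key, value: int) -> int:
    """Compute value^exponent mod n; used for both encryption and decryption."""
    exponent, n = key
    if not 0 <= value < n:
        raise ValueError("value must be smaller than n")
    return pow(value, exponent, n)


public, private = rsa_keypair(17, 11, 7)    # KU = [7, 187], KR = [23, 187]
ciphertext = rsa_apply(public, 88)          # 88^7 mod 187 = 11
plaintext = rsa_apply(private, ciphertext)  # 11^23 mod 187 = 88
```

Applying the private key first and the public key second gives the same round trip, which is the property exploited when the key pair is used for authentication rather than confidentiality.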

15.2 SECURITY ARCHITECTURE

Cryptography provides only two fundamental services, namely, confidentiality and authen- 
tication. All communications security services are concerned with the identity of the senders 
or recipients of information. Cryptography allows users to be identified by allocating secret 
keys to them. There are only two ways that cryptography can work, and these define the 
fundamental services [22, 23]: 

• Confidentiality - only the user (or set of users) in possession of the secret key can read 
the message; 

• Authentication - only the user (or set of users) in possession of the secret key can write 
the message. 

A secure channel can be thought of as a relationship between two system users which 
provides some security services. 

In this section, two basic types of secure channel are considered, namely, confidentiality 
channels and authentication channels. In addition, we define symmetric channels, each of 
which may coincide with a confidentiality channel, an authentication channel or both. 

In a conventional symmetric cipher, a pair of users share the same key. This symmetric 
channel typically provides both a confidentiality and an authentication channel between a 
specific pair of users. Symmetric ciphers can also be used to provide authentication alone, 
as when a message authentication code is used [25]. 

A public key cryptosystem, as described above, usually has one public key and a corre- 
sponding secret key. For certain algorithms, such as RSA [31], the public key may be used 
for confidentiality and the secret key may be used for authentication. However, for other 
algorithms, it may be that only one channel is provided. For example, signature schemes 
provide authentication but not confidentiality, while schemes such as the McEliece algorithm
[25] are suitable for confidentiality but not authentication.

There are three components that make up a security architecture in the model considered
here: (1) users; (2) trusted users, including information on who trusts whom; and (3)
secure channels, which may provide confidentiality, authentication or both.

The formal model is a relatively simple specification and defines a state-based sequential
system as described in Spivey [32]. The first line of the specification defines the abstract 
types which will be used. These are users and keys [User, Key] and they are fundamental 
values in the model. Other components, such as secure channels and trusted users, will be 
defined in terms of these sets. 

Next, some global, or axiomatic, definitions are made. These are things that are not
expected to change within the model and so can be excluded from the rest of the system
state. Five sets of keys are defined concerning two separate properties. A key must be in
exactly one of the sets public, secret, shared; in other words, these sets partition the
keys. In addition, a key must be a confidentiality key (in the set Conf) or an authentication
key (in the set Auth) or both. The dual, or inverse, is defined for each key. Taking the dual
of a key is a self-inverse operation. The dual of a confidentiality key is still a confidentiality
key and similarly for authentication keys. Secret and public keys are interchanged under
the dual map, while the dual of a shared key is still shared. The trusted users are defined
by a map which defines those users trusted by each user.

Table 15.1 Special Z symbols

Symbol       Meaning
f : X → Y    Function between X and Y
f : X ↔ Y    Relation between X and Y
id X         The identity function on X
dom f        The domain of f
ran f        The range of f
f(|X|)       Image of the set X under the function f
f ⊕ g        Function which takes the values of the function f except on the domain of g,
             where it takes the values of g
f ∘ g        Functional composition, where the domain of g must equal the range of f
X \ Y        Set difference of X and Y
ℙ X          The set of sets with elements of X as values

For a formal representation of the system components, the notation of Table 15.1, known
as Z notation [32], is used, resulting in [21]:



    Shared, Public, Secret, Auth, Conf : ℙ Key
    dual : Key → Key
    Trusted : User ↔ User

    (Shared, Public, Secret) partition Key
    Auth ∪ Conf = Key
    dual ∘ dual = id Key
    dual(|Conf|) = Conf
    dual(|Auth|) = Auth
    dual(|Shared|) = Shared
    dual(|Public|) = Secret
    dual(|Secret|) = Public
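
The axiomatic definitions above can be read as checks on a concrete set of keys. The sketch below encodes them with Python sets and a dictionary for the dual map; the function name check_axioms and the example key names are assumptions made for illustration and are not part of the formal model.

```python
def check_axioms(keys, shared, public, secret, auth, conf, dual):
    """Return True if the key sets and the dual map satisfy the axioms of the model."""

    def image(subset):
        # Image of a set of keys under the dual map.
        return {dual[k] for k in subset}

    is_partition = (
        shared | public | secret == keys
        and not (shared & public)
        and not (shared & secret)
        and not (public & secret)
    )
    return all([
        is_partition,                            # (Shared, Public, Secret) partition Key
        auth | conf == keys,                     # every key is Auth, Conf or both
        all(dual[dual[k]] == k for k in keys),   # dual is self-inverse
        image(conf) == conf,
        image(auth) == auth,
        image(shared) == shared,
        image(public) == secret,                 # public and secret keys are interchanged
        image(secret) == public,
    ])


# Hypothetical example: one public/secret pair for user B and one shared key.
KEYS = {"pubB", "secB", "kAB"}
DUAL = {"pubB": "secB", "secB": "pubB", "kAB": "kAB"}
VALID = check_axioms(
    KEYS,
    shared={"kAB"}, public={"pubB"}, secret={"secB"},
    auth=set(KEYS), conf=set(KEYS),
    dual=DUAL,
)
```

A dual map that sent a public key to itself, for instance, would violate the last two conditions and the check would fail.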

The first ordinary schema defines the variable that records what keys are known by each 
user, and with whom they are associated. 

Keys
    keys : User → ℙ (User × Key)



A set of (user, key) pairs is associated to each user, where 'x maps to (y, k)' means that
x knows key k and uses it in communications with user y. The following three schemas
define the state space of the model by giving formal definitions of secure channels in terms 
of possession of keys. 

Confidentiality channels
    Keys
    ConfChannels : User ↔ User

    ∀ x, y : User • (x, y) ∈ ConfChannels ⇔
        (∃ k : Conf \ Secret; z : User • (y, k) ∈ keys(x) ∧ (z, dual(k)) ∈ keys(y))

Confidentiality channels define relations between pairs of users. These are ordered pairs 
as the channel may be in only one direction. It will be noticed that the definition is not 
symmetrical with respect to x and y. The predicate states that, for a confidentiality channel 
to exist from x to y, there must be a key whose use includes confidentiality and is either 
shared or public; x must associate this key with y; y must know the dual of the key. In the 
model, this means that y must associate it with some user(s), but it is not of any concern 
to y exactly which users know the key. To see why this is reasonable, consider a public 
key system providing confidentiality to y. Any user z who knows the public key of y has 
a confidentiality channel to y, and it is important to z that it is only y who has the secret 
dual key; y must of course know the secret key, but it does not matter to y who knows the 
public key. This corresponds to the viewpoint that confidentiality is a service provided to 
the sender of information. 

Authentication channels
    Keys
    AuthChannels : User ↔ User

    ∀ x, y : User • (x, y) ∈ AuthChannels ⇔
        (∃ k : Auth \ Public; z : User • (z, k) ∈ keys(x) ∧ (x, dual(k)) ∈ keys(y))

Authentication channels also define relations between pairs of users. The definition of 
authentication channels is dual to the definition of confidentiality channels and corresponds 
to the viewpoint that authentication is a service provided to the receiver of information. 

Symmetric channels
    Keys
    SymmChannels : ℙ (ℙ User)

    SymmChannels ⊆ {x, y : User | x ≠ y • {x, y}}
    ∀ x, y : User • {x, y} ∈ SymmChannels ⇔
        (∃ k : Shared • (y, k) ∈ keys(x) ∧ (x, dual(k)) ∈ keys(y))

Symmetric channels are in both directions and so are defined as sets of two different
users. They correspond to the situation where neither key is public, and in practice the key 
and its dual are usually equal. 
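
The three channel definitions can be evaluated mechanically from the keys each user holds. The following sketch is one possible reading of the schemas; the function names, the dictionary layout (user mapped to a set of (user, key) pairs) and the example users A, B and C are assumptions made for illustration.

```python
def conf_channels(keys, conf, secret, dual):
    """(x, y): x associates a key k in Conf minus Secret with y, and y knows dual(k)."""
    found = set()
    for x, held in keys.items():
        for (y, k) in held:
            if k in conf and k not in secret and y in keys:
                if any(k2 == dual[k] for (_z, k2) in keys[y]):
                    found.add((x, y))
    return found


def auth_channels(keys, auth, public, dual):
    """(x, y): x knows a key k in Auth minus Public, and y associates dual(k) with x."""
    found = set()
    for x, held in keys.items():
        for (_z, k) in held:
            if k in auth and k not in public:
                for y, held_y in keys.items():
                    if (x, dual[k]) in held_y:
                        found.add((x, y))
    return found


def symm_channels(keys, shared, dual):
    """{x, y}: x and y are different and associate a shared key with each other."""
    found = set()
    for x, held in keys.items():
        for (y, k) in held:
            if x != y and k in shared and (x, dual[k]) in keys.get(y, set()):
                found.add(frozenset({x, y}))
    return found


# Hypothetical example: A knows B's public key and shares the key kAC with C.
DUAL = {"pubB": "secB", "secB": "pubB", "kAC": "kAC"}
ALL_KEYS = set(DUAL)
KEYS = {
    "A": {("B", "pubB"), ("C", "kAC")},
    "B": {("B", "secB")},
    "C": {("A", "kAC")},
}
CONF = conf_channels(KEYS, conf=ALL_KEYS, secret={"secB"}, dual=DUAL)
AUTH = auth_channels(KEYS, auth=ALL_KEYS, public={"pubB"}, dual=DUAL)
SYMM = symm_channels(KEYS, shared={"kAC"}, dual=DUAL)
```

In this example the public key gives a confidentiality channel from A to B and an authentication channel from B to A, while the shared key gives channels of both kinds in both directions between A and C.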

State
    ConfidentialityChannels
    AuthenticationChannels
    SymmetricChannels

The system state is defined exactly by what keys are known by each user, thereby defining 
what secure channels exist. 

Transfer
    ΔState
    orig?, dest?, recip?, sender? : User
    k? : Key

    (orig?, k?) ∈ keys(sender?)
    keys' = keys ⊕ {recip? ↦ keys(recip?) ∪ {(orig?, k?)}}

This schema says that if a key k? is sent from one user to another, then the keys known to
the recipient are updated to associate the key sent with the originator. In this model, the only 
state changes are those which happen as a result of passing keys from one user to another. 
Such a key exchange may or may not result in new channels being formed. The key passes 
from the sender to the recipient. It may be that the users between whom communication is 
intended (originator and destination) are different from the sender and recipient involved in 
a particular exchange. This is the situation if the sender is a key server. The recipient will 
therefore associate the received key with the originator, who may or may not be the sender. 

SecureTransfer
    Transfer

    k? ∈ Secret ⇒ (sender?, recip?) ∈ ConfChannels
    k? ∈ Public ⇒ (sender?, recip?) ∈ AuthChannels
    k? ∈ Shared ⇒ (sender?, recip?) ∈ ConfChannels ∩ AuthChannels
    orig? ≠ sender? ∧ k? ∈ Public ∪ Shared ⇒ sender? ∈ Trusted(recip?)
    recip? ≠ dest? ∧ k? ∈ Secret ∪ Shared ⇒ recip? ∈ Trusted(sender?)

This schema specifies the conditions which must exist if a key exchange should be
performed securely: (1) secret keys may only be transferred over a confidentiality channel; 
(2) public keys must be transferred over authentication channels; (3) if the key is to be 
shared, the channel should provide both confidentiality and authentication since both users 
need to associate the key only with each other; (4) if the key is public or shared then the 
recipient must trust the sender, unless the sender is the originator; this is because the key 
must be correctly assigned to the originator; and (5) if the key is secret or shared, then the 
sender must trust the recipient not to reveal it, unless the recipient is the destination. More 
details on formal protocol presentation can be found in References [21-32]. 
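
The five conditions for a secure key transfer translate into a simple predicate. The sketch below is an illustrative reading of the schema, not a reference implementation: the function names, the keyword arguments and the representation of Trusted as a dictionary from a user to the set of users it trusts are all assumptions.

```python
def secure_transfer_ok(k, orig, dest, sender, recip, *, secret, public, shared,
                       conf_channels, auth_channels, trusted):
    """Check conditions (1) to (5) for sending key k from sender to recip."""
    link = (sender, recip)
    if k in secret and link not in conf_channels:
        return False                         # (1) secret keys need confidentiality
    if k in public and link not in auth_channels:
        return False                         # (2) public keys need authentication
    if k in shared and not (link in conf_channels and link in auth_channels):
        return False                         # (3) shared keys need both
    if orig != sender and (k in public or k in shared):
        if sender not in trusted.get(recip, set()):
            return False                     # (4) recipient must trust the sender
    if recip != dest and (k in secret or k in shared):
        if recip not in trusted.get(sender, set()):
            return False                     # (5) sender must trust the recipient
    return True


def transfer(keys, k, orig, sender, recip):
    """State change of the Transfer schema: recip now associates k with orig."""
    if (orig, k) not in keys[sender]:
        raise ValueError("sender does not hold this key for the originator")
    updated = dict(keys)
    updated[recip] = keys[recip] | {(orig, k)}
    return updated


# Hypothetical example: key server S passes B's public key on to A.
KEYS = {"S": {("B", "pubB")}, "A": set(), "B": {("B", "secB")}}
SETS = dict(secret={"secB"}, public={"pubB"}, shared=set(),
            conf_channels=set(), auth_channels={("S", "A")})
ALLOWED = secure_transfer_ok("pubB", "B", "A", "S", "A", trusted={"A": {"S"}}, **SETS)
AFTER = transfer(KEYS, "pubB", "B", "S", "A")
```

Because the sender here is a key server rather than the originator, condition (4) applies and the transfer is acceptable only if A trusts S.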



15.3 KEY MANAGEMENT 

Information protection mechanisms, discussed so far in this chapter, assume cryptographic 
keys to be distributed to the communicating parties prior to secure communications. The 
secure management of these keys is one of the most critical elements when integrating 
cryptographic functions into a system, since even the most elaborate security concept will 
be ineffective if the key management is weak. 

An automatic distribution of keys typically employs different types of messages. A 
transaction usually is initiated by requesting a key from some central facility (e.g. a key 
distribution center, KDC), or from the entity a key is to be exchanged with. Cryptographic 
service messages (CSMs) are exchanged between communicating parties for the trans- 
mission of keying material, or for authentication purposes. CSMs may contain keys, or 
other keying material, such as the distinguished names of entities, key-IDs, counters or 
random values. CSMs have to be protected depending on their contents and on the security 
requirements. Generic requirements include the following: 

(1) Data confidentiality should be provided while secret keys and possibly other data 
are being transmitted or stored. 

(2) Modification detection prevents the active threat of unauthorized modification of 
data items. In most environments, all cryptographic service messages have to be 
protected against modification. 

(3) Replay detection is to counter unauthorized duplication of data items. 

(4) Timeliness requires that the response to a challenge message is prompt and does 
not allow for playback of some authentic response message by an impersonator. 

(5) Entity authentication is to corroborate that an entity is the one claimed. 

(6) Data origin authentication (proof/nonrepudiation of origin) is to make certain that
the source of a message is the one claimed. 

(7) Proof/nonrepudiation of reception shows the sender of a message that the message 
has been received by its legitimate receiver correctly. 

(8) Notarization is the registration of messages so that their content, origin,
destination or time of issue can be attested at a later stage.

The correctness of key management protocols requires more than the existence of secure
communication channels between entities and key management servers. For example, it 
critically depends on the capability of those servers to reliably follow the protocols. There- 
fore, each entity has to base its deductions not only on the protocol elements sent and 
received, but also on its trust in the server which, for that reason, often is called the trusted 
party. 

Key management is facilitated by the key management services, which include entity 
registration, key generation, certification, authentication, key distribution and key mainte- 
nance. 

Entity registration is a procedure by which an individual or a device is authenticated to 
the system. An absolute identification is provided if a link between an ID (e.g. a distin- 
guished name or a device-ID) and some physical representation of the identified subject 
(e.g. a person or a device) can be established. An identification can be carried out man- 
ually or automatically. Absolute identification always requires at least one initial manual 
identification (e.g. by showing a passport or a device-ID). 

Mutual authentication is usually based on the exchange of certificates. In any system, an 
entity is represented by some public data, called its (public) credentials (e.g. ID and address). 
Besides that, an entity may own secret credentials (e.g. testimonials) that may or may not 
be known by some trusted party. Whenever an entity is registered, a certificate based upon 
its credentials is issued as a proof of registration. This may involve various procedures, 
from a protected entry in a specific file to a signature by the certification authority on the 
credentials. 

Key generation refers to procedures by which keys or pairs of keys of good cryptographic 
quality are securely and unpredictably generated. This implies the use of a random or pseu- 
dorandom process involving random seeds, which cannot be manipulated. The requirements 
are that certain elements of the key space are not more probable than others, and that it is 
not possible for the unauthorized to gain knowledge about keys. 

Certificates are issued for authentication purposes. A credential containing identifying 
data together with other information (e.g. public keys) is rendered unforgeable by some 
certifying information (e.g. a digital signature provided by the key certification center). 
Certification may be an on-line service where some certification authority provides interac- 
tive support and is actively involved in key distribution processes, or it may be an off-line 
service so that certificates are issued to each entity only at some initial stage. 

Authentication/Verification may be (1) entity authentication or identification,
(2) message content authentication or (3) message origin authentication. The term verification
refers to the process of checking the appropriate claims, i.e. the correct identity of an
entity, the unaltered message content or the correct source of a message. The validity of a 
certificate can be verified using some public information (e.g. a public key of the key certi- 
fication center), and can be carried out without the assistance of the certification authority, 
so that the trusted party is only needed for issuing the certificates. 

Key distribution refers to procedures by which keys are securely provided to parties 
legitimately asking for them. The fundamental problem of key exchange or distribution is 
to establish keying material to be used in symmetric mechanisms whose origin, integrity 
and confidentiality can be guaranteed. As a result of varied design decisions appropriate 
to different circumstances, a large variety of key distribution protocols exist [37, 39]. The 
basic elements of a key distribution protocol are as follows. 

15.3.1 Encipherment

The confidentiality of a data item D can be ensured by enciphering D with an appropriate 
key K which is denoted eK(D). Depending on whether a secret key algorithm or a public 
key algorithm is used for the enciphering process, D will be enciphered with a secret key K 
shared between the sender and the legitimate recipient of the message, or with the legitimate 
recipient B's public key KBp. Encipherment with the sender A's private key KAs may be
used to authenticate the origin of data item D, or to identify A. Encipherment with a secret 
key provides modification detection if B has some means to check the validity of D (e.g. if 
B knows D beforehand, or if D contains suitable redundancy). 

15.3.2 Modification detection codes 

To detect the modification of a data item D, one can add some redundancy that has to be 
calculated using a collision-free function, i.e. it must be infeasible to find two different values 
of D that render the same result. Moreover, this process has to involve a secret parameter K 
in order to prevent forgery. Appropriate combination of K and D also allows for data origin 
authentication. Examples of suitable building blocks are message authentication codes, or 
hash functions combined with encipherment. The generic form of this building block is 
D || mdcK(D). Modification detection codes (mdc) enable the legitimate recipient to detect
unauthorized modification of the transmitted data immediately after receipt. The correctness 
of distributed keying material can also be checked if the sender confirms his knowledge of 
the key in a second step (see below). 
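
As a minimal sketch of the building block D || mdcK(D), the following Python fragment uses HMAC-SHA256 from the standard library in the role of mdcK. The choice of HMAC-SHA256 and the function names protect and check are assumptions made for illustration; the text does not prescribe a particular algorithm.

```python
import hashlib
import hmac

TAG_LEN = 32  # length of an HMAC-SHA256 output in bytes


def protect(key: bytes, data: bytes) -> bytes:
    """Return D || mdcK(D), with HMAC-SHA256 playing the role of mdcK."""
    return data + hmac.new(key, data, hashlib.sha256).digest()


def check(key: bytes, token: bytes) -> bytes:
    """Split D || mdcK(D), recompute the code and return D if it matches."""
    data, tag = token[:-TAG_LEN], token[-TAG_LEN:]
    expected = hmac.new(key, data, hashlib.sha256).digest()
    if not hmac.compare_digest(tag, expected):
        raise ValueError("modification detected")
    return data
```

Because the code depends on the secret parameter K, a party that does not know K cannot produce a valid code for altered data, which is what gives the recipient both modification detection and data origin authentication.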

15.3.3 Replay detection codes 

To detect the replay of a message and to check its timeliness, some explicit or implicit 
challenge-and-response mechanism has to be used, since the recipient has to be able to 
decide on the acceptance. In most applications, the inclusion of a replay detection code 
denoted by D || rdc (e.g. a timestamp TD, a counter CT, or a random number R) will only
make sense if it is protected by modification detection. With symmetric cryptographic 
mechanisms, key modification, i.e. some combination (e.g. XOR) of the secret key with 
an rdc, can be used to detect the replay of a message. A special case is the process of key 
offsetting used to protect keying material enciphered for distribution where the key used 
for encipherment is XORed with a count value. 
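
The idea of key modification can be sketched as follows, assuming a counter CT as the rdc and HMAC-SHA256 as the keyed check. The names offset_key, send and Receiver are hypothetical, and the scheme is an illustration of the principle rather than a standardized construction.

```python
import hashlib
import hmac


def offset_key(key: bytes, counter: int) -> bytes:
    """Key offsetting: XOR the secret key with a count value."""
    ct = counter.to_bytes(len(key), "big")
    return bytes(k ^ c for k, c in zip(key, ct))


def send(key: bytes, counter: int, data: bytes) -> tuple[int, bytes, bytes]:
    """Protect data under the key modified by the current counter."""
    tag = hmac.new(offset_key(key, counter), data, hashlib.sha256).digest()
    return counter, data, tag


class Receiver:
    def __init__(self, key: bytes) -> None:
        self.key = key
        self.last_counter = -1  # highest counter accepted so far

    def accept(self, counter: int, data: bytes, tag: bytes) -> bool:
        if counter <= self.last_counter:
            return False  # replayed or stale message
        expected = hmac.new(offset_key(self.key, counter), data,
                            hashlib.sha256).digest()
        if not hmac.compare_digest(tag, expected):
            return False  # tag was not computed under this counter's key
        self.last_counter = counter
        return True
```

A message replayed with its old counter is rejected by the counter test, and a message replayed with a fresh counter is rejected because its code was computed under a different offset key.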

15.3.4 Proof of knowledge of a key

Authentication can be implemented by showing knowledge of a secret (e.g. a secret key). 
Nevertheless, a building block that proves the knowledge of a key K can also be useful, 
when K is public. There are several ways for A to prove to B the knowledge of a key that 
are all based on the principle of challenge and response in order to prevent a replay attack. 
Depending on the challenge which may be a data item in cleartext or in ciphertext, A has to 
process the key K and the rdc in an appropriate way (e.g. by encipherment, or by calculating 
a message authentication code), or A has to perform a deciphering operation. The challenge 
may explicitly be provided by B (e.g. a random number R) or implicitly be given by a 
synchronized parameter (e.g. a timestamp TD or a counter CT). For some building blocks,
the latter case requires only one pass to prove knowledge of K; its tradeoff is the necessary
synchronization. If B provides a challenge enciphered with a key K*, the enciphered data
item has to be unpredictable (e.g. a random number R, or a key K**). The generic form of
this building block is authK(A to B). 
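
A minimal sketch of authK(A to B) with an explicit challenge is given below. It assumes that A processes the challenge by calculating a message authentication code (HMAC-SHA256 here); the function names are hypothetical.

```python
import hashlib
import hmac
import secrets


def make_challenge() -> bytes:
    """B's explicit challenge: an unpredictable random number R."""
    return secrets.token_bytes(16)


def respond(key: bytes, challenge: bytes) -> bytes:
    """A proves knowledge of K by computing a MAC over the challenge."""
    return hmac.new(key, challenge, hashlib.sha256).digest()


def verify(key: bytes, challenge: bytes, response: bytes) -> bool:
    """B recomputes the expected response and compares in constant time."""
    return hmac.compare_digest(response, respond(key, challenge))
```

Since B draws a fresh R for every run, a response recorded in an earlier run does not verify against a new challenge, which is how the replay attack is prevented.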

15.3.5 Point-to-point key distribution 

This is the basic mechanism of every key distribution scheme. If based on symmetric cryp- 
tographic techniques, point-to-point key distribution requires that the two parties involved 
already share a key that can be used to protect the keying material to be distributed. If 
based on asymmetric techniques, point-to-point key distribution requires that each of the 
two parties has a public key with its associated secret key, and the certificate of the public 
key produced by a certification authority known to the other party. General assumptions 
are: (1) the initiator A is able to generate or otherwise acquire a secret key K*; (2)
security requirements are confidentiality of K*, modification and replay detection, mutual
authentication of A and B, and a proof of delivery for K*. For point-to-point key distribu- 
tion protocols based on symmetric cryptographic techniques we additionally assume: (3) a 
key Kab is already shared by A and B. In generic form, a point-to-point key distribution 
protocol meeting those requirements can be described as shown in Table 15.2 [33]. 

Table 15.3 is an example of a specific point-to-point key distribution protocol [33] (N 
denotes a nonrepeating number, R a random number). 
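
The three passes of the protocol in Table 15.3 can be traced with the sketch below. The function toy_cipher is a hypothetical stand-in for eK(.) built from SHA-256 and XOR; it is not secure and provides no integrity protection, and it is used only so that the example runs with the standard library. Field lengths (16 bytes for K*, 8 bytes for N and R) are likewise assumptions.

```python
import hashlib
import secrets


def toy_cipher(key: bytes, data: bytes) -> bytes:
    """Toy XOR stream cipher standing in for eK(.); NOT secure.

    Applying it twice with the same key recovers the input.
    """
    stream = b""
    block = 0
    while len(stream) < len(data):
        stream += hashlib.sha256(key + block.to_bytes(4, "big")).digest()
        block += 1
    return bytes(d ^ s for d, s in zip(data, stream))


def run_protocol(k_ab: bytes) -> bytes:
    """Run the three passes and return the key K* as learned by B."""
    # (1) A -> B: eK_AB(K* || N)
    k_star, n = secrets.token_bytes(16), secrets.token_bytes(8)
    msg1 = toy_cipher(k_ab, k_star + n)

    # B deciphers, learns K* and N, and picks a random number R.
    plain1 = toy_cipher(k_ab, msg1)
    k_star_b, n_b = plain1[:16], plain1[16:]
    r = secrets.token_bytes(8)
    # (2) B -> A: eK*(N || R)
    msg2 = toy_cipher(k_star_b, n_b + r)

    # A deciphers with K*; seeing its own N back shows that B knows K*.
    plain2 = toy_cipher(k_star, msg2)
    if plain2[:8] != n:
        raise ValueError("B failed to prove knowledge of K*")
    # (3) A -> B: R, which shows B that A knows K*.
    msg3 = plain2[8:]
    if msg3 != r:
        raise ValueError("A failed to prove knowledge of K*")
    return k_star_b
```

Pass (2) serves as the proof of delivery for K*, and pass (3) completes the mutual authentication.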

For a point-to-point key distribution protocol based on public key techniques, we make 
the following supplementary assumptions: (4) there is no shared key known to A and B 
before the key exchange process starts; (5) there is a trusted third party C, where A can 
receive a certificate that contains the distinguished names of A and C, A's public key K_Ap,
and the certificate's expiration date TE. The integrity of the certificate is protected by C's
signature. As an example, A's certificate can look like
ID_C || ID_A || K_Ap || TE || eK_Cs[h(ID_C || ID_A || K_Ap || TE)].

The exchange of certificates can be performed off-line and is not shown in the following
protocol. In this protocol, A sends a message (often referred to as token) to B that consists of a
secret key K* enciphered with B's public key and an appended rdc. The integrity of the token
is protected by A's signature. This guarantees modification and replay detection, as well
as data origin authentication. B responds with the enciphered rdc, thereby acknowledging
that it has received the key K*, as shown in Table 15.4.

Table 15.2 Generic point-to-point key distribution

A                                                          B

(1) ->     eK_AB(K* || rdc)
(2) ->     authK*(A to B)
           authK*(B to A)                             <- (3)

Table 15.3 Point-to-point key distribution (ISO/IEC CD 9798-2)

A                                                          B

(1) ->     eK_AB(K* || N)
           eK*(N || R)                                <- (2)
(3) ->     R

Table 15.4 Point-to-point key distribution (ISO/IEC CD 9798-2)

A                                                          B

(1) ->     eK_Bp(K*) || rdc || eK_As(h(eK_Bp(K*) || rdc))
           eK*(rdc)                                   <- (2)

Key maintenance includes procedures for key activation, key storage, key replacement, 
key translation, key recovery, black listing of compromised keys, key deactivation and key 
deletion. Some of the issues of key maintenance are addressed below. 

Storage of keying material refers to a key storage facility which provides secure storage 
of keys for future use, e.g. confidentiality and integrity for secret keying material, or integrity 
for public keys. Secret keying material must be protected by physical security (e.g. by storing 
it within a cryptographic device) or enciphered by keys that have physical security. For all 
keying material, unauthorized modification must be detectable by suitable authentication 
mechanisms. 

Key archival refers to procedures by which keys for notarization or nonrepudiation 
services can be securely archived. Archived keys may need to be retrieved at a much later 
date to prove or disprove certain claims. 

Key replacement enables parties to securely update their keying material. A key shall be 
replaced when its compromise is known or suspected. A key shall also be replaced within 
the time deemed feasible to determine it by an exhaustive attack. A replaced key shall not 
be reused. The replacement key shall not be a variant or any nonsecret transformation of 
the original key. 

Key recovery refers to cryptographic keys which may become lost due to human error, 
software bugs or hardware malfunction. In communication security, a simple handshake 
at session initiation can ensure that both entities are using the same key. Also, message 
authentication techniques can be used for testing that plaintext has been recovered using 
the proper key. Key authentication techniques permit keys to be validated prior to their use. 
In the case where a key was lost, it still may be possible to recover that key by searching 
part of the key space. This approach may be successful, if the number of likely candidates 
is small enough. 

Key deletion refers to procedures by which parties are assured of the secure destruction 
of keys that are no longer needed. Destroying a key means eliminating all records of this 
key, such that no information remaining after the deletion provides any feasibly usable 
information about the destroyed key. 

More information on key management can be found in References [33-39]. 

15.4 SECURITY MANAGEMENT IN GSM NETWORKS 

The generic authentication process is shown in Figure 15.9 [41, 42]. The authentication
algorithm (called A3 in the GSM specifications) computes from a random number RAND,
both at the MS (mobile station) and at the AuC, a signed response (SRES), using an
individual secret key Ki attached to the mobile subscriber. The number RAND, whose
value is drawn randomly between 0 and 2^128 - 1, is used to generate the response by the
mobile as well as by the fixed part of the network. The authentication process is carried
out both at the mobile and at the MSC simultaneously. The BSS remains transparent to
this process. The mobile only receives the random number over the radio path and in turn
returns the signed response to the network. Thus, an air interface mobile designation is
not disclosed. At subscription time, the subscriber authentication key Ki is allocated to
the subscriber together with its IMSI (international mobile subscriber identity). The key Ki is
stored in the AuC and used to generate a triplet (Kc, signed response, RAND) within the
GSM system. As stated above, the same Ki is also stored at the mobile in the subscriber
identity module (SIM). In the AuC, the following steps are carried out in order to produce
one triplet. A nonpredictable RAND is produced. RAND and Ki are used to calculate the
signed response and the ciphering key (Kc) using two different algorithms (A3, A8). This
triplet (RAND, signed response and Kc) is generated for each and every user and is then
delivered to the HLR. This procedure is shown in Figure 15.10 [41-43].

Figure 15.9 Generic authentication process.

The AuC begins the authentication and cipher key generation procedures after receiving
an identification of the subscriber from the MSC/VLR. The AuC first queries the HLR
for the subscriber's authentication key Ki. It then generates a 128 b RAND for use as a
challenge (nonce), to be sent to the MS for verification of the MS's authenticity. RAND
is also used by the AuC, with Ki in the algorithm A3 for authentication, to calculate the
expected correct signed response from the MS. RAND and Ki are also used in the AuC to
calculate the cipher key Kc with algorithm A8. The signed response is a 32 b number, and
Kc is a 64 b number.

Figure 15.10 Generation of Kc, signed response and RAND at the AuC.

The values of RAND, signed response and Kc are transmitted to the MSC/VLR for
interaction with the MS. Algorithms A3 and A8 are not fully standardized by GSM and
may be specified at the direction of PLMN operators. Different PLMNs may use different
and proprietary versions of these algorithms. Also, to protect the secrecy of the user, the
authentication key Ki is not sent to the MSC/VLR. Based on the discretion of the PLMN
operator, Ki can be of any format and length. The MSC/VLR forwards the value of the
RAND to the MS, which also has the correct Ki and the algorithms A3 and A8, which are
stored in its SIM. The SIM then uses RAND and Ki in these algorithms to calculate the
authentication response SRESc and the cipher key Kc. The MS sends the calculated
response, SRESc, back to the MSC/VLR, which compares it with the signed response
received from the HLR/AuC. If the SRESc (the signed response computed at the mobile)
and the signed response agree, the subscriber access to the system is granted, and the
cipher key Kc is transferred to the BTS for use in encrypting and decrypting messages
to and from the MS. If the SRESc and the signed response disagree, the subscriber access
to the system is denied. In summary, the VLR initiates authentication toward the MS and
checks the authentication result. The complete process is shown in Figure 15.11.
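
The triplet generation and the comparison of SRESc with the signed response can be sketched as follows. Since A3 and A8 are operator specific, the functions a3 and a8 below are hypothetical stand-ins built from HMAC-SHA256; only the sizes (128 b RAND, 32 b signed response, 64 b Kc) are taken from the text.

```python
import hashlib
import hmac
import secrets


def a3(ki: bytes, rand: bytes) -> bytes:
    """Stand-in for A3: 32-bit signed response (real A3 is operator specific)."""
    return hmac.new(ki, b"A3" + rand, hashlib.sha256).digest()[:4]


def a8(ki: bytes, rand: bytes) -> bytes:
    """Stand-in for A8: 64-bit cipher key Kc (real A8 is operator specific)."""
    return hmac.new(ki, b"A8" + rand, hashlib.sha256).digest()[:8]


def generate_triplet(ki: bytes) -> tuple[bytes, bytes, bytes]:
    """AuC side: produce (RAND, signed response, Kc) for one subscriber."""
    rand = secrets.token_bytes(16)  # 128-bit nonpredictable challenge
    return rand, a3(ki, rand), a8(ki, rand)


def sim_response(ki: bytes, rand: bytes) -> tuple[bytes, bytes]:
    """SIM side: compute SRESc and Kc from the received RAND."""
    return a3(ki, rand), a8(ki, rand)


def msc_vlr_check(sres: bytes, sres_c: bytes) -> bool:
    """MSC/VLR side: grant access only if both responses agree."""
    return hmac.compare_digest(sres, sres_c)
```

Note that Ki appears only in the AuC and SIM functions; the MSC/VLR check operates on the signed responses alone, in line with the requirement that Ki is never sent to the MSC/VLR.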

The ciphering/deciphering algorithm (called A5) uses a cipher key Kc, which is allocated
to each mobile subscriber during the authentication procedures. The key Kc is computed
from the RAND by an algorithm (called A8) driven by the mobile subscriber authentication
key Ki. The interface of algorithm A8 (its inputs and outputs) is common to all GSM
networks, although, as noted above, the algorithm itself may be operator specific.
Figure 15.10 has shown the process of generating Kc. During the authentication procedure,
when a signed response is being calculated at the mobile, the ciphering key (Kc) is also
calculated using another algorithm (A8), as shown in Figure 15.11. This key is set in the
fixed system and in the MS. At the ciphering start command (from VLR to BSS), Kc is used
by the MS and the BTS in order to cipher and decipher the bitstream that is sent over the
radio path. In addition to the authentication procedures, a key setting may be initiated by
the network as often as the network operator wishes. The command to use the encryption
key is sent over a logical channel, the dedicated control channel, as soon as the identity
of the mobile subscriber is known by the network.

Figure 15.11 Authentication process in GSM system.

The key Kc must be agreed upon by the mobile station and the network prior to the start of
encryption. The choice in GSM is to compute the key Kc during the authentication process,
independently of the effective start of encryption. Kc is then stored in a nonvolatile
memory inside the SIM so as to be remembered even after a switched-off phase. This key is
also stored in the visited MSC/VLR on the network side and is ready to be used for the start
of encryption. The actual encryption/decryption of user data takes place within the mobile
station and the BSS. For this purpose, the encryption key is downloaded from the MSC to
the BTS via the BSC. After authentication, the transmission is ciphered, and Kc is used for
ciphering/deciphering. This process is shown in Figure 15.12. Data flow on the radio path
is obtained by a bit-per-bit binary addition of the user data flow and a ciphering bitstream
generated by the GSM algorithm A5 using the ciphering key (Kc). The exact process of
encryption/decryption at the mobile and at the BTS is shown in Figure 15.13. Code words S1
and S2 for downlinks and uplinks are changed at every frame. When S1 is added modulo 2
to the plaintext, the output is the ciphertext. On the other side, the ciphertext, when added
modulo 2 to S1, yields the plaintext. The ciphering/deciphering function is placed on
the transmission chain between the interleaver and the modulator. Since A3 and A8 are
always running together, these two are implemented as a single algorithm in most cases.
The algorithm A5 is standardized in the whole of GSM.
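
The bit-per-bit modulo-2 addition can be sketched as follows. The keystream function is a hypothetical stand-in for A5 built from SHA-256, and the block length of 114 bits per direction is an assumption of this sketch; only the XOR structure and the change of code word at every frame are taken from the text.

```python
import hashlib

FRAME_BITS = 114  # assumption: bits ciphered per frame in each direction


def keystream(kc: bytes, frame_number: int, direction: str) -> list[int]:
    """Stand-in for A5: derive the per-frame code word S1 or S2 as a bit list."""
    seed = kc + frame_number.to_bytes(4, "big") + direction.encode()
    digest = hashlib.sha256(seed).digest()
    bits = [(byte >> i) & 1 for byte in digest for i in range(8)]
    return bits[:FRAME_BITS]


def xor_bits(data: list[int], stream: list[int]) -> list[int]:
    """Bit-per-bit modulo-2 addition; the same call enciphers and deciphers."""
    return [d ^ s for d, s in zip(data, stream)]
```

Because modulo-2 addition is its own inverse, the receiver recovers the plaintext by adding the same code word to the ciphertext, provided both sides use the same Kc and frame number.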

Figure 15.12 Sequential steps for encryption and decryption process.

Figure 15.13 Encryption/decryption process.

15.5 SECURITY MANAGEMENT IN UMTS

Cryptographic functions used in UMTS are specified below (O, operator specific; S, fully
standardized):

Algorithm  Purpose/usage                                             O/S            Location in the network

f0         random challenge generating function                      O              AuC
f1         network authentication function                           O (MILENAGE)   USIM and AuC
f1*        resynchronization message authentication function         O (MILENAGE)   USIM and AuC
f2         user challenge-response authentication function           O (MILENAGE)   USIM and AuC
f3         cipher key derivation function                            O (MILENAGE)   USIM and AuC
f4         integrity key derivation function                         O (MILENAGE)   USIM and AuC
f5         anonymity key derivation function for normal operation    O (MILENAGE)   USIM and AuC
f5*        anonymity key derivation function for resynchronization   O (MILENAGE)   USIM and AuC
f6         MAP encryption algorithm                                  S              MAP nodes
f7         MAP integrity algorithm                                   S              MAP nodes
f8         UMTS encryption algorithm                                 S (KASUMI)     MS and RNC
f9         UMTS integrity algorithm                                  S (KASUMI)     MS and RNC

Figure 15.14 Simplified UMTS security architecture.



More details on these functions can be found in References [46-60] especially [52, 53, 
56, 57]. A simplified UMTS security architecture is shown in Figure 15.14. 

The security architecture is designed around a mutual authentication procedure that is 
executed between the user (USIM) and the SGSN/VLR at the network end. The procedure is
called the UMTS Authentication and Key Agreement (AKA) since, in addition to providing 
authentication services, it also includes generation of session keys for confidentiality and 
integrity protection at the user end. The cryptographic algorithms/functions are defined in a 
requirements specification [49]. The AKA procedure is executed in two stages, as shown in 
Figure 15.14. The first stage involves transfer of security credentials (authentication vector, 
AV) from the home environment (HE) to the serving network (SN). The HE mainly consists 
of the home location register (HLR) and authentication center (AuC); the SN consists of the 
parts of the core network that are directly involved in setting up connections. With respect to 
access security, the SN network elements of interest are the SGSN, which handles packet- 
switched traffic, and the circuit-switched GSM nodes VLR/MSC (mobile switching center). 
An operator with a physical access infrastructure will normally have both HE and SN nodes. 

The authentication vectors contain sensitive data like challenge-response authentication 
data and cryptographic keys. It is therefore clear that the transfer of authentication vectors 
between the HLR/AuC and the SGSN/VLR needs to be secured against eavesdropping 
and modification (i.e. both the transfer's confidentiality and integrity must be protected). 
The actual transfer mechanism for the AVs is the SS7-based mobile application part (MAP) 
protocol. The MAP protocol itself contains no security functionality, but a security extension 
to MAP called MAPsec [50] has been developed by the 3G Partnership Project (3GPP). The 
MAPsec protocol belongs to the Network Domain Security (NDS) work area in 3GPP. NDS 
covers both the MAPsec specification and specifications for how to protect IP connections
on the control plane of the UMTS core network [51]. 

The second AKA stage is where the SGSN/VLR executes the one-pass challenge-response
procedure to achieve mutual entity authentication between the USIM and the
network (SN, HE). A point to be made is that, in a two-staged AKA approach, the HE
delegates responsibility for executing the security procedures to the SN. There must therefore
be a trust relationship between the HE and the SN in this matter. In the GSM environment,
this trust relationship is regulated through roaming agreements; the same model should be
applicable to UMTS. The cryptographic functions (f0 to f5*) used in the AKA procedure
are implemented exclusively in the USIM and AuC. UMTS operators are free to choose
any algorithm they want provided it complies with the function input/output specification
given in Reference [49]. However, 3GPP has developed the MILENAGE [51, 52] algorithm
set to provide the AKA functions. The formal status of MILENAGE is that it is provided as
an example algorithm set, but in practice it is the recommended algorithm set for the AKA
functions.

MILENAGE itself is built around the symmetric block cipher Rijndael. A consequence 
of having mutual authentication is that the USIM is now an active entity. In GSM, the user 
could not authenticate the network; hence, the UE could not reject the network. In UMTS, 
the USIM will attempt to authenticate the network and it is now possible that the USIM will 
reject the network. Details on the implementation of different security services in UMTS 
can be found in References [46-60]. 
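
The role of the USIM as an active entity can be illustrated with the sketch below. The helper f is a hypothetical stand-in for the functions f1 to f5 (HMAC-SHA256 instead of MILENAGE), and the layout of the authentication vector and of the token AUTN (concealed sequence number, management field and authentication code) is an assumption modelled on the AKA specifications cited above, not a normative implementation. The freshness check on the sequence number is omitted.

```python
import hashlib
import hmac
import secrets


def f(label: bytes, k: bytes, data: bytes, n: int) -> bytes:
    """Stand-in for the AKA functions f1-f5 (MILENAGE in practice)."""
    return hmac.new(k, label + data, hashlib.sha256).digest()[:n]


def xor(a: bytes, b: bytes) -> bytes:
    return bytes(x ^ y for x, y in zip(a, b))


def generate_av(k: bytes, sqn: bytes, amf: bytes = b"\x00\x00") -> dict:
    """HE/AuC side: build one authentication vector (AV)."""
    rand = secrets.token_bytes(16)           # f0: random challenge
    mac = f(b"f1", k, sqn + rand + amf, 8)   # network authentication code
    ak = f(b"f5", k, rand, 6)                # anonymity key conceals SQN
    return {
        "RAND": rand,
        "XRES": f(b"f2", k, rand, 8),        # expected user response
        "CK": f(b"f3", k, rand, 16),         # cipher key
        "IK": f(b"f4", k, rand, 16),         # integrity key
        "AUTN": xor(sqn, ak) + amf + mac,
    }


def usim_authenticate(k: bytes, rand: bytes, autn: bytes) -> bytes:
    """USIM side: authenticate the network first, then return RES."""
    sqn = xor(autn[:6], f(b"f5", k, rand, 6))
    amf, mac = autn[6:8], autn[8:]
    if not hmac.compare_digest(mac, f(b"f1", k, sqn + rand + amf, 8)):
        raise ValueError("network rejected: authentication code mismatch")
    return f(b"f2", k, rand, 8)
```

The SGSN/VLR would compare the returned RES with XRES, much as in GSM; the new element is the exception path, in which the USIM rejects a network that cannot present a valid authentication code.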

15.6 SECURITY ARCHITECTURE FOR UMTS/WLAN INTERWORKING 

As specified in Chapter 1, 4G will integrate different wireless systems and provide in- 
tertechnology roaming. Security architectures will depend on the type (tight or loose) of 
interworking. A tight UMTS/WLAN interworking solution would mandate the full 3GPP 
security architecture and require the 3GPP protocol stacks and interfaces to be present in 
the WLAN system [64]. The loose interworking options merely require the 3GPP authen-
tication method to be implemented. To avoid link layer modifications, the authentication
protocol is allowed to run at the link layer using Internet protocols - extensible authen- 
tication protocol (EAP) [63] and authentication, authorization and accounting (AAA) - 
as transport mechanisms. When used in WLAN the protocol EAP will be referred to as 
EAPOL. The main 3GPP-WLAN interworking architecture is defined in 3GPP TS 23.234
[61]; the security architecture is found in 3GPP TS 33.234 [62]. 

A benefit of the loose interworking approach is that the 3GPP-WLAN architecture is 
a fairly simple architecture. The architecture contains the WLAN access network and a 
UMTS core network in addition to glue technology to connect the two systems. Figure 
15.15 gives an overview of the proposed architecture [64]. The two key glue components 
of the interworking solution are the AAA and EAP technologies. These are used to exe- 
cute the UMTS AKA protocol from the 3G system's home domain toward the WLAN user 
equipment. The AAA architecture and the RADIUS and/or Diameter protocol are to be used 
as the bridge between the 3GPP system and the WLAN access network. Both Diameter 
and RADIUS are generic protocols and are intended to provide support for a diverse set of 
AAA applications, including network access, IP mobility and interoperator roaming. The 
EAP-AKA [67] protocol allows the UMTS AKA security protocol, which was originally 
designed for execution over UTRAN, to be executed over the WLAN access toward the user
equipment. EAP [63] is a key element in the 3GPP-WLAN security architecture. EAP
provides, in essence, a generic peer-to-peer based request-response transaction environment
for authentication dialogs, and supports multiple authentication mechanisms. EAP typically
runs directly over the link layer without requiring IP. EAP has its own flow control
mechanisms, and is capable of removing duplicate messages and retransmitting lost messages.
EAP can be used over different link layer protocols including the IEEE WLAN link layer.
The necessary EAP encapsulation is described in the EAP-over-LAN specification [68].

Figure 15.15 3GPP-WLAN architecture.

The EAP protocol does not natively provide much in terms of authentication mecha- 
nisms. Instead, its power lies in its generic mechanism to support existing authentication 
methods through specialized EAP methods. EAP contains a negotiation sequence where the 
authenticator requests information about which authentication method to use. The EAP ar- 
chitecture does not require the authenticator to support all authentication methods. Instead, 
the authenticator can request assistance from a backend authentication server to complete 
the authentication processing. The specific authentication methods supported are defined in 
separate specifications detailing how the EAP framework is to be used to run the target
authentication methods. For 3GPP-WLAN interworking, primary interest is in the EAP-AKA
[64, 67] methods. An execution of the EAP-AKA procedure specific to the IEEE 802.11 
WLAN is illustrated in Figure 15.16. 



15.7 SECURITY IN AD HOC NETWORKS 



Traditional security mechanisms, such as authentication protocols, digital signature and 
encryption, still play important roles in achieving confidentiality, integrity, authentication 
and nonrepudiation of communication in ad hoc networks. However, these mechanisms are 
not sufficient by themselves. 

Figure 15.16 UMTS AKA procedure between a 3GPP network and an 802.11 WLAN MS.



In Zhou and Haas [69], two additional principles are discussed. First, the redundancies 
in the network topology (i.e. multiple routes between nodes) are exploited to achieve avail- 
ability. The second principle is distribution of trust. Although no single node is trustworthy 
in an ad hoc network because of low physical security and availability, the trust can be 
distributed to an aggregation of nodes. Assuming that any t + 1 nodes will be unlikely to 
all be compromised, consensus of at least t + 1 nodes is trustworthy. 

Although certain physical layer countermeasures, such as spread spectrum and coding, 
are possible [70, 79, 83, 84], we will only focus on how to defend against denial of service 
attacks towards routing protocols. 

In most routing protocols, discussed in Chapter 13, routers exchange information on the 
topology of the network and this information could become a target for malicious adversaries 
who intend to bring the network down. 

There are two sources of threats to routing protocols. The first comes from external 
attackers. By injecting erroneous routing information, replaying old routing information
or distorting routing information, an attacker could successfully partition a network or
introduce excessive traffic load into the network by causing retransmission and inefficient
routing. The second and more severe kind of threat comes from compromised
nodes, which might advertise incorrect routing information to other nodes. Detection of 
such incorrect information is difficult. Merely requiring routing information to be signed by 
each node would not work, because compromised nodes are able to generate valid signatures 
using their private keys. 

In the first case, nodes can protect routing information in the same way they protect data
traffic, i.e. through the use of cryptographic schemes such as digital signature. However, 
this defense is ineffective against attacks from compromised servers. Worse yet, we cannot 
neglect the possibility of nodes being compromised in an ad hoc network. Detection of com- 
promised nodes through routing information is also difficult in an ad hoc network because 
of its dynamically changing topology. When a piece of routing information is found to be in- 
valid, the information could be generated by a compromised node, or, it could have become 
invalid as a result of topology changes. It is difficult to distinguish between the two cases. 

On the other hand, certain properties of ad hoc networks can be exploited to achieve 
secure routing. For example, the routing protocols for ad hoc networks must handle outdated 
routing information to accommodate the dynamically changing topology. False routing 
information generated by compromised nodes could, to some extent, be considered outdated 
information. As long as there are sufficiently many correct nodes, the routing protocol 
should be able to find routes that go around these compromised nodes. Such capability of 
the routing protocols usually relies on the inherent redundancies due to multiple, possibly 
disjoint, routes between nodes in ad hoc networks. If routing protocols can discover multiple
routes (e.g. ZRP, DSR, TORA and AODV, discussed in Chapter 13, can all
achieve this), nodes can switch to an alternative route when the primary route appears to
have failed.

Diversity coding, discussed in Chapter 7, takes advantage of multiple paths in an efficient
way without message retransmission. The basic idea is to transmit
redundant information through additional routes for error detection and correction. For
example, if there are n disjoint routes between two nodes, then we can use n - r channels to
transmit data and use the other r channels to transmit redundant information. Even if certain
routes are compromised, the receiver may still be able to validate messages and to recover 
messages from errors using the redundant information from the additional r channels. 
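
The simplest instance of this idea, with r = 1, is a single XOR parity block sent over the additional route. The sketch below assumes equal-length data blocks and treats a failed route as a missing block (None); the function names are hypothetical, and detecting a route that delivers altered data would require additional redundancy or authentication.

```python
from functools import reduce
from typing import Optional


def xor_bytes(a: bytes, b: bytes) -> bytes:
    return bytes(x ^ y for x, y in zip(a, b))


def encode(blocks: list[bytes]) -> list[bytes]:
    """Send n - 1 equal-length data blocks plus one XOR parity block (r = 1)."""
    return blocks + [reduce(xor_bytes, blocks)]


def recover(received: list[Optional[bytes]]) -> list[bytes]:
    """Rebuild the data blocks when at most one route delivered nothing."""
    missing = [i for i, b in enumerate(received) if b is None]
    if len(missing) > 1:
        raise ValueError("r = 1 tolerates only one lost route")
    if missing:
        # The XOR of all surviving blocks equals the missing block.
        present = [b for b in received if b is not None]
        received = list(received)
        received[missing[0]] = reduce(xor_bytes, present)
    return received[:-1]  # drop the parity block
```

With n = 4 routes, three carry data and one carries parity, so the receiver recovers the message without retransmission when any single route fails.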

Key management service using a single CA (certification authority) in ad hoc networks 
is problematic. The CA, responsible for the security of the entire network, is a vulnerable 
point of the network. If the CA is unavailable, nodes cannot obtain the current public keys of
other nodes or establish secure communication with others. If the CA is compromised and
leaks its private key to an adversary, the adversary can then sign any erroneous certificate 
using this private key to impersonate any node or to revoke any certificate. 

A standard approach to improve availability of a service is replication. However, a simple 
replication of the CA makes the service more vulnerable. Compromise of any single replica, 
which possesses the service private key, could lead to collapse of the entire system. To solve 
this problem, the trust can be distributed to a set of nodes with a collective key management 
responsibility [69].

In such a system, the service as a whole has a public/private key pair. All nodes in 
the system know the public key of the service and trust any certificates signed using the 
corresponding private key. Nodes, as clients, can submit query requests to get other clients' 
public keys or submit update requests to change their own public keys. 

The key management service, with an (n, t + 1) configuration (n ≥ 3t + 1), consists
of n special nodes, called servers, located within an ad hoc network. Each server also has 
its own key pair and stores the public keys of all the nodes in the network. In particular, 
each server knows the public keys of other servers. Thus, servers can establish secure links 
among them. It is assumed that the adversary can compromise up to t servers in any period 
of time with a certain duration. 






If a server is compromised, then the adversary has access to all the secret information 
stored on the server. A compromised server might be unavailable or exhibit Byzantine be- 
havior (i.e. it can deviate arbitrarily from its protocols). It is also assumed that the adversary 
lacks the computational power to break the cryptographic schemes employed. The service is 
correct if robustness and confidentiality are preserved. Robustness assumes that the service 
is always able to process query and update requests from clients. Every query always returns 
the last updated public key associated with the requested client, assuming no concurrent 
updates on this entry. Confidentiality assumes that the private key of the service is never 
disclosed to an adversary so that an adversary is never able to issue certificates, signed by 
the service private key, for erroneous bindings. 

Threshold cryptography is used to accomplish distribution of trust in the key management 
service [71, 72]. An (n, t + 1) threshold cryptography scheme allows n parties to share the 
ability to perform a cryptographic operation (e.g. creating a digital signature), so that any 
t + 1 parties can perform this operation jointly, whereas it is infeasible for at most t parties 
to do so, even by collusion. 

In this case, the n servers of the key management service share the ability to sign 
certificates. For the service to tolerate t compromised servers, an (n, t + 1) threshold
cryptography scheme is employed and the private key k of the service is divided into n shares
(s1, s2, ..., sn), by assigning one share to each server. In this context (s1, s2, ..., sn) is referred
to as an (n, t + 1) sharing of k. The service is configured as illustrated in Figure 15.17.

For the service to sign a certificate, each server generates a partial signature for the cer- 
tificate using its private key share and submits the partial signature to a combiner. With t + 1 
correct partial signatures, the combiner is able to compute the signature for the certificate as 
illustrated in Figure 15.18, where servers generate a signature using a (3, 2) threshold signa- 
ture scheme. The compromised servers (there are at most t of them) cannot generate correctly 
signed certificates by themselves, because they can generate at most t partial signatures. 

If K/k is the public/private key pair of the service then by using a (3, 2) threshold
cryptography scheme in Figure 15.18, each server i gets a share s_i of the private key k. For
a message m, server i can generate a partial signature PS(m, s_i) using its share s_i. Correct
servers 1 and 3 both generate partial signatures and forward the signatures to a combiner c.
Even though server 2 fails to submit a partial signature, c is able to generate the signature
<m>_k of m signed by service private key k [71, 72].




Figure 15.17 The configuration of a key management service. 
Figure 15.18 Threshold signature with three servers.



A compromised server could generate an incorrect partial signature, and use of this partial 
signature would yield an invalid signature. Fortunately, a combiner can verify the validity of a 
computed signature using the service public key. In case verification fails, the combiner tries 
another set of t + 1 partial signatures. This process continues until the combiner constructs 
the correct signature from t + 1 correct partial signatures. More efficient robust combining
schemes have been proposed [73, 74]. These schemes exploit the inherent redundancies in the
partial signatures (note that any t + 1 correct partial signatures contain all the information 
of the final signature) and use error correction codes to mask incorrect partial signatures. In 
Gennaro et al. [73], a robust threshold DSS (digital signature standard) scheme is proposed. 
The process of computing a signature from partial signatures is essentially an interpolation. 
The authors use the Berlekamp and Welch decoder, so that the interpolation still yields a
correct signature despite a small portion (fewer than one-quarter) of partial signatures being 
missing or incorrect. 
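
The simple retry strategy described at the start of this paragraph can be sketched as
follows. This is a toy model under stated assumptions, not the scheme of [73, 74]: partial
signatures are modeled as points on a polynomial, combining is Lagrange interpolation, and
verification with the service public key is replaced by comparing a hash. The names P,
digest and robust_combine are hypothetical.

```python
from itertools import combinations
import hashlib

P = 2**61 - 1  # prime modulus for the toy field (assumption)


def interpolate_at_zero(points):
    """Lagrange interpolation at x = 0 modulo P."""
    total = 0
    for i, (xi, yi) in enumerate(points):
        num, den = 1, 1
        for j, (xj, _) in enumerate(points):
            if i != j:
                num = (num * (-xj)) % P
                den = (den * (xi - xj)) % P
        total = (total + yi * num * pow(den, -1, P)) % P
    return total


def digest(value):
    """Stand-in for public-key verification data (assumption)."""
    return hashlib.sha256(str(value).encode()).hexdigest()


def robust_combine(partials, t, public_check):
    """Try sets of t + 1 partial results until the combined value verifies."""
    for subset in combinations(partials, t + 1):
        candidate = interpolate_at_zero(subset)
        if digest(candidate) == public_check:
            return candidate, [x for x, _ in subset]
    return None, []


# Toy data: points on f(x) = 42 + 7x, with server 2 submitting a bad value.
partials = [(1, 49), (2, 999), (3, 63)]
signature, used = robust_combine(partials, 1, digest(42))
```

Here the first subset (servers 1 and 2) fails verification and the combiner succeeds with
servers 1 and 3. In the worst case the number of subsets tried grows combinatorially with
n, which is why the error-correcting approaches cited above are attractive.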

Even if the compromised servers are detected and excluded from the service, the adver- 
sary could still gather more than t shares of the private key from compromised servers over 
time. This would allow the adversary to generate any valid certificates signed by the private 
key. 

Proactive schemes [75, 76] are proposed as a countermeasure to mobile adversaries. A 
proactive threshold cryptography scheme uses share refreshing, which enables servers to 
compute new shares from old ones in collaboration without disclosing the service private 
key to any server. The new shares constitute a new (n, t + 1) sharing of the service private 
key. After refreshing, servers remove the old shares and use the new ones to generate partial 
signatures. Because the new shares are independent of the old ones, the adversary cannot 
combine old shares with new shares to recover the private key of the service. Thus, the 
adversary is challenged to compromise t + 1 servers between periodic refreshing. 
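
A minimal sketch of share refreshing follows, again assuming polynomial (Shamir-style)
sharing over a prime field, which is an illustrative choice rather than the construction of
[75, 76]. Each server contributes subshares of a random polynomial with zero constant
term; adding them changes every share but leaves the shared secret untouched. The names Q,
share, refresh and reconstruct are hypothetical, and verification of subshares is omitted.

```python
import secrets

Q = 2**127 - 1  # prime field modulus (assumption for this toy example)


def eval_poly(coeffs, x):
    """Evaluate a polynomial (lowest-order coefficient first) modulo Q."""
    y = 0
    for c in reversed(coeffs):
        y = (y * x + c) % Q
    return y


def share(secret, n, t):
    """Return {server_index: share} for an (n, t + 1) sharing."""
    coeffs = [secret] + [secrets.randbelow(Q) for _ in range(t)]
    return {x: eval_poly(coeffs, x) for x in range(1, n + 1)}


def refresh(shares, t):
    """Return new shares of the same secret, independent of the old ones."""
    new_shares = dict(shares)
    for _sender in shares:
        # Each server picks a random degree-t polynomial with zero constant
        # term and sends one subshare to every server (itself included).
        zero_poly = [0] + [secrets.randbelow(Q) for _ in range(t)]
        for receiver in shares:
            subshare = eval_poly(zero_poly, receiver)
            new_shares[receiver] = (new_shares[receiver] + subshare) % Q
    return new_shares


def reconstruct(points):
    """Lagrange interpolation at x = 0; `points` maps x to a share value."""
    secret = 0
    for xi, yi in points.items():
        num, den = 1, 1
        for xj in points:
            if xj != xi:
                num = (num * (-xj)) % Q
                den = (den * (xi - xj)) % Q
        secret = (secret + yi * num * pow(den, -1, Q)) % Q
    return secret


old = share(2024, 5, 2)   # a (5, 3) sharing
new = refresh(old, 2)
```

Any three new shares reconstruct 2024, as do any three old ones, but a mixture of old and
new shares interpolates to an unrelated value except with negligible probability.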

Share refreshing must tolerate missing newly generated shares (called subshares) and 
erroneous subshares from compromised servers. A compromised server may not send any 
subshares. However, as long as correct servers agree on the set of subshares to use, they 
can generate new shares using only subshares generated from t + 1 servers. For servers to 
detect incorrect subshares, verifiable secret sharing schemes are used, for example, those 
in Pedersen [77]. A verifiable secret sharing scheme generates extra public information for 
each (sub)share using a one-way function. The public information can attest to the correctness
of the corresponding (sub)shares without disclosing the (sub)shares.

A variation of share refreshing also allows the key management service to change its 
configuration from (n, t + 1) to (n', t' + 1). This way, the key management service can
adapt itself, on the fly, to changes in the network. If a compromised server is detected, the 
service should exclude the compromised server and refresh the exposed share. If a server is 
no longer available or if a new server is added, the service should change its configuration 
accordingly. For example, a key management service may start with the (7, 3) configuration.
If, after some time, one server is detected to be compromised and another server is no longer 
available, then the service could change its setting to the (5, 2) configuration. If two new 
servers are added later, the service could change its configuration back to (7, 3) with the 
new set of servers. 

15.7.1 Self-organized key management 

In this section we discuss a self-organizing public-key management system that allows users
to create, store, distribute and revoke their public keys without the help of any trusted
authority or fixed server [85]. In the system model, the public keys and the certificates of
the system are represented as a directed graph G(V, E), where V and E stand for the set of
vertices and the set of edges, respectively. This graph will be referred to as the certificate
graph. The vertices of the certificate graph represent public keys and the edges represent
certificates. More precisely, there is a directed edge from vertex Ku to vertex Kw if there is
a certificate signed with the private key of u that binds Kw to an identity. A certificate chain
from a public key Ku to another public key Kv is represented by a directed path from vertex
Ku to vertex Kv in G. Thus, the existence of a certificate chain from Ku to Kv means that
vertex Kv is reachable from vertex Ku in G (denoted below Ku →G Kv). In the following, the
certificate graph G designates the graph comprising only the valid (not expired) certificates
of the whole network. In the model, the updated and the nonupdated certificate repositories
of user u are represented by the certificate graphs Gu and Gnu, respectively. Therefore, for
any u, Gu is a subgraph of G, but Gnu is not necessarily a subgraph of G, as it may also
contain some implicitly revoked certificates.

Figure 15.19 The creation of nodes' updated and nonupdated repositories.

As shown in Figure 15.19, the initial phase of the scheme is executed in four steps: the 
creation of public/private key pairs, the issuing of certificates, the certificate exchange, and 
the creation of nodes' updated certificate repositories. 



In step 0, the user creates her own public/private key pair. 

In step 1, she issues public-key certificates based on her knowledge about other users'
public keys. The issuing of public-key certificates also continues when the system is fully
operational (i.e. when the updated and nonupdated repositories are already constructed)
as users get more information about other users' public keys. During this process, the
certificate graph G is created. The speed of the creation of a usable (i.e. sufficiently
connected) certificate graph heavily depends on the motivation of the users to issue
certificates.

In step 2, the node performs the certificate exchange. During this step, the node collects
certificates and thus creates its nonupdated certificate repository. Along with the creation
of new certificates, the certificate exchange also continues even when the system is
fully operational. This means that nodes' nonupdated repositories will be continuously
upgraded with new certificates. The nodes' mobility determines the speed at which
certificates are accumulated by the nodes themselves.

In step 3, the node constructs its updated certificate repository. The node can perform
this operation in two ways, either by communicating with its certificate graph neighbors
(Step 3a), or by applying the repository construction algorithm on the nonupdated
certificate repository (Step 3b). Once the node has constructed its updated certificate
repository, it is ready to perform authentication.



The authentication is performed in such a way that, when a user u wants to verify the
authenticity of the public key Kv of another user v, u tries to find a directed path from
Ku to Kv in Gu ∪ Gv. The certificates on this path are then used by u to authenticate Kv.
An example of a certificate graph with updated local repositories of the users is shown in
Figure 15.20. If there is no path from Ku to Kv in Gu ∪ Gv, u tries to find a path from Ku
to Kv in Gu ∪ Gnv. If such a path is found, u updates the expired certificates, checks their
correctness and performs authentication. If there is no path from Ku to Kv in Gu ∪ Gnv, u
fails to authenticate Kv. A detailed description of each operation can be found in Capkun
et al. [85].

Figure 15.20 A certificate graph and paths of certificates between users u and v in their
merged updated local repositories.
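
The path search over merged repositories can be sketched as a breadth-first search on the
union of two certificate graphs. The representation (a dictionary mapping an issuer's key
to the set of keys it certifies) and the names merge and find_chain are assumptions made
for illustration; certificate validation itself is not modeled.

```python
from collections import deque


def merge(*repositories):
    """Union of certificate graphs given as {issuer_key: set(subject_keys)}."""
    merged = {}
    for repo in repositories:
        for issuer, subjects in repo.items():
            merged.setdefault(issuer, set()).update(subjects)
    return merged


def find_chain(graph, k_u, k_v):
    """Breadth-first search for a directed path (certificate chain)."""
    parent = {k_u: None}
    queue = deque([k_u])
    while queue:
        key = queue.popleft()
        if key == k_v:
            chain = []
            while key is not None:
                chain.append(key)
                key = parent[key]
            return chain[::-1]
        for nxt in sorted(graph.get(key, ())):
            if nxt not in parent:
                parent[nxt] = key
                queue.append(nxt)
    return None  # no chain: authentication fails on this graph


# Hypothetical repositories: u certifies a, a certifies b; v's side holds b -> v.
G_u = {"Ku": {"Ka"}, "Ka": {"Kb"}}
G_v = {"Kb": {"Kv"}}
chain = find_chain(merge(G_u, G_v), "Ku", "Kv")
```

In this example neither repository alone contains a chain from Ku to Kv, but the merged
graph does, which is the reason for searching the union first and falling back to the
nonupdated repository only if that search fails.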



15.8 SECURITY IN SENSOR NETWORKS 

Owing to their inherent simplicity, sensor network routing protocols are sometimes even
more susceptible to the attacks known from general ad hoc routing protocols. Most network
layer attacks against sensor networks fall into one of the following categories: spoofed,
altered or replayed routing information, selective forwarding, sinkhole attacks, Sybil
attacks, wormholes, HELLO flood attacks or acknowledgement spoofing. Spoofing, altering or
replaying
routing information may be used by adversaries to create routing loops, attract or repel 
network traffic, extend or shorten source routes, generate false error messages, partition the 
network, increase end-to-end latency, etc. 

A selective forwarding attack refers to the case where malicious nodes refuse to forward
certain messages and simply drop them, ensuring that they are not propagated any further.
Selective forwarding attacks are typically most effective when the attacker is explicitly
included on the path of a data flow. Below, we discuss sinkhole attacks and the Sybil
attack, two mechanisms by which an adversary can efficiently include herself on the path
of the targeted data flow.

Sinkhole attacks typically work by making a compromised node look especially attractive
to surrounding nodes with respect to the routing algorithm. For instance, an adversary could 
spoof or replay an advertisement for an extremely high quality route to a base station. 

One motivation for mounting a sinkhole attack is that it makes selective forwarding 
trivial. By ensuring that all traffic in the targeted area flows through a compromised node, an 
adversary can selectively suppress or modify packets originating from any node in the area. 

A Sybil attack refers to the case where a single node presents multiple identities to other
nodes in the network. Sybil attacks pose a significant threat to geographic routing protocols. 
Location-aware routing often requires nodes to exchange coordinate information with their 
neighbors to efficiently route geographically addressed packets. It is only reasonable to 
expect a node to accept but a single set of coordinates from each of its neighbors, but by 
using the Sybil attack an adversary can 'be in more than one place at once'. 

The wormhole attack refers to the case where an adversary tunnels messages received in 
one part of the network over a low latency link and replays them in a different part. Wormhole 
attacks most commonly involve two distant malicious nodes colluding to understate their
distance from each other by relaying packets along an out-of-band channel available only
to the attacker. An adversary situated close to a base station may be able to completely 
disrupt routing by creating a well-placed wormhole. An adversary could convince nodes 
who would normally be multiple hops from a base station that they are only one or two hops 
away via the wormhole. This can create a sinkhole: since the adversary on the other side of 
the wormhole can artificially provide a high-quality route to the base station, potentially all 
traffic in the surrounding area will be drawn through it if alternate routes are significantly 
less attractive. This will most likely always be the case when the endpoint of the wormhole is 
relatively far from a base station. Figure 15.21 shows an example of a wormhole being used 
to create a sinkhole. Wormholes can also be used simply to convince two distant nodes that 
they are neighbors by relaying packets between the two of them. Wormhole attacks would
probably be used in combination with selective forwarding or eavesdropping. Detection is
potentially difficult when wormholes are used in conjunction with the Sybil attack.
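
The effect of a wormhole on hop-count based routing can be illustrated with a small
sketch. The chain topology, the node names and the modeling of the tunnel as a single
extra link are all assumptions made for illustration; they are not taken from a particular
protocol.

```python
from collections import deque


def hop_counts(links, base):
    """Breadth-first hop distance from the base station over undirected links."""
    neighbors = {}
    for a, b in links:
        neighbors.setdefault(a, set()).add(b)
        neighbors.setdefault(b, set()).add(a)
    dist = {base: 0}
    queue = deque([base])
    while queue:
        node = queue.popleft()
        for nxt in neighbors.get(node, ()):
            if nxt not in dist:
                dist[nxt] = dist[node] + 1
                queue.append(nxt)
    return dist


# Hypothetical chain: base - n1 - n2 - n3 - n4 - n5
honest_links = [("base", "n1"), ("n1", "n2"), ("n2", "n3"),
                ("n3", "n4"), ("n4", "n5")]
before = hop_counts(honest_links, "base")

# The wormhole replays base-station traffic next to n5, so n5 believes the
# base station is a direct neighbor. Modeled here as one extra link.
after = hop_counts(honest_links + [("base", "n5")], "base")

# Nodes whose apparent distance shrank will prefer the route via the wormhole.
attracted = sorted(n for n in before if after[n] < before[n])
```

In this toy topology n5 drops from five hops to one and n4 from four hops to two, so both
would route through the adversary, which is the sinkhole effect described above.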







Figure 15.21 A laptop-class adversary using a wormhole to create a sinkhole in TinyOS 
beaconing. 

HELLO floods can be thought of as one-way, broadcast wormholes. Many protocols
require nodes to broadcast HELLO packets to announce themselves to their neighbors, 
and a node receiving such a packet may assume that it is within (normal) radio range of 
the sender. This assumption may be false: a laptop-class attacker broadcasting routing or 
other information with large enough transmission power could convince every node in the 
network that the adversary is its neighbor. 

An adversary does not necessarily need to be able to construct legitimate traffic in order 
to use the HELLO flood attack. It can simply re-broadcast overheard packets with enough 
power to be received by every node in the network. 

Acknowledgement spoofing refers to the case where an adversary can spoof link layer 
acknowledgments for 'overheard' packets addressed to neighboring nodes. Goals include 
convincing the sender that a weak link is strong or that a dead or disabled node is alive. 
For example, a routing protocol may select the next hop in a path using link reliability. 
Artificially reinforcing a weak or dead link is a subtle way of manipulating such a scheme. 
Since packets sent along weak or dead links are lost, an adversary can effectively mount 
a selective forwarding attack using acknowledgement spoofing by encouraging the target 
node to transmit packets on those links. 

Attacks on specific sensor network routing protocols and possible countermeasures are
discussed in References [86, 87].


Active Networks



16.1 INTRODUCTION 

The basic goals of active networking (AN) are to create networking technologies that, 
in contrast to current networks, are easy to evolve and which allow application specific 
customization. To achieve these goals, AN uses a simple idea, that the network would be 
easier to change and customize if it were programmable [1-6]. While AN has the high-level 
goals of improving evolvability and customizability, there are a number of low-level concerns
that must be balanced to achieve these goals. The first of these concerns is flexibility. AN 
systems aim to significantly improve the flexibility with which we can build networks. The 
second concern is safety and security. It is crucial that, while adding flexibility, we do not 
compromise the safety or security of the resulting system. The third concern is performance. 
If adding flexibility results in a system that cannot achieve its performance goals, it will be 
pointless. The final concern is usability. It is important that the resulting system not be so 
complex as to be unusable. 

One of the basic techniques of AN is that code (program) is moved to the node at which it 
should execute. One place this idea first arose was in distributed systems supporting process 
migration. Another significant early influence was in generalization of remote procedure 
call (RPC) to support remote evaluation (RE). Both of these techniques will be discussed 
in detail in applications for network management in Chapter 18. The next important step in 
this direction was a DARPA proposal on the topic of 'Protocol boosters for distributed com- 
puting systems'. The idea was to dynamically construct protocols using protocol element 
insertion and deletion on an as-needed basis, to respond to network dynamics. Protocols 
were constructed optimistically; that is, ideal operating conditions (no errors, low delays, 
adequate throughput, etc.) were assumed, and protocol elements (such as error detection 
and correction mechanisms) were inserted into protocols on-demand, as conditions were 
encountered that deviated from the best case where the protocol element would not be 
needed. 

On the network level the previous concepts resulted in domain-specific 'middleboxes' [7]
such as firewalls, network address translators (NATs) and intrusion detection systems
(IDSs). Examples of such middleboxes include NATs [8-10], NAT with protocol translator
(NAT-PT) [11], SOCKS gateways [12], packet classifiers, markers and schedulers [13], TCP
performance enhancing proxies [14], IP firewalls [15, 16], gatekeepers/session control
boxes [17, 18], transcoders and proxies [17, 19]. These various application-driven ad hoc
examples of STF (store, translate and forward) functionality can be unified in a common
framework, and embedded in a common programming model reinforced with safety and security.

The IETF's forwarding and control element separation (ForCES) working group [20] 
models router architectures in a way that allows the introduction of programmable and active 
technologies into the Internet control plane, in the style of BBN's FIRE [21].

The principal observation here is that forwarding and routing are distinct activities. 
Routing is part of the 'control plane' of the IP Internet, while forwarding is the 'transport 
plane'. These activities were consolidated in traditional IP routers, but are now recognized 
as logically separate (viz. MPLS). The separation permits more general network elements to 
replace IP routers in performing control plane activities, allowing distributed routing with 
improved performance, slightly more global optimization, and perhaps surprisingly, an 
increase in security. In addition, the separation of routing and forwarding functions permits 
decoupling of their performance, allowing better tracking of technology improvements in 
both forwarder mechanics and routing algorithms. 

Active router control uses fast forwarders as a virtual link layer, managed by specialized 
active router controllers (ARCs), as shown in Figure 16.1. Using a set of routers as a 'router 
in a room' is not uncommon; one could simply reduce their autonomy and specialize them 
to IP forwarding, making way for source routing over all optical networks or routing/router 
control internal to the network. The use of general purpose network elements permits this 
separation of concerns, while offering the potential for improvement of the Internet on a 
number of axes. 




Figure 16.1 Active router controller managing a set of forwarder/routers. 
Figure 16.2 ARCs distributed throughout an Internet.



In this configuration, choices of routes are made by the ARC. A significant advantage 
is that specialized forwarding tables can be loaded into each forwarder; these tables can be 
small since entries must exist only for adjacent nodes. 
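
The division of labor between an ARC and its forwarders can be sketched as follows. This
is one possible reading, stated as an assumption: the controller holds the topology,
computes a route with Dijkstra's algorithm, and installs in each forwarder on the route an
entry whose next hop is an adjacent node. The class ActiveRouterController and its methods
are hypothetical names.

```python
import heapq


def shortest_path(topology, src, dst):
    """Dijkstra over {node: {neighbor: cost}}; returns the node list or None."""
    best = {src: 0}
    prev = {}
    heap = [(0, src)]
    while heap:
        cost, node = heapq.heappop(heap)
        if node == dst:
            path = [dst]
            while path[-1] != src:
                path.append(prev[path[-1]])
            return path[::-1]
        if cost > best.get(node, float("inf")):
            continue  # stale heap entry
        for nxt, weight in topology[node].items():
            new_cost = cost + weight
            if new_cost < best.get(nxt, float("inf")):
                best[nxt] = new_cost
                prev[nxt] = node
                heapq.heappush(heap, (new_cost, nxt))
    return None


class ActiveRouterController:
    """Hypothetical ARC: computes routes and pushes tables to forwarders."""

    def __init__(self, topology):
        self.topology = topology
        # forwarder -> {destination: next hop}
        self.tables = {node: {} for node in topology}

    def install_route(self, src, dst):
        path = shortest_path(self.topology, src, dst)
        if path is None:
            return None
        for here, nxt in zip(path, path[1:]):
            self.tables[here][dst] = nxt  # next hop is always an adjacent node
        return path


topology = {
    "A": {"B": 1, "C": 4},
    "B": {"A": 1, "C": 1, "D": 5},
    "C": {"A": 4, "B": 1, "D": 1},
    "D": {"B": 5, "C": 1},
}
arc = ActiveRouterController(topology)
route = arc.install_route("A", "D")
```

After the call, forwarders A, B and C each hold a single entry for destination D, and no
forwarder runs a routing computation of its own.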

A key advantage of the ARC model is that, for computationally centered tasks (routing, 
or more general computations if the programmability of ARCs is exploited), a computer 
which tracks computer technology trend exponentials (faster CPU, larger RAM) is used, 
while the forwarders independently track networking technology trend exponentials such 
as bandwidth improvements. 

The basic distributed control architecture of Figure 16.1 can be replicated throughout 
an Internetwork, with the active elements using the managed Internet routes as link layers. 
This is shown in Figure 16.2, where a set of active nodes has been grafted into a larger 
collection of forwarders to create an active Internet. 



16.2 PROGRAMMABLE NETWORKS REFERENCE MODELS

IEEE P1520 denotes the IEEE Project 1520 standards initiative for programmable network
interfaces and its corresponding reference model [22]. The IEEE P1520 RM, depicted in
Figure 16.3, defines the following four interfaces.

(1) CCM interface - the connection control and management interface is a collection
of protocols that enable the exchange of state and control information at a very low
level between the NE and an external agent.

(2) L-interface - this defines an application program interface (API) that consists of
methods for manipulating local network resources abstracted as objects. This
abstraction isolates upper layers from hardware dependencies or other proprietary
interfaces.

(3) U-interface - this mainly provides an API that deals with connection setup issues.
The U-interface isolates the diversity of connection set-up requests from the actual
algorithms that implement them.

(4) V-interface - this provides a rich set of APIs to write highly customized software,
often in the form of value-added services.

Figure 16.3 P1520 reference model and the L-interface abstraction model.

CCM and L-interfaces fall under the category of NE (network element) interfaces,
whereas U- and V-interfaces constitute network-wide interfaces. Initial efforts through
the ATM sub-working group (P1520.2) focused on telecommunication networks based on
ATM and introduced programmability in the control plane [23]. Later, the IP sub-working
group extended these principles to IP networks and routers.



16.2.1 IETF ForCES 

Recently, a working group of IETF, called forwarding and control element separation
(ForCES) was formed with a similar objective to that of P1520 [24, 25]. Document [25]
classifies the NE as a collection of components of two types: control elements (CE) and
forwarding elements (FE) operating in the control and forwarding (transport) planes,
respectively. CEs host control functionality, like routing and signaling protocols, whereas
FEs perform operations on packets passing through them, like header processing, metering,
scheduling, etc. CEs and FEs may be interconnected with each other in every possible
combination (CE-CE, CE-FE, FE-FE), thus forming arbitrary types of logical topologies
(see Figure 16.4). Every distinct combination defines a reference point, namely Fr, Fp and
Fi, respectively. Each one of these reference points may define a protocol or a collection
thereof, but the ForCES protocol is only defined for the Fp reference point.

Figure 16.4 ForCES architectural representation of NE.



16.2.2 Active networks reference architecture 

Active networks transform the store-and-forward network into a store-compute-and-forward
network. The difference here is that packets are no longer passive but rather active in
the sense that they carry executable code together with their data payload. This code is 
dispatched and executed at designated (active) nodes performing operations on the packet 
data as well as changing the current state of the node to be found by the packets that follow. 
Two approaches are possible based on whether programs and data are carried discretely, 
within program and data packets (out-of-band) or in an integrated manner, i.e. in-band. 

In the discrete case, injecting code into the node and processing packets are two separate
tasks. The user or network operator first injects his customized code into the routers along a
path. Then the data packet arrives, its header is examined and the appropriate preinstalled 
code is loaded to operate on its contents [26, 27]. Separate mechanisms for loading and 
executing may be required for the control thereof. This separation enables network opera- 
tors to dynamically download code to extend a node's capabilities, which in turn become 
available to customers through execution. 

At the other extreme lies the integrated approach where code and data are carried by 
the same packet [28]. In this context, when a packet arrives at a node, code and data are 
separated, and the code is loaded to operate on the packet or change the state of the node. 
A hybrid approach has also been proposed [29]. 
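
The difference between the discrete and the integrated approach can be made concrete with
a toy node. The class ActiveNode, its methods and the packet layout are hypothetical; the
sketch deliberately leaves out the safety and security checks that a real active node
would need before running foreign code.

```python
class ActiveNode:
    """Toy active node supporting both code-carrying styles (assumed API)."""

    def __init__(self):
        self.installed = {}  # code id -> callable, loaded out-of-band
        self.state = {}      # node state that later packets can observe

    def inject(self, code_id, handler):
        """Discrete approach, step 1: install code ahead of the data."""
        self.installed[code_id] = handler

    def receive_discrete(self, packet):
        """Discrete approach, step 2: the header selects preinstalled code."""
        handler = self.installed[packet["code_id"]]
        return handler(self.state, packet["data"])

    def receive_integrated(self, packet):
        """Integrated approach: the packet carries its own code (in-band)."""
        return packet["code"](self.state, packet["data"])


def count_and_upper(state, data):
    """Example program: transforms the payload and updates node state."""
    state["seen"] = state.get("seen", 0) + 1
    return data.upper()


node = ActiveNode()
node.inject("upper", count_and_upper)
out_discrete = node.receive_discrete({"code_id": "upper", "data": "abc"})
out_integrated = node.receive_integrated({"code": count_and_upper, "data": "xyz"})
```

Both calls run the same program and both leave a trace in the node state; they differ only
in whether the code arrived before the data or together with it.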

The active networks reference architecture model [30, 31] is shown in Figure 16.5. An 
active network is a mixture of active and legacy (nonactive) nodes. The active nodes run a
node operating system (NodeOS), not necessarily the same one on every node, while a number
of execution environments (EE) coexist at the same node. Finally, a number of active
applications (AA) make use of services offered by the EEs.

The NodeOS simultaneously supports multiple EEs. Its major functionality is to provide
isolation among EEs through resource allocation and control mechanisms, and to provide 
security mechanisms to protect EEs from each other. It may also provide other basic facilities 
like caching or code distribution that EEs may use to build higher abstractions to be presented 
to their AAs. All these capabilities are encapsulated by the node interface through which 
EEs interact with the NodeOS. This is the minimal fixed point at which interoperability is 
achieved [31]. In contrast EEs implement a very broad definition of a network API ranging 
from programming languages to virtual machines, to static APIs in the form of a simple list 
of fixed-size parameters, etc. [32]. An EE takes the form of a middleware toolkit for creating,
composing and deploying services. 

The AN reference architecture [30] is designed for simultaneously supporting a multi- 
plicity of EEs at a node. Only EEs of the same type are allowed to communicate with each 
other, whereas EEs of different type are kept isolated from each other. A thorough analysis 
and comparison of programmable networks may be found in References [33, 34]. 

The FAIN (future active IP network) NE reference architecture is depicted in Figure 16.6. 
It describes how the ingredients identified previously may synergistically be combined in 
building next-generation NEs capable of seamlessly incorporating new functionality or 
dynamically configured to change their behavior according to new service requirements. 
One of the key concepts defined by the FAIN architecture is the EE. In FAIN, drawing 
from an analogy based on the concepts of class and object in object-oriented systems, we 
distinguish EEs between the EE type and the EE instances thereof. 

An EE type is characterized by the programming methodology and the programming 
environment that is created as a result of the methodology used. The EE type is free of 
any implementation details. In contrast, an EE instance represents the realization of the EE 
type in the form of a runtime environment by using specific implementation technology, 
e.g. programming language and binding mechanisms to maintain operation of the runtime 
environment. For details see References [21, 35]. 







Figure 16.6 FAIN NE reference architecture. 



16.3 EVOLUTION TO 4G WIRELESS NETWORKS 



Given the volume of investments in existing networks infrastructure, a possible way to 
4G might be evolution rather than any revolution. By 'network evolution' we mean any 
incremental change to a network that modifies or enhances existing functionality or adds 
new functionality. In the context of active networking, evolution should be able to occur 
at remote nodes while the network is operational with only minimal disruption to existing 
services. AN achieves evolution by changing the programs that operate the network. Thus the 
ways in which we can evolve the network are dictated by the programmability mechanisms 
that are available to make such changes. 

Active packets (AP) are perhaps the most radical AN technology for evolution and they
are the only mechanism that, at a high level, is specific to AN. Such packets carry (or
literally are) programs that execute as they pass through the nodes of the network. A packet
can perform management actions on the nodes, effect its own routing, or form the basis of 
larger protocols between distributed nodes, e.g. routing protocols. Such packets (ANTS [38], 
Smart Packets [39] and PLAN [40]) are like conventional packets, but with qualitatively 
more power and flexibility. 

Active packet evolution does not require changes to the nodes of the network. Instead, it 
functions solely by the execution of APs utilizing standard services. The disadvantage of 
this approach is that taking advantage of new functionality requires the use of new packet
programs. This means that at some level the applications using the functionality must be 
aware that the new functionality exists. This is the kind of evolution facilitated by pure AP 
systems, such as ANTS [38] and in essence it embodies the AN goal of application-level 
customization. 

The programmability mechanism that is broadly familiar outside the AN community is 
the plug-in extension. Plug-in extensions can be downloaded and dynamically linked into a 
node to add new node-level functionality. For this new functionality to be used, it must be 
callable from some prebuilt and known interface. For example, a packet program will have 
a standard way of calling node resident services. If it is possible to add a plug-in extension 
to the set of callable services (typically by extending the service name space) then such an 
extension 'plugs in' to the service call interface. 
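
A minimal sketch of this mechanism, assuming a node whose service-call interface is a
name-to-function table: loading a plug-in simply extends that name space, after which
packet programs can call the new service through the interface they already know. The
class Node, the service names and packet_program are hypothetical.

```python
class Node:
    """Toy node with a prebuilt service-call interface (assumed design)."""

    def __init__(self):
        # Packet programs look services up by name in this table.
        self.services = {"hostname": lambda node: "node-1"}

    def call(self, name, *args):
        if name not in self.services:
            raise LookupError(f"service {name!r} is not installed")
        return self.services[name](self, *args)

    def load_plugin(self, name, func):
        """Extend the service name space with a downloaded extension."""
        self.services[name] = func


def packet_program(node):
    """A packet program that uses a service which may not exist yet."""
    try:
        return node.call("queue_length")
    except LookupError:
        return None


node = Node()
result_before = packet_program(node)              # service not yet available
node.load_plugin("queue_length", lambda n: 7)     # plug-in adds the service
result_after = packet_program(node)
```

The packet program itself is unchanged between the two calls; only the set of callable
services on the node has grown.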

Update extensions, a further programmability mechanism, may also be downloaded, but
they go beyond plug-in extensions in that they can update or modify existing
code and can do so even while the node remains operational. Thus, such extensions can add
to or modify a system's functionality even when there does not exist an interface for them
to hook into.

An example of an AP evolution program written in a special-purpose language, PLAN
(packet language for AN) [40], for a mobility management protocol can be found in Song
et al. [36]. Before a mobile node leaves its home network, it must identify the router that 
serves as its home agent. For simplicity, it is assumed that its default router serves this 
purpose. Once the node has attached itself to a new network and has a unique address, 
it sends an AP containing a control program to register itself to its home agent. When 
executed, this program simply adds the information to the home agent's soft-state keyed 
by the mobile node's home network address. Both the application aware and transparent 
versions share the same soft-state entries, allowing them to use the same control program 
and to co-exist. 

Figure 16.7 Active packets for mobile-IP (MH, mobile host; HA, home agent; CH,
corresponding host).

Figure 16.7 illustrates how we detect that a packet is at the home agent of a mobile host
and how the packet is then tunneled to the unique address. The PLAN code for the packet
that must be sent by the application is [36]:

fun getToAgent(dest, payload, port) =
  try
    let val fagent = lookupTuple(dest) in
      OnRemote(|FoundFA|(dest, payload, port), fagent, getRB(), defRoute)
    end
  handle NotFound =>
    if (thisHostIs(dest)) then deliver(payload, port)
    else
      let val next = defaultRoute(dest) in
        OnNeighbor(|getToAgent|(dest, payload, port), #1 next, getRB(), #2 next)
      end

fun FoundFA(dest, payload, port) =
  let val hop = defaultRoute(dest) in
    OnNeighbor(|deliver|(payload, port), #1 hop, getRB(), #2 hop)
  end

getToAgent is the main function. When it executes, it first looks up dest in the soft-store
using lookupTuple. If that succeeds, the packet has found the home agent, and it uses
OnRemote to send a new packet, the tunnel, that will execute FoundFA at the foreign agent
with the same arguments as getToAgent.

OnRemote provides multihop transmission of the packet without execution until it
reaches the foreign agent. If the lookup fails, the exception handler executes. If the packet
has actually reached the host, the payload is delivered. Otherwise, the handler looks up the
next hop toward dest and uses OnNeighbor, which transmits a packet only one hop, to send
the packet on. Thus the packet travels hop by hop looking for the home agent. The FoundFA
function executes on the foreign agent, which in this case is the mobile host but might be
some other node on the same subnet. It sends a packet to dest that does the delivery. This is
where the original packet is removed from the tunnel. Notice that all of that functionality is
encoded in the tunnel packet program itself; the foreign agent does not need to have any
knowledge of its role as a tunnel endpoint: it just has to support PLAN.
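The forwarding logic described above can be mimicked outside PLAN. The following Python
sketch is a toy model and does not reproduce PLAN semantics: the Node class, the soft_state
and next_hop tables and the node names are assumptions introduced only for illustration. The
packet advances one hop at a time until it meets a node holding a soft-state entry for dest,
and is then handed to the registered foreign agent without further lookups.

```python
from dataclasses import dataclass, field


@dataclass
class Node:
    name: str
    soft_state: dict = field(default_factory=dict)  # home address -> foreign agent
    next_hop: dict = field(default_factory=dict)    # destination -> neighbour name


def get_to_agent(nodes, start, dest, payload):
    """Return (trace, delivered_at) for a packet addressed to dest.

    payload is carried along only for parity with the PLAN signature.
    """
    trace, here = [start], start
    while True:
        node = nodes[here]
        if dest in node.soft_state:        # lookupTuple succeeded: this is the home agent
            fagent = node.soft_state[dest]
            trace.append(fagent)           # OnRemote: tunnel, no execution on the way
            return trace, fagent           # FoundFA would deliver from here
        if here == dest:                   # thisHostIs(dest): the host is at home
            return trace, here
        here = node.next_hop[dest]         # OnNeighbor: advance exactly one hop
        trace.append(here)


# Hypothetical topology: corresponding host -> router -> home agent.
nodes = {
    "CH": Node("CH", next_hop={"MH_home": "R1"}),
    "R1": Node("R1", next_hop={"MH_home": "HA"}),
    "HA": Node("HA", soft_state={"MH_home": "MH_foreign"}),
}
trace, delivered_at = get_to_agent(nodes, "CH", "MH_home", b"hello")
print(trace, delivered_at)
```

In this simplified model the tunnel is a single step; in PLAN the tunnel packet may cross
several routers, but none of them evaluates it.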

If a node defines an extensible interface, new extensions can be loaded and plugged into 
this interface to provide extended or enhanced functionality. This is the essence of plug-in 
extension evolution. As an illustration consider the pseudocode shown below for a simple 
AP that implements route discovery [36]: 

Route Discovery (Simple DSR)

procedure Route Request(Target, RouteRec)
  if Duplicate Request Packet Or My Address Already In Route Rec
    then Discard
  else
    if This Host is Target
      then Route Reply(RouteRec)
    else
      Append My Address To Route
      Flood To All Neighbours


The packet itself embodies many key aspects of the protocol directly. In particular, it
does duplicate elimination, tests for routing loops, detects termination and sends a reply,
and performs flooding. In general, since many protocols have relatively simple control flow,
simple APs can implement key aspects of the protocol directly. This algorithm benefits
from node-resident services that are specific to the protocol, particularly to detect whether
this is a duplicate request or whether the current node is already in the route record.
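The route discovery procedure can be tried out on a small graph. The sketch below is a
simplified, centralised simulation written for illustration; the neighbours dictionary and the
node names are assumptions, and the per-node duplicate detection is modelled by a single
shared set rather than by node-resident services.

```python
from collections import deque


def route_discovery(neighbours, source, target):
    """Flood a route request from source; return the first route record reaching target."""
    seen = set()                            # nodes that have already handled this request
    queue = deque([(source, [source])])     # (current node, route record so far)
    while queue:
        node, route_rec = queue.popleft()
        if node in seen:                    # duplicate request packet: discard
            continue
        seen.add(node)
        if node == target:                  # this host is the target: send the reply
            return route_rec
        for nxt in neighbours[node]:        # flood to all neighbours
            if nxt not in route_rec:        # address already in the record would be a loop
                queue.append((nxt, route_rec + [nxt]))
    return None                             # request died out without reaching the target


# Hypothetical five-node topology.
graph = {
    "S": ["A", "B"],
    "A": ["S", "C"],
    "B": ["S", "C"],
    "C": ["A", "B", "T"],
    "T": ["C"],
}
route = route_discovery(graph, "S", "T")
print(route)
```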



16.4 PROGRAMMABLE 4G MOBILE NETWORK ARCHITECTURE 



In this section, we introduce a possible high-level architecture for future mobile systems 
with the focus on programmability [37, 41]. To address the creation and provisioning of 
unanticipated services, the whole system has to be designed to be as flexible as possible. 
Openness and configurability, not only at the service level but also with respect to the whole 
system architecture, will attract third-party vendors to evolve the system as it unfolds and is 
therefore the key to viable solutions. As pointed out in Chapter 1, 4G systems are expected 
to provide an integration platform to seamlessly combine heterogeneous environments. 
Core abstraction layers cover hardware platform, network platform, middleware platform 
and applications. The abstraction layers can interact with each other using well-defined 
interfaces. Besides their regular cooperation in an operational setting, each layer can be 
configured separately and independently via configuration interfaces. 

The generic architecture for the network elements of a mobile network is shown in 
Figure 16.8 (excluding applications). In this architecture, the following abstraction layers 
are considered, each programmable with configurable components: 

• Middleware platform, typically with a virtual execution environment for platform- 
independent distributed processing. 

• The computing platform serves as a general purpose platform for processing stateful 
protocols, e.g. routing, QoS signaling or connection management. 

• The forwarding engine is in the data path of a network node and it connects the interface
modules, e.g. by a switch matrix. This engine can be implemented as dedicated
hardware or as a kernel module of common operating systems. The forwarding engine is
programmable for performance-critical tasks which are performed on a per-packet basis.

• The interface modules are medium specific for different wireless or wired standards.
They can be configured or programmed to adapt to new physical layer protocols or for
triggering events in higher layers.

Figure 16.8 Programmable network element model.

Figure 16.9 Mobile terminal architecture.

There are different approaches to active networking in this architecture. Some approaches 
offer a virtual execution environment, e.g. a Java virtual machine, on the middleware layer. 
Some options also include native code in the computing platform, e.g. for flexible signaling. 
Others employ programmable hardware for forwarding. 

A key ingredient in these approaches is interfaces to the lower layers and programmable 
filters for identifying the packets to be processed in the active networking environment. The 
terminal architecture is shown in Figure 16.9. It consists of: 

• Middleware platform, typically with a limited virtual execution environment. 

• Smart Card, e.g. USIM for UMTS, which includes subscriber identities and also a small, 
but highly secure execution environment. This can be used ideally for personal services 
like electronic wallet. 

• Programmable hardware, which is designed for one or more radio standards. 

• Native operating system which provides real-time support, needed for stacks and certain 
critical applications, e.g. multimedia codecs. 

Compared with network elements, the SmartCard is a new, programmable component.
Owing to resource limitations, the forwarding engine and computation platform collapse
into one operating system platform. Also, the middleware layers are typically quite restricted.
Service deployment and control of reconfigurations are complex since there is a split of 
responsibility between operator and manufacturer. For instance, the manufacturer has to 
provide or digitally sign appropriate low-level code for reconfigurations. On the other hand, 
the operator is interested in controlling the configuration to fit the user and network needs. 

There are a number of examples that show the importance of flexible networking protocols
and the need for dynamic cross-layer interfaces in future mobile networks. As an example
we show that hand-over can be optimized by information about the user context [37, 42, 43],
as shown in Figure 16.10. In this scenario, the terminal moves into an area covered by both
AP2 and AP3. The problem is to decide which access point (AP) to choose. The state of the
art comprises many algorithms based on signal strength analysis or on available radio
resources. Even if one AP is slightly better regarding these local measurements, the decision
may not be the best. For instance, if the terminal in Figure 16.10 is on the train, it is obviously
better to hand over to AP3, even if AP2 is more reachable for a short period of time. In many
cases, the hand-over can be optimized by knowledge of terminal movement and user
preferences. For instance, if the terminal is in a car or train, its route may be constrained to
certain areas. Also, the terminal profile may contain the information that the terminal is built
into a car. Alternatively, the movement pattern of a terminal may suggest that the user is on a
train. A main problem is that hand-over decisions have to be executed fast. However, the
terminal profile and location information are often available on a central server in the core
network. Retrieving this information may be too slow for hand-over decisions.

Figure 16.10 Context-aware hand-over prediction.

Furthermore, the radio conditions during hand-over may be poor and hence limit such 
information exchange. The idea of the solution is to proactively deploy a context-aware 
decision algorithm onto the terminal which can be used to assist hand-over decisions. A 
typical example is the information about the current movement pattern, e.g. by knowledge 
of train or road routes. For implementation, different algorithms can be deployed by the 
network on the terminal, depending on the context information. This implementation needs 
a cross layer interface, which collects the context information from different layers and 
makes an optimized decision about deploying a decision algorithm. 
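A decision algorithm of the kind that could be deployed on the terminal might look like the
following Python sketch. It is only an illustration under stated assumptions: the function
name, the representation of the context as a set of access points along the predicted route,
and the margin_db tolerance are hypothetical and do not come from References [37, 42, 43].

```python
def choose_access_point(signal_dbm, route=None, margin_db=6.0):
    """Return the AP to hand over to.

    signal_dbm: dict mapping AP name -> measured signal strength in dBm.
    route:      optional collection of APs along the predicted path (the context).
    margin_db:  hypothetical tolerance; an on-route AP may be this much weaker.
    """
    strongest = max(signal_dbm, key=signal_dbm.get)
    if route:
        on_route = [ap for ap in signal_dbm
                    if ap in route and signal_dbm[strongest] - signal_dbm[ap] <= margin_db]
        if on_route:
            return max(on_route, key=signal_dbm.get)
    return strongest                      # no usable context: fall back to signal strength


measured = {"AP2": -60.0, "AP3": -64.0}
without_context = choose_access_point(measured)
on_train = choose_access_point(measured, route={"AP1", "AP3"})
print(without_context, on_train)
```

With signal strength alone the terminal picks AP2; once the route context is available
locally, AP3 is preferred although it is slightly weaker.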

16.5 COGNITIVE PACKET NETWORKS 

We now discuss packet switching networks in which intelligence is incorporated into the
packets, rather than at the nodes or in the protocols. Networks which contain such packets are
referred to as cognitive packet networks (CPN). Cognitive packets in CPN route themselves,
and learn to avoid congestion and being lost or destroyed. Cognitive packets learn from their
own observations about the network and from the experience of other packets. They rely
minimally on routers. Each cognitive packet progressively refines its own model of the
network as it travels through the network, and uses the model to make routing decisions.
Cognitive packets (CPs) store information in their private cognitive map (CM) and update
the CM and make their routing decisions using the code which is in each packet. This code
may include neural networks or other adaptive algorithms. Figure 16.11 presents the format
of a cognitive packet. The manner in which cognitive memory at a node is updated by the
node's processor is shown in Figure 16.12. In a CPN, the packets use nodes as 'rented space',
where they make decisions about their routes. They also use nodes as places where they can
read their mailboxes. Mailboxes may be filled by the node, or by other packets which pass
through the node. Packets also use nodes as processors which execute their code to update
their CM and then execute their routing decisions. As a result of code execution, certain
information may be moved from the CP to certain mailboxes. The nodes may execute the
code of CPs in some order of priority between classes of CPs, for instance as a function of
the QoS requirements which are contained in the identification field.

Figure 16.12 CP update by a CPN node.

As a routing decision, a CP will be placed in some output queue, in some order of
priority, determined by the CP code execution. A CPN and a CPN node are schematically
represented in Figures 16.13 and 16.14. CPs use units of information, 'signals', to communicate
with each other via mailboxes (MBs) in the nodes. These signals can also emanate
from the environment (nodes, existing end-to-end protocols) toward the CPs. A CP, shown in
Figure 16.11, contains the following fields [44]:

(1) The identifier field (IF), which provides a unique identifier for the CP, as well as
information about the class of packets it may belong to, such as its quality of service
(QoS) requirements.

(2) The data field containing the ordinary data it is transporting.

(3) A cognitive map (CM), which contains the usual source and destination (S-D)
information, as well as a map showing where the packet currently 'thinks' it is, the
packet's view of the state of the network, and information about where it wants to
go next; the S-D information may also be stored in the IF.

(4) Executable code that the CP uses to update its CM. This code will contain learning
algorithms for updating the CM, and decision algorithms which use the CM.

Figure 16.13 CPN node model.

Figure 16.14 CPN example.

A node in the CPN provides a storage area for CPs and for mailboxes which are used
to exchange data between CPs, and between CPs and the node. It has an input buffer for
CPs arriving from the input links, a set of mailboxes, and a set of output buffers which are
associated with output links. Nodes in a CPN carry out the following functions [44]:

(1) A node receives packets via a finite set of ports and stores them in an input buffer. 

(2) It transmits packets to other nodes via a set of output buffers. Once a CP is placed 
in an output buffer, it is transmitted to another destination node with some priority 
indicated in the output buffer. 

(3) A node receives information from CPs which it stores in MBs. MBs may be reserved
for certain classes of CPs, or may be specialized by classes of CPs. For instance,
there may be different MBs for packets identified by different source-destination
(S-D) pairs.

(4) A node executes the code for each CP in the input buffer. During the execution of the
CP's code, the CP may ask the node to declare its identity, and to provide information
about its local connectivity (i.e. this is node A, and I am connected to nodes B, C, D
via output buffers). In some cases, the CP may already have this information in its CM
as a result of the initial information it received at its source, and as a result of its own
memory of the sequence of moves it has made.

As a result of this execution: 

(1) The CMs of the packets in the input buffer are updated. 

(2) Certain information is moved from CPs to certain MBs. 

(3) A CP which has made the decision to be moved to an output buffer is transferred
there, with the priority it may have requested.
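The packet fields and node functions listed above can be summarised in a small Python
model. This is a sketch for orientation only, not the implementation of Reference [44]: the
class names, the signature of the packet code (it returns an output link, a priority and the
information to deposit) and the mailbox keys are assumptions made for the example.

```python
from dataclasses import dataclass, field
from typing import Callable


@dataclass
class CognitivePacket:
    identifier: str     # IF: unique id plus class/QoS information
    data: bytes         # ordinary payload
    cm: dict            # cognitive map
    code: Callable      # code(cm, mailbox, links) -> (link, priority, deposit)


@dataclass
class CPNNode:
    name: str
    links: list
    mailbox: dict = field(default_factory=dict)
    input_buffer: list = field(default_factory=list)
    output_buffers: dict = field(default_factory=dict)

    def process(self):
        """Run each buffered CP's code, update the MB, move the CP to an output buffer."""
        while self.input_buffer:
            cp = self.input_buffer.pop(0)
            cp.cm["at"] = self.name                      # node declares its identity
            link, priority, deposit = cp.code(cp.cm, self.mailbox, self.links)
            self.mailbox.update(deposit)                 # information moved from CP to MB
            queue = self.output_buffers.setdefault(link, [])
            queue.append((priority, cp))
            queue.sort(key=lambda item: item[0])         # lower number is served first


def shortest_delay(cm, mailbox, links):
    """Example packet code: pick the link with the smallest delay recorded in the MB."""
    link = min(links, key=lambda l: mailbox.get(("delay", l), 0.0))
    cm["next"] = link
    visits = mailbox.get(("visits", cm["at"]), 0) + 1
    return link, 0, {("visits", cm["at"]): visits}


node = CPNNode("A", ["B", "C"], mailbox={("delay", "B"): 3.0, ("delay", "C"): 1.0})
node.input_buffer.append(CognitivePacket("cp-1", b"x", {}, shortest_delay))
node.process()
print(sorted(node.output_buffers))
```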

We have already discussed some networks which offer users the capability of
adding network executable code to their packets. Additional material can be found in
References [44-68].

16.5.1 Adaptation by cognitive packets 

Each cognitive packet starts with an initial representation of the network from which it 
then progressively constructs its own cognitive map of the network state and uses it to 
make routing decisions. Learning paradigms are used by CPs to update their CM and reach 
decisions using the packet's prior experience and the input provided via mailboxes. In the 
adaptive approach for CPs, each packet entering the network is assigned a goal before it 
enters the network, and the CP uses the goal to determine its course of action each time it 
has to make a decision [44]. For instance, if the CP contains part of a telephone conversation,
a typical goal assigned to the packet should reflect the concern about the delay. A more
sophisticated goal in this case could be delay and sequenced (in order) delivery. On the
other hand, if these were data packets, the goal may simply be packet loss rate. These goals
are translated into numerical quantities (e.g. delay values, loss probabilities and weighted
combinations of such numerical quantities), which are then used directly in the adaptation.
An example is where all the packets are assigned a common goal, which is to minimize
a weighted combination of delay (W) and loss (L) as

$$G = \alpha W + \beta L \qquad (16.1)$$

A simple approach to adaptation is to respond to the most recently available
data. Here the CP's cognitive memory contains data which is updated from the contents of
the node's mailbox. After this update is made, the CP makes the decision which is most
advantageous (lowest cost or highest reward) based simply on this information. This
approach is referred to as the Bang-Bang algorithm. Other learning paradigms can also be
used for CPs, such as learning feedforward random neural networks (LFRNN)
[57, 68] or random neural networks with reinforcement learning (RNNRL) [57]. Adaptive
stochastic finite-state machines (ASFSM) [55, 56, 58] are another class of adaptive models
which could be used for these purposes.
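As a minimal illustration of the Bang-Bang rule, the Python sketch below picks the output
link with the lowest value of the goal in Equation (16.1), using only the latest mailbox
contents. The mailbox layout (one delay and one loss estimate per link), the link names and
the weights are assumptions chosen for the example.

```python
def bang_bang_choice(mailbox, alpha=1.0, beta=1.0):
    """Pick the output link with the lowest goal value G = alpha*W + beta*L."""
    def goal(link):
        delay, loss = mailbox[link]
        return alpha * delay + beta * loss
    return min(mailbox, key=goal)


# Hypothetical mailbox contents: link -> (latest delay estimate, latest loss estimate).
mailbox = {"north": (4.0, 0.10), "east": (2.5, 0.30), "south": (3.0, 0.05)}
delay_driven = bang_bang_choice(mailbox)
loss_driven = bang_bang_choice(mailbox, beta=10.0)
print(delay_driven, loss_driven)
```

Raising the weight on loss moves the decision from the fastest link to the most reliable
one, with no memory of earlier observations.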

16.5.2 The random neural networks-based algorithms 

For the reinforcement learning approach to CP adaptation, as well as for the feed-forward
neural network predictor, the RNN [57] was used in Gelenbe et al. [44]. It is an analytically
tractable model whose mathematical structure is akin to that of queueing networks. It has
product form just like many useful queueing network models, although it is based on
nonlinear mathematics. The state $q_i$ of the ith neuron in the network is the probability that
it is excited. Each neuron i is associated with a distinct outgoing link at a node. These
quantities satisfy the following system of nonlinear equations:

$$q_i = \lambda^+(i) / [r(i) + \lambda^-(i)] \qquad (16.2)$$

with

$$\lambda^+(i) = \sum_j q_j w^+_{ji} + \Lambda_i, \qquad \lambda^-(i) = \sum_j q_j w^-_{ji} + \lambda_i \qquad (16.3)$$

Here $w^+_{ij}$ is the rate at which neuron i sends 'excitation spikes' to neuron j when i is excited,
$w^-_{ij}$ is the rate at which neuron i sends 'inhibition spikes' to neuron j when i is excited,
and r(i) is the total firing rate from neuron i; $\Lambda_i$ and $\lambda_i$ denote the rates of excitation
and inhibition spikes arriving at neuron i from outside the network. For an n neuron network,
the network parameters are the n by n 'weight matrices' $W^+ = \|w^+_{ij}\|$ and
$W^- = \|w^-_{ij}\|$, which need to be 'learned' from input data. Various techniques for learning
may be applied to the RNN. These include reinforcement learning and gradient-based
learning, which are used in the following.

Given some goal G that the CP has to achieve as a function to be minimized
[i.e. transit delay or probability of loss, or a weighted combination of the two as in
Equation (16.1)], a reward R is formulated which is simply $R = 1/G$. Successive measured
values of R are denoted by $R_l$, $l = 1, 2, \ldots$ These are first used to compute a decision
threshold:

$$T_l = a T_{l-1} + (1 - a) R_l \qquad (16.4)$$

where a is some constant, 0 < a < 1, typically close to 1. Now an RNN with (at least) as
many nodes as the decision outcomes is constructed. Let the neurons be numbered 1, ..., n.
Thus for any decision i, there is some neuron i. Decisions in this RL algorithm with the
RNN are taken by selecting the decision j for which the corresponding neuron is the most
excited, i.e. the one which has the largest value of $q_j$. Note that the lth decision may not have
contributed directly to the lth observed reward because of time delays between cause and
effect. Suppose that we have now taken the lth decision, which corresponds to neuron j,
and that we have measured the lth reward $R_l$. Let us denote by $r_i$ the firing rates of the
neurons before the update takes place. We first determine whether the most recent value of
the reward is larger than the previous 'smoothed' value of the reward, which is referred to
as the threshold $T_{l-1}$. If that is the case, then we increase very significantly the excitatory
weights going into the neuron that was the previous winner (in order to reward it for its new
success), and make a small increase of the inhibitory weights leading to other neurons. If
the new reward is not better than the previously observed smoothed reward (the threshold),
then we simply increase moderately all excitatory weights leading to all neurons, except
for the previous winner, and increase significantly the inhibitory weights leading to the
previous winning neuron (in order to punish it for not being very successful this time). This
is detailed in the algorithm given below. We compute $T_{l-1}$ and then update the network
weights as follows for all neurons $i \ne j$:



If $T_{l-1} < R_l$

- $w^+(i, j) \leftarrow w^+(i, j) + R_l$

- $w^-(i, k) \leftarrow w^-(i, k) + R_l/(n - 2)$, if $k \ne j$

Else

- $w^+(i, k) \leftarrow w^+(i, k) + R_l/(n - 2)$, if $k \ne j$

- $w^-(i, j) \leftarrow w^-(i, j) + R_l$
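One learning step of this scheme can be sketched in plain Python. The code covers the
reward/punish update above, the renormalisation of the weights to the firing rates held
before the update, and the fixed-point computation of the excitation probabilities. It is an
illustrative sketch, not the code of Reference [44]: the function names, the initial weights,
the external arrival rates Lam and lam, the skipping of self-connections (k = i) and the
fixed number of iterations are all assumptions.

```python
def excitation_probs(w_plus, w_minus, Lam, lam, iters=200):
    """Iterate q_i = lambda+(i) / (r(i) + lambda-(i)) from a uniform start."""
    n = len(w_plus)
    r = [sum(w_plus[i][m] + w_minus[i][m] for m in range(n)) for i in range(n)]
    q = [0.5] * n
    for _ in range(iters):
        q = [min(1.0,
                 (sum(q[j] * w_plus[j][i] for j in range(n)) + Lam[i]) /
                 (r[i] + sum(q[j] * w_minus[j][i] for j in range(n)) + lam[i]))
             for i in range(n)]
    return q


def rl_step(w_plus, w_minus, j, reward, threshold, a=0.8):
    """Update weights in place for winning neuron j (n >= 3); return the new threshold."""
    n = len(w_plus)
    # Firing rates before the update; the weights are rescaled back to these.
    r = [sum(w_plus[i][m] + w_minus[i][m] for m in range(n)) for i in range(n)]
    for i in range(n):
        if i == j:
            continue
        for k in range(n):
            if k == i:                        # assumption: no self-connections
                continue
            if threshold < reward:            # success: reward j, mildly inhibit the rest
                if k == j:
                    w_plus[i][k] += reward
                else:
                    w_minus[i][k] += reward / (n - 2)
            else:                             # failure: punish j, mildly excite the rest
                if k == j:
                    w_minus[i][k] += reward
                else:
                    w_plus[i][k] += reward / (n - 2)
    for i in range(n):                        # renormalise each row to its old rate
        r_star = sum(w_plus[i][m] + w_minus[i][m] for m in range(n))
        for m in range(n):
            w_plus[i][m] *= r[i] / r_star
            w_minus[i][m] *= r[i] / r_star
    return a * threshold + (1 - a) * reward   # smoothed reward used as next threshold


n = 3
w_plus = [[0.0 if i == k else 0.5 for k in range(n)] for i in range(n)]
w_minus = [[0.0 if i == k else 0.5 for k in range(n)] for i in range(n)]
new_threshold = rl_step(w_plus, w_minus, j=0, reward=1.0, threshold=0.5)
q = excitation_probs(w_plus, w_minus, Lam=[0.2] * n, lam=[0.0] * n)
print(new_threshold, q)
```

After a reward above the threshold, neuron 0 receives more excitation than the others and
becomes the most excited neuron, so the same decision is taken again.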



Then we renormalize all the weights by carrying out the following operations, to avoid
obtaining weights which indefinitely increase in size. First for each i we compute:

$$r_i^* = \sum_{m=1}^{n} \left[ w^+(i, m) + w^-(i, m) \right] \qquad (16.5)$$

and then renormalize the weights with

$$w^+(i, j) \leftarrow w^+(i, j)\, r_i / r_i^*$$
$$w^-(i, j) \leftarrow w^-(i, j)\, r_i / r_i^*$$

The probabilities $q_i$ are computed using the nonlinear iterations in Equations (16.2) and
(16.3), leading to a new decision based on the neuron with the highest probability of being
excited.

16.5.2.1 Performance examples

Simulation results are generated in a scenario as in Gelenbe et al. [44]. The CPN included
10 x 10 = 100 nodes, interconnected within a rectangular grid topology. All link speeds were
normalized to 1, and packets were allowed to enter and leave the network either from the
top 10 nodes or the bottom 10 nodes in the grid. All the packets were assigned a common
goal, which was to minimize a weighted combination of delay (W) and loss (L) as defined
by Equation (16.1).

Figure 16.15 Comparison of average delay through the CPN with Bang-Bang control based
on estimated delay.

All the algorithms are allowed to use four items of information which
are deposited in the nodes' mailboxes:

(1) the length of the local queues in the node; 

(2) recent values of the downstream delays experienced by packets which have previ- 
ously gone through the output links and reached their destinations; 

(3) the loss rate of packets which have passed through the same node and gone through 
the output links; 

(4) estimates, made by the most recent CPs which have used the output links headed
for some destination d, of the delay $D_d$ and loss $L_d$ from this node to that
destination.

The value $D_d$ is updated by each successive CP which passes through the node and whose
destination is d. Samples of the simulation results are given in Figures 16.15-16.18.



16.6 GAME THEORY MODELS IN COGNITIVE RADIO NETWORKS 

Game theory is a set of mathematical tools used to analyze interactive decision makers. We 
will show how these tools can be used in the analysis of cognitive networks [71-77]. The 
fundamental component of game theory is the concept of a game, formally expressed in 
normal form as [69, 70]: 



$$G = (N, A, \{u_i\})$$



Figure 16.16 Comparison of average loss through the CPN with Bang-Bang control based
on delay and loss (horizontal axis: external arrival rate; curves: no control and Bang-Bang).

Figure 16.17 Comparison of average delay through the CPN with reinforcement learning-
based control using delay and loss as the goal.

where

(1) G is a particular game;

(2) $N = \{1, 2, \ldots, n\}$ is a finite set of players (decision makers);

(3) $A_i$ is the set of actions available to player i;

(4) $A = A_1 \times A_2 \times \cdots \times A_n$ is the action space; and

(5) $\{u_i\} = \{u_1, u_2, \ldots, u_n\}$ is the set of utility (objective) functions that the players wish
to maximize.

Figure 16.18 Comparison of average loss through the CPN with reinforcement learning-
based control using delay and loss as the goal.



Each player's objective function, $u_i$, is a function of the particular action chosen by player
i, $a_i$, and the particular actions chosen by all of the other players in the game, $a_{-i}$, and yields
a real number. Other games may include additional components, such as the information
available to each player and communication mechanisms. In a repeated game, players are
allowed to observe the actions of the other players, remember past actions, and attempt to
predict future actions of players. An action vector a is said to be a Nash equilibrium (NE) if

$$u_i(a) \ge u_i(b_i, a_{-i}) \quad \forall i \in N,\ b_i \in A_i \qquad (16.6)$$

Restated, an NE is an action vector from which no player can profitably unilaterally deviate.
NE correspond to the steady states of the game and are therefore predicted as the most
probable outcomes of the game.
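For a small finite game, condition (16.6) can be checked by brute force. The Python sketch
below enumerates all action vectors and keeps those from which no single player gains by
deviating. The two-radio channel selection game used as input is a made-up example, and
the function name is an assumption.

```python
from itertools import product


def nash_equilibria(action_sets, utilities):
    """Return all pure-strategy action vectors satisfying the NE condition."""
    equilibria = []
    for a in product(*action_sets):
        stable = all(
            utilities[i](a) >= utilities[i](a[:i] + (b,) + a[i + 1:])
            for i, acts in enumerate(action_sets)
            for b in acts
        )
        if stable:
            equilibria.append(a)
    return equilibria


# Hypothetical game: two radios each pick channel 0 or 1 and prefer not to collide.
def no_collision(a):
    return 1 if a[0] != a[1] else 0


channels = [(0, 1), (0, 1)]
ne = nash_equilibria(channels, [no_collision, no_collision])
print(ne)
```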

A repeated game is a sequence of stages where each stage is the same normal form
game. When the game has an infinite number of stages, the game is said to be an infinite
horizon game. Players choose strategies (actions at each stage) based on their knowledge
of the game: past actions, future expectations and current observations. These strategies can
be fixed, contingent on the actions of other players or adaptive. These strategies can
also be designed to punish players who deviate from agreed upon behavior. When punishment
occurs, players choose their actions to minimize the payoff of the offending player. However,
the punished player i is still able to achieve some payoff $\underline{v}_i$. Thus there is a limit to how
much a player can be punished.

As estimations of future values of $u_i$ are uncertain, many repeated games modify the
original objective functions by discounting the expected payoffs in future stages by $\delta$, where
$0 < \delta < 1$, such that the anticipated value in stage k to player i is given by

$$u_{i,k}(a) = \delta^k u_i(a) \qquad (16.7)$$

It can be shown that, in a repeated game with an infinite horizon and discounting, for every
feasible payoff vector v with $v_i > \underline{v}_i$ for all $i \in N$, where $\underline{v}_i$ is the payoff that player i can
still secure while being punished, there exists a $\underline{\delta} < 1$ such that for all $\delta \in (\underline{\delta}, 1)$
there is an NE with payoffs v (Folk theorem [69]). To generalize the Folk theorem, given
a discounted infinite horizon repeated game, nearly any behavior can be designed to be
the steady state through the proper choice of punishment strategies and $\delta$. Thus convergent
behavior of a repeated game can be achieved nearly independently of the objective function.

A myopic game is defined as a repeated game in which there is no communication
between the players, memory of past events or speculation of future events. Any adaptation
by a player can still be based on knowledge of the current state of the game. As players
have no consideration of future payoffs, the Folk theorem does not hold for myopic games
and the convergence to steady-state behavior must occur through other means. Two
convergence dynamics possible in a myopic game are the best response dynamic and the better
response dynamic. Both dynamics require additional structure in the stage game to ensure
convergence. Best response dynamic [70] refers to the case where, at each stage, one player
$i \in N$ is permitted to deviate from $a_i$ to some randomly selected action $b_i \in A_i$ if

$$u_i(b_i, a_{-i}) \ge u_i(c_i, a_{-i}) \quad \forall c_i \ne b_i \in A_i \quad \text{and} \quad u_i(b_i, a_{-i}) > u_i(a_i, a_{-i})$$

Better response dynamic [70] refers to the case where, at each stage, one player $i \in N$
is permitted to deviate from $a_i$ to some randomly selected action $b_i \in A_i$ if
$u_i(b_i, a_{-i}) > u_i(a_i, a_{-i})$.
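The better response dynamic is easy to simulate. In the Python sketch below one randomly
chosen player per stage tries a randomly selected action and keeps it only if its own utility
strictly improves. The three-radio channel selection game, the fixed random seed and the
number of stages are assumptions chosen so that the example is reproducible.

```python
import random


def better_response(action_sets, utilities, start, steps=1000, seed=0):
    """Run a better response dynamic and return the final action vector."""
    rng = random.Random(seed)
    a = list(start)
    for _ in range(steps):
        i = rng.randrange(len(a))              # one player may deviate per stage
        b = rng.choice(action_sets[i])         # randomly selected candidate action
        trial = a[:i] + [b] + a[i + 1:]
        if utilities[i](trial) > utilities[i](a):
            a = trial                          # deviate only on a strict improvement
    return tuple(a)


# Hypothetical game: each radio's utility is minus the number of radios sharing its channel.
def make_utility(i):
    return lambda a: -sum(1 for k, c in enumerate(a) if k != i and c == a[i])


channel_sets = [[0, 1, 2]] * 3
radio_utils = [make_utility(i) for i in range(3)]
final = better_response(channel_sets, radio_utils, start=(0, 0, 0))
print(final)
```

Starting with all radios on channel 0, the dynamic settles on an assignment in which every
radio has its own channel, after which no unilateral deviation is an improvement.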

An S-modular game restricts $\{u_i\}$ such that, for all $i \in N$, either Equation (16.8) or
(16.9) is satisfied.

$$\frac{\partial^2 u_i(a)}{\partial a_i \partial a_j} \ge 0 \quad \forall j \ne i \in N \qquad (16.8)$$

$$\frac{\partial^2 u_i(a)}{\partial a_i \partial a_j} \le 0 \quad \forall j \ne i \in N \qquad (16.9)$$

In the former case, the game is said to be supermodular; in the latter the game is said to
be submodular. Myopic games whose stages are S-modular games with a unique NE and
which follow a best response dynamic converge to that NE [71].

A potential game is a special type of game where $\{u_i\}$ are such that the change in value
seen by a unilaterally deviating player is reflected in a function $P: A \to \mathbb{R}$. All myopic
games where the stages are the same potential game converge to an NE when decisions are
updated according to a better response dynamic [70].

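An exact potential game (EPG) is a game for which there exists some function (EPF)
$P: A \to \mathbb{R}$ such that $\forall i \in N$, $\forall a \in A$

$$u_i(a_i, a_{-i}) - u_i(b_i, a_{-i}) = P(a_i, a_{-i}) - P(b_i, a_{-i}) \qquad (16.10)$$

A necessary and sufficient condition for a game to be an exact potential game is [72]

$$\frac{\partial^2 u_i(a)}{\partial a_i \partial a_j} = \frac{\partial^2 u_j(a)}{\partial a_j \partial a_i} \quad \forall i, j \in N,\ a \in A \qquad (16.11)$$
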

A coordination-dummy game [72] is a composite of a coordination game with identical
interest function V and a dummy game with dummy function $D_i$, whose value is solely
dependent on the actions of the other players, and can be expressed as

$$u_i(a) = V(a) + D_i(a_{-i}) \qquad (16.12)$$

An EPF for this game can be written as

$$P(a) = V(a) \qquad (16.13)$$

In a self-motivated game, each player's utility function is a function solely of that player's
own action:

$$u_i(a) = h_i(a_i) \qquad (16.14)$$

A self-motivated game can be shown to be an EPG by introducing the EPF as

$$P(a) = \sum_{i \in N} h_i(a_i) \qquad (16.15)$$

Bilateral symmetric interaction (BSI) game [73] refers to the case where every player's
objective function can be characterized by

$$u_i(a) = \sum_{j \in N \setminus \{i\}} w_{ij}(a_i, a_j) - h_i(a_i) \qquad (16.16)$$

where $w_{ij}: A_i \times A_j \to \mathbb{R}$ and $h_i: A_i \to \mathbb{R}$ such that for every $(a_i, a_j) \in A_i \times A_j$,
$w_{ij}(a_i, a_j) = w_{ji}(a_j, a_i)$. An EPF for a BSI game is given by

$$P(a) = \sum_{i \in N} \sum_{j=1}^{i-1} w_{ij}(a_i, a_j) - \sum_{i \in N} h_i(a_i) \qquad (16.17)$$

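That Equation (16.17) is indeed an exact potential for utilities of the form (16.16) can be
verified numerically on a small example by testing Equation (16.10) for every unilateral
deviation. The Python sketch below does this with integer-valued functions so that the
comparison is exact. The interference-like pair term, the quadratic cost and the gain
matrices are assumptions made up for the check.

```python
from itertools import product


def bsi_utility(i, a, w, h):
    """u_i(a) as in Equation (16.16)."""
    return sum(w(i, j, a[i], a[j]) for j in range(len(a)) if j != i) - h(i, a[i])


def bsi_potential(a, w, h):
    """P(a) as in Equation (16.17): each unordered pair is counted once."""
    pairs = sum(w(i, j, a[i], a[j]) for i in range(len(a)) for j in range(i))
    return pairs - sum(h(i, a[i]) for i in range(len(a)))


def is_exact_potential(action_sets, w, h):
    """Check Equation (16.10) for every action vector and every unilateral deviation."""
    for a in product(*action_sets):
        for i in range(len(a)):
            for b in action_sets[i]:
                d = a[:i] + (b,) + a[i + 1:]
                du = bsi_utility(i, a, w, h) - bsi_utility(i, d, w, h)
                dp = bsi_potential(a, w, h) - bsi_potential(d, w, h)
                if du != dp:
                    return False
    return True


def make_w(g):
    return lambda i, j, x, y: -g[i][j] * x * y    # pair term weighted by gain g[i][j]


def cost(i, x):
    return x * x                                  # hypothetical cost of own action


symmetric = [[0, 2, 1], [2, 0, 3], [1, 3, 0]]     # g[i][j] == g[j][i]
asymmetric = [[0, 1], [3, 0]]                     # violates the symmetry requirement
ok = is_exact_potential([(0, 1, 2)] * 3, make_w(symmetric), cost)
broken = is_exact_potential([(0, 1, 2)] * 2, make_w(asymmetric), cost)
print(ok, broken)
```

The check succeeds for the symmetric gains and fails once the symmetry of the pair terms
is broken, which shows why the symmetry condition is needed.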

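Ordinal potential games (OPG) refer to the case where there exists some function (OPF)
$P: A \to \mathbb{R}$ such that, instead of the equality in Equation (16.10), only the sign of a
unilateral change in utility is preserved:

$$u_i(a_i, a_{-i}) > u_i(b_i, a_{-i}) \iff P(a_i, a_{-i}) > P(b_i, a_{-i}), \quad \forall i \in N,\ \forall a \in A \qquad (16.18)$$

All EPG are also OPG.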

16.6.1 Cognitive radio networks as a game 

The cognitive radios in the network form the game's set of decision makers. The set of 
physical layer parameters which a radio is permitted to alter forms the player's action set. 
From these action sets, the action space is formed. Preference relations over the action space 
are formed by an exhaustive evaluation of the adaptation algorithms. Objective functions are 
then formed by mapping the preference relations to the real number line so that preferable 
action vectors are larger than less preferable action vectors. 

16.6.1.1 Distributed power control

As an example we examine distributed power control algorithms within the context of
cognitive radio networks (CRNs) [71, 74, 77]. Below, the following notation is used:

• N, the set of decision-making (cognitive) radios in the network;

• i, j, two different cognitive radios, $i, j \in N$;

• $P_j$, the set of power levels available to radio j; this is presumed to be a segment of the
real number line $\mathbb{R}$;

• $p_j$, a power level chosen by j from $P_j$;

• P, the power space ($\mathbb{R}^n$) formed from the Cartesian product of all $P_j$:
$P = P_1 \times P_2 \times \cdots \times P_n$;

• p, a power profile (vector) from P formed as $p = \{p_1, p_2, \ldots, p_n\}$;

• $u_j(p)$, the utility that j receives from p; this is the specific objective function that j is
looking to maximize.

Based on these conventions, a power control game G can be formulated as
$G = (N, P, \{u_j\})$.

16.6.1.2 Repeated power games

MacKenzie and Wicker [74] consider a discounted repeated power control game
implemented on a packet-based network wherein the original objective function for each radio
j is the modified function of throughput given as

$$u_j(p) = R \left[ 1 - 2\,\mathrm{BER}(p) \right]^L / p_j \qquad (16.19)$$

where R is the transmission rate, L is the packet length and BER is the bit error rate, which
is a function of the SINR seen by j. As the modeled game has an infinite horizon, a CRN
implemented in this manner will exhibit convergent behavior if:

(1) some mechanism exists for broadcasting the desired operating vector, the discount 
factor, and the punishment strategy; 

(2) there is full knowledge of the environment so the radios can differentiate deviant 
behavior from fades and jammers external to the CRN; 

(3) there is knowledge of the action chosen by each radio at each stage. 

16.6.1.3 S-modular games 

Altman and Altman [71] examine the application of super-modular games to distributed
power control. Thus the objective functions for this game are characterized by Equations 
(16.8) and (16.9). Herein each game follows a general updating algorithm (GUA), which is 
actually a best response dynamic. Thus if these network games have a unique NE, behavior 
converges to the NE from any initial p. For a CRN which satisfies this characterization, the 
conditions for convergence are: 

(1) The adaptation algorithms must incorporate perfect knowledge of the objective 
function. 

(2) The network must have a unique steady state. 

(3) Some method must exist for measuring current performance and for sensing rele- 
vant environmental parameters. Depending on the particular adaptation algorithm, 
it might be necessary to know the number and type of other radios in the network. 

Altman and Altman [71] assert that the classes of power control algorithms considered
by Yates [75] are S-modular games. Yates examines power control in the context of the
uplink of a cellular system under five scenarios:

(1) fixed assignment where each mobile is assigned to a particular base station; 

(2) minimum power assignment where each mobile is assigned to the base station where 
its SINR is maximized; 

(3) macro diversity where all base stations combine the signals of the mobiles; 

(4) limited diversity where a subset of the base stations combine the signals of the 
mobiles; and 

(5) multiple connection reception where the target SINR must be maintained at a number 
of base stations. 

In each scenario, each mobile, j, tries to achieve a target SINR $\gamma_j$ as measured by a
function, I(p). I(p) is the standard effective interference function which has the following
properties: (1) positivity, i.e. $I(p) > 0$; (2) monotonicity, i.e. if $p \ge p^*$, then $I(p) \ge I(p^*)$;
(3) scalability, i.e. for all $\alpha > 1$, $\alpha I(p) > I(\alpha p)$, where the convention that $p \ge p^*$ means
that $p_i \ge p_i^* \;\; \forall i \in N$.
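
To make the role of the interference function concrete, the sketch below iterates p(t+1) = I(p(t)) for the fixed assignment scenario with a single base station. The specific form $I_j(p) = \gamma_j (\sum_{i \ne j} h_i p_i + \sigma^2)/h_j$, the stopping tolerance and all names are assumptions made for this example; it is not taken verbatim from [75].

```python
def interference_fn(p, gains, targets, noise):
    """Assumed fixed-assignment interference function:
    I_j(p) = gamma_j * (sum_{i != j} h_i * p_i + noise) / h_j."""
    total = sum(h * x for h, x in zip(gains, p))
    return [g * (total - h * x + noise) / h
            for g, h, x in zip(targets, gains, p)]


def iterate_power_control(p0, gains, targets, noise, tol=1e-12, max_iter=10_000):
    """Repeat p <- I(p) until successive iterates differ by less than tol."""
    p = list(p0)
    for _ in range(max_iter):
        nxt = interference_fn(p, gains, targets, noise)
        if max(abs(a - b) for a, b in zip(nxt, p)) < tol:
            return nxt
        p = nxt
    return p


if __name__ == "__main__":
    # Two mobiles, linear-scale target SINR of 0.5 each (a feasible target).
    print(iterate_power_control([1.0, 1.0], [1.0, 0.5], [0.5, 0.5], 0.1))
```

When the targets are feasible, the iteration settles at the power vector where every mobile meets its target SINR exactly, regardless of the starting point.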

Single-cell target SINR games are also ordinal potential games. To prove it, consider a 
modified game where the action sets are the received power levels at the base station. A 
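received power level for mobile j, $r_j$, is the product of its path loss to base station k, $h_{jk}$,
and its transmitted power level. Then the objective functions of a target SINR game can be
expressed as

$$u_j(p) = 1 - \Big( -\gamma_j + r_j - \sum_{i \in N \setminus j} r_i - \sigma_k \Big)^2 \tag{16.20}$$

where $\sigma_k$ is the noise power at k. Expanding the above expression gives

$$\begin{aligned}
u_j(p) = {} & 1 - \gamma_j^2 - \sigma_k^2 - 2\gamma_j \sigma_k - \Big(\sum_{i \in N \setminus j} r_i\Big)^2 - 2\gamma_j \Big(\sum_{i \in N \setminus j} r_i\Big) - 2\sigma_k \Big(\sum_{i \in N \setminus j} r_i\Big) \\
& - r_j^2 + 2\gamma_j r_j + 2\sigma_k r_j + 2 r_j \Big(\sum_{i \in N \setminus j} r_i\Big)
\end{aligned} \tag{16.21}$$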

The first line of Equation (16.21) is a dummy game; the following three terms in 
Equation (16.21) are a self-motivated game, EPG. The final term is also an EPG as it 
is a BSI game. Since a composite game formed from a linear combination of two EPGs is 
itself an EPG, it is seen that the target SINR game is an EPG. As other forms of target SINR 
games are ordinal transformations (OT) of Equation (16.21), all target SINR games are 
OPG, since an OT of an OPG is an OPG. It can also be shown that all target throughput power
control games are OPG (all throughput maximization power control games are OPG).
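
The exact potential property can be checked numerically. The sketch below uses the received-power form $u_j = 1 - (-\gamma_j + r_j - \sum_{i \ne j} r_i - \sigma_k)^2$ and a candidate potential V built from the self-motivated terms plus the pairwise BSI terms. The function V is derived here from the decomposition described above and is an assumption of this example, as are all names.

```python
import itertools
import random


def utility(j, r, gamma, sigma):
    """Target SINR objective in received powers (form of Equation (16.20))."""
    others = sum(r) - r[j]
    return 1.0 - (-gamma[j] + r[j] - others - sigma) ** 2


def potential(r, gamma, sigma):
    """Candidate exact potential: self-motivated terms plus pairwise BSI terms."""
    self_terms = sum(-x * x + 2 * g * x + 2 * sigma * x
                     for x, g in zip(r, gamma))
    pair_terms = sum(2 * r[i] * r[k]
                     for i, k in itertools.combinations(range(len(r)), 2))
    return self_terms + pair_terms


def max_violation(n=4, trials=1000, seed=0):
    """Largest |delta u_j - delta V| over random unilateral deviations."""
    rng = random.Random(seed)
    worst = 0.0
    for _ in range(trials):
        gamma = [rng.uniform(0, 1) for _ in range(n)]
        sigma = rng.uniform(0, 1)
        r = [rng.uniform(0, 1) for _ in range(n)]
        j = rng.randrange(n)
        r_new = list(r)
        r_new[j] = rng.uniform(0, 1)
        du = utility(j, r_new, gamma, sigma) - utility(j, r, gamma, sigma)
        dv = potential(r_new, gamma, sigma) - potential(r, gamma, sigma)
        worst = max(worst, abs(du - dv))
    return worst
```

A violation at the level of floating-point rounding indicates that every unilateral change in utility equals the change in V, which is the defining property of an EPG.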



16.6.1.4 Nonlinear group pricing



Goodman and Mandayam [76] consider a scenario wherein mobile devices are trying to
maximize a function of their throughput. Note that the only NE of games whose utility
functions are pure throughput functions is the power vector where all mobiles transmit
at maximum power. To avoid this problem, Goodman and Mandayam [76] introduce the
modified throughput function as

$$u_j(p) = \frac{B\,T(p)}{p_j} - cRp_j \tag{16.22}$$

where R is the transmission rate, c is a constant, B is the battery life, and T is the throughput
function. This cannot be shown to be either an EPG or an OPG. The left-most term is generally
only an OPG, whereas the pricing function is an EPG as it is a self-motivated game.

Goodman and Mandayam [76] use a best response dynamic in their experiments, which 
converge to a NE. Although not stated in Goodman and Mandayam [76], Saraydar et al. 
[78] show that Equation (16.22) is indeed a supermodular game. Also note that, without 
the cost function, Equation (16.19) is a particular instance of Equation (16.22). So a CRN 
implementing the repeated games of References [74], [76] or [78] is a supermodular game 
and will exhibit convergent behavior if the properties specified above for S-modular games
are satisfied. However, it should be noted that, when left as a repeated game, there is greater 
flexibility in dynamically selecting the steady-state. 



16.6.1.5 Nonlinear group pricing

Sung and Wong [77] consider another single cell distributed power control game with
pricing characterized by

$$u_j(p) = R_j T(p) - \frac{\lambda h_j p_j}{\sum_{k \in N} h_k p_k} \tag{16.23}$$

where $\lambda$ is a constant and the base station subscript has been dropped in this case as all
mobiles are communicating with the same base station. The pricing function is to reflect
that the damage caused by $p_j$ to the other devices in the network is a function of the relative
difference in powers rather than absolute power level. Note that Equation (16.23) is just
a composite game of throughput maximization and a cost function. As we have indicated
that throughput maximization is an OPG, Equation (16.23) can be guaranteed to be an
OPG only if, though not necessarily if, the pricing function has an EPF, which can only be
true if Equation (16.11) is satisfied. Differentiating the price function twice yields:

$$\frac{\partial^2 C_i}{\partial p_i \, \partial p_j} = \frac{\big(\sum_{k \in N} h_k p_k\big)^2 \lambda h_i h_j - 2 h_j \big(\sum_{k \in N} h_k p_k\big) g_i(p)}{\big(\sum_{k \in N} h_k p_k\big)^4}$$

$$\frac{\partial^2 C_j}{\partial p_j \, \partial p_i} = \frac{\big(\sum_{k \in N} h_k p_k\big)^2 \lambda h_i h_j - 2 h_i \big(\sum_{k \in N} h_k p_k\big) g_j(p)}{\big(\sum_{k \in N} h_k p_k\big)^4}$$

where $g_i(p) = \lambda h_i \sum_{k \in N} h_k p_k - \lambda h_i^2 p_i$. Further evaluation leads to the result that this price
function has an EPF if $h_j p_j = h_i p_i$. Note that this is only satisfied when the received powers
are identical, which will generally not be true. Thus the cost function does not have an EPF
and the nonlinear group priced game is not an OPG. Also note that Equation (16.23) cannot
be guaranteed to be a supermodular game either, as properly chosen power vectors p and p*
will yield different signs for the second partial derivative of the cost functions.
Therefore neither the better response dynamic nor the best response dynamic will
assuredly converge, and a more complex convergence dynamic is required. As the repeated
game dynamic is guaranteed to converge, it can still be used. Thus this modification of
the pricing function significantly complicates the network.
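
The symmetry condition on the mixed partial derivatives can be checked with a few lines of code. This is a sketch under the assumption that the cost term is $C_i(p) = \lambda h_i p_i / \sum_k h_k p_k$, which is the form consistent with the derivatives given above; the function name and the numerical values are illustrative only.

```python
def mixed_partial(i, j, p, h, lam):
    """d^2 C_i / (dp_i dp_j), i != j, for C_i = lam*h_i*p_i / sum_k h_k*p_k."""
    s = sum(hk * pk for hk, pk in zip(h, p))
    g_i = lam * h[i] * s - lam * h[i] ** 2 * p[i]
    return (s ** 2 * lam * h[i] * h[j] - 2 * h[j] * s * g_i) / s ** 4


if __name__ == "__main__":
    h = [1.0, 2.0, 0.5]
    # Equal received powers for radios 0 and 1 (h0*p0 == h1*p1): symmetric.
    p_equal = [2.0, 1.0, 3.0]
    print(mixed_partial(0, 1, p_equal, h, 1.0), mixed_partial(1, 0, p_equal, h, 1.0))
    # Unequal received powers: the two mixed partials differ.
    p_unequal = [1.0, 1.0, 3.0]
    print(mixed_partial(0, 1, p_unequal, h, 1.0), mixed_partial(1, 0, p_unequal, h, 1.0))
```

The two mixed partials coincide only when the received powers of the two radios are equal, which matches the condition $h_j p_j = h_i p_i$ stated above.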

Additional examples of game theory applications in resource allocation modeling can 
be found in References [116-127] and routing in [79-115]. 



16.7 BIOLOGICALLY INSPIRED NETWORKS 

Current Internet protocols were never planned for the emerging pervasive environments 
where the amount of information will be enormous. The communications requirements 
placed by these protocols on the low cost sensor and tag nodes are in direct contradiction 
to the fundamental goals of these nodes, being small, inexpensive and maintenance-free. 
Biological systems provide insights into principles which can be adopted to completely 
redefine the basic concepts of control, structure, interaction and function of the emerg- 
ing pervasive environments. The study of the rules of genetics and evolution combined 
with mobility leads to the definition of service-oriented communication systems which 
are autonomous and autonomously self-adaptive. Based on References [128-131] in this 
section we discuss how this paradigm shift, which views a network only as a randomly self- 
organizing by-product of a collection of self-optimizing services, may become the enabler 
of the new world of omnipresent low cost pervasive environments of the future. 



16.7.1 Bio-analogies 

In the scenario depicted in Carreras et al. [131], services are associated with living organisms.
A service is defined by chromosomes. In this way the service evolves and adapts to the
environment constantly and autonomously. By analogy with living organisms, chromosomes
are collections of genes that are the smallest service-related data unit and inborn
intelligence/instincts, and thus represent all the information needed for the organism to function
and the service to be executed. As in nature, this concept defines a complete life cycle of the
organisms and therefore of services. The life cycle starts from the birth of an organism,
goes through reproduction and ends with death. Reproduction and evolution occur by
applying evolution rules inherited from nature. Fitness measures the correspondence
of the organism's genetic information with the environment and determines the exchange
of (genetic) information. Therefore no end-to-end communication concept exists
in these systems. Information is only exchanged as needed, locally, between mating
organisms.

The environment determines natural selection based on the fitness of the organisms
with the environment, leading to the best services possible as a function of the environment.
In this concept the service is the organism. A scenario is envisioned where users will be
more and more interested in a service able to provide reliable localized information. The
role of the service will be, for instance, to provide answers to questions like 'How is the
weather around the train station?' or 'Where will I find a free parking space around there?'
Services will be hosted on users' devices and will move around through the physical movement
of the users. Each service is constituted by a program and its related data, which is organized
into chromosomes. Each chromosome consists of:

(1) data that is the genetic information of the organism, organized in genes; 

(2) a plugin that stores a syntax notation describing the actions dictated by the
chromosome and the fitness (degree of attraction) operator yielding natural selection
through preferred mating.

Each gene is a tuple of information consisting of: (a) value; (b) timing information;
(c) information source ID/location information; and (d) other data depending on the service.

Organisms are diploid, which means that there are always two homologous chromosomes
associated with each service. The two homologous chromosomes have the same genes but
in different forms. Alleles are different forms of the same gene and correspond to different 
values in the tuple of gene information. They may differ in timing information or in the 
source value information. Each allele may be dominant or recessive depending on the 
service behavior. Having two chromosomes allows us to estimate the reliability of the data. 
We would probably always choose the youngest data value to be the actual one if it is a 
parking lot, but we might also average the sensor data if it represents temperature. 
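
The two ways of resolving a pair of alleles mentioned above (take the youngest value, or average the two) can be sketched as follows. The `Gene` fields mirror the tuple described in the text; the class and function names are hypothetical and chosen only for this illustration.

```python
from dataclasses import dataclass
from statistics import mean


@dataclass(frozen=True)
class Gene:
    value: float       # (a) value
    timestamp: float   # (b) timing information
    source_id: str     # (c) information source ID/location


def express_youngest(a: Gene, b: Gene) -> float:
    """Dominance by age: the most recently observed allele wins
    (e.g. the free/occupied status of a parking spot)."""
    return max(a, b, key=lambda g: g.timestamp).value


def express_average(a: Gene, b: Gene) -> float:
    """Co-dominance: average both alleles (e.g. temperature readings)."""
    return mean((a.value, b.value))
```

Which rule applies would be dictated by the service plugin; the sketch simply shows that both alleles are retained and only the expressed value differs.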

The choice of the preferred value among the two reflects the concept of dominant and 
recessive genes. As in nature, recessive information enables the service to survive in different 
environments, providing the service with higher resilience against data corruption and fraud, 
and may even allow for additional features. An analogy with the life cycle is also possible. In
this concept a service is born when the user downloads the chromosome onto a device. From
that moment on, the user is able to interact with the other organisms (i.e. users carrying 
chromosomes) and with the environment where the users are physically moving. When 
gathering information from the environment the service grows. While growing, the service 
improves its functionalities, meaning that it becomes able to increase performance. When 
a user meets another user while moving, services may reproduce and produce offspring. It 
is in this phase that evolution and natural selection occur. In order to be able to reproduce 
the service must satisfy some fitness requirements, since it is not willing to spread useless 
information. We assume a service to be dead when it cannot reproduce anymore. 

Analogies with birth, growth, reproduction and death are even further elaborated in this 
concept. As we have already said, the service is born when the user obtains (downloads) an 
empty chromosome of a certain service that consists only of the plugin. From that instant 
on the user can read data from sensors and use the syntax definition from the plugin. At this 
stage the user is haploid, i.e. it has only one chromosome per service. When reading sensor 
data, the user fills the chromosomes both at the same time. This information is in any case 
more reliable than the previous information on the chromosomes. 

Mating is performed in the following way: in meiosis, the diploid cell splits
into two haploid reproductive cells (the eggs or sperm). In real life, four haploid cells would
appear because of one step of copying the chromosomes, including cross-over. Each two
copies would be identical. This means that the chromosome pair is being split and copied.
Two packets are sent out one after another containing one chromosome each. In the best
case, when all the sent packets reach the receiver, it has four different combinations of the
chromosome pairs. This is the best case; the user may for energy reasons decide to send out
only one chromosome, or receive only one because of packet loss. A selection has to take
place to decide which of these combinations survives. This selection could be influenced by
the quality of the plugin (by the version number). Having such a selection can help to spread
new versions of plugins. It may also help to repair broken chromosomes (that were damaged
during the wireless transmission). In a sense, we allow for truly spontaneous mutations and
we may even find that mutations spread. It remains to be seen whether this is beneficial. The
selection occurs as a consequence of localization and age. The fitness of a chromosome is
defined as the average of the timing information of the genes weighted with the localization
information. In this sense the environment participates in the selection, since the survival of
a service also depends (though not exclusively) on where the user is when he mates with another user.

If the sensor data in the chromosome is too old, it is very likely useless. Thus we forbid 
sending out the chromosome after a certain threshold (coded in the plugin). This way the 
chromosome can die, but it may be reborn through reading new sensor data or receiving a 
fresh chromosome. The service is considered alive as long as it is able to answer questions.
Death is therefore a consequence of outdated chromosomal information (sensor-gathered 
information is aging). It is in the interest of the user to exchange information and to gather 
sensor information and this same interest drives the service instinct to survive. This defines 
a complete life cycle of the service. 
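
The fitness definition and the age threshold described above can be sketched as below. The text only states that fitness is an average of timing information weighted by localization; the concrete choices here (age in seconds as the timing information, a weight of 1/(1 + distance), lower score meaning fitter) and all names are assumptions made for illustration.

```python
import math
from dataclasses import dataclass


@dataclass(frozen=True)
class Gene:
    value: float
    timestamp: float  # time at which the sensor value was read
    x: float          # location of the information source
    y: float


def fitness(genes, now, here):
    """Location-weighted mean age of the genes (lower means fresher)."""
    weights = [1.0 / (1.0 + math.dist(here, (g.x, g.y))) for g in genes]
    ages = [now - g.timestamp for g in genes]
    return sum(w * a for w, a in zip(weights, ages)) / sum(weights)


def may_reproduce(genes, now, here, max_age):
    """A chromosome may be sent out only while its weighted age is below
    the threshold (which, per the text, would be coded in the plugin)."""
    return bool(genes) and fitness(genes, now, here) <= max_age
```

Under these assumptions a chromosome "dies" once `may_reproduce` returns False and is "reborn" when fresh sensor readings lower its weighted age again.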

As an example of the environmental monitoring applications, Carreras et al. [132] de- 
scribe a parking lot finding application as a possible application scenario of the above 
concept for the nomadic sensor network: 



Each parking spot in a city is equipped with a sensor that is capable of notifying whether 
the parking lot is free or not together with its geographical position (or ID). Each user 
subscribing to the service is equipped with a hand-held device that communicates both 
with the sensor nodes and with other users who subscribed to the service. The users, 
while moving around the city, collect information on the status (free/not free) of the 
parking lots they encounter on their way. This information is stored in their devices, 
together with a time stamp of when the information has been collected. Whenever two 
users come into communication range, they exchange their knowledge of the parking 
lot status (and update their own if the information is 'fresh' enough). The basic service 
of the system to the user would be the assistance in finding the nearest parking lot. 
The user queries his device (which might be some PDA), asking for the nearest free 
parking lot in a specific area of the city. The device will provide this information based 
on the data that it has stored so far. Of course, the information cannot be considered
real-time. 
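
The exchange and query steps of this scenario can be sketched in a few lines. The data layout (a dictionary mapping a spot ID to a status and time stamp), the freshness rule and the function names are assumptions for illustration, not the design of [132].

```python
import math


def merge_knowledge(mine, theirs):
    """Keep, for every parking spot, the entry with the newest time stamp.
    Each entry is (is_free, time_stamp)."""
    merged = dict(mine)
    for spot, (is_free, stamp) in theirs.items():
        if spot not in merged or stamp > merged[spot][1]:
            merged[spot] = (is_free, stamp)
    return merged


def nearest_free(knowledge, positions, here, now, max_age):
    """Return the closest spot believed free and fresh enough, else None."""
    candidates = [
        spot for spot, (is_free, stamp) in knowledge.items()
        if is_free and now - stamp <= max_age
    ]
    if not candidates:
        return None
    return min(candidates, key=lambda s: math.dist(here, positions[s]))
```

Because entries are only as recent as the last encounter, the answer is a best guess rather than real-time information, as noted in the quoted description.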



16.7.2 Bionet architecture 

The bio-networking architecture [130-134] applies key concepts and mechanisms described
above to design network applications. One of the key concepts in biological systems is 
emergence. In biological systems, beneficial system properties (e.g. adaptability) often 
emerge through the simple and autonomous interactions among diverse biological entities 
(see the above example). The bio-networking architecture applies the concept of emergence 
by implementing network applications as a group of distributed, autonomous and diverse 
objects called cyber-entities (CEs). In Itao et al. [134], analogy to a bee colony (a network 
application) consisting of multiple bees (CEs) is used. Each CE implements a functional 
service related to the application and follows simple behaviors similar to biological entities,
such as reproduction, death, migration and environment sensing, as discussed in the previous 
section. 

In the bio-networking architecture, CEs are designed based on the three principles de- 
scribed below in order to interact and collectively provide network applications that are 
autonomous, scalable, adaptive, and simple. 

(1) CEs are decentralized - there are no central entities to control and coordinate CEs (i.e.
no directory servers and no resource managers). Decentralization allows network 
applications to be scalable and simple by avoiding a single point of performance 
bottleneck and failure and by avoiding any central coordination in developing and 
deploying CEs. 

(2) CEs are autonomous - CEs monitor their local network environments, and based 
on the monitored environmental conditions, they autonomously interact without any 
intervention from human users or from other controlling entities. 

(3) CEs are adaptive to dynamically changing environmental conditions (e.g. user
demands, user locations and resource availability) over the short- and long-term.

The bionet platform described in Suzuki et al. [130], and presented in Figure 16.19, provides 
an execution environment for CEs. It consists of two types of software components. The first 
type of components, supporting components, abstracts low-level operating and networking 
details (e.g. network I/O and concurrency control for executing CEs). The second type 
of components, runtime components, provides runtime services that CEs use to perform 
their services and behaviours. The bionet platform is implemented in Java, and each bionet 
platform runs on a Java virtual machine (JVM). Each CE is implemented as a Java object 
and runs on a bionet platform. 

As shown in Figure 16.20, a CE consists of three main segments: attributes, body and
behaviors. Attributes carry descriptive information regarding the CE (e.g. CE ID and
description of a service it provides). The body implements a service that the CE provides
and contains materials relevant to the service (e.g. data, application code or user profiles).
CE behaviors implement nonservice-related actions that are inherent to all CEs. Examples
of behavior include migration, reproduction and energy exchange. More information on
different segments of the architecture can be found in Suzuki et al. [130].

Figure 16.19 Bionet platform architecture (CE, cyber-entity). (Reproduced by permission of IEEE [130].)

Figure 16.20 Design of a CE.



Network Deployment



17.1 CELLULAR SYSTEMS WITH OVERLAPPING COVERAGE 

The concept of cellular communication systems is based on the assumption that a mobile user 
is served by the base station that provides the best link quality [1,2]. Spectrum allocation 
strategies, discussed in Chapter 12, are based on this assumption. In many cases, however, 
a mobile user can establish a communication link of acceptable quality with more than one 
base. Therefore, at many locations there is overlapping coverage, usually by nearby base 
stations [3]. 

The coverage overlap can be used to improve the system performance. Several schemes 
that consider this have been suggested [4-8]. Generalized fixed channel assignment (GFCA),
a scheme that allows a call to be served by any of several nearby base stations, was considered
in Choudhury and Rappaport [4]. Directed retry, discussed in References [5,6], allows a new
call that cannot be served at one base to attempt access via a nearby alternative base. Load 
sharing is an enhancement of directed retry that allows calls in congested cells to be served 
by neighboring base stations. In Chu and Rappaport [7] overlapping coverage for highway 
microcells was considered. The use of overlapping coverage with channel rearrangement 
was discussed in Chu and Rappaport [8]. 

Reuse partitioning [9-11] can also improve traffic performance of fixed channel as- 
signment (FCA). The method divides the channels into several disjoint partitions. These 
partitions are associated with different cluster sizes (or reuse factors). Channels are allo- 
cated to base stations according to these cluster sizes. To meet the same signal quality 
requirement, channels corresponding to smaller cluster sizes are used within a smaller area 
than that for channels associated with larger cluster sizes [9]. Since channels are reused 
more often for a smaller cluster size, there may be more channels available at a base in 
reuse partitioning than in FCA. Therefore, improved traffic performance can be obtained. 
Because there is a fixed relationship between channels and base stations, reuse partitioning 
is a fixed channel assignment scheme. In this section the acronym FCA is used only to refer
to fixed channel assignment without utilizing overlapping coverage or reuse partitioning. That
is, in FCA all channels are allocated using a single cluster size and overlapping coverage is 
not exploited. 

When overlapping coverage exists in a system and is being used to provide enhanced 
access, users in overlapping areas may benefit. However, this may be at the expense of users 
in nonoverlapping areas who may encounter increased blocking or handoff failure because 
of the higher channel occupancy of the system. Generally, overlapping areas tend to occur 
at the periphery of cells (i.e. distant from bases). In reuse partitioning, on the other hand, 
calls that are close to base stations can access channels from both the smaller cluster and 
larger cluster partitions. Those calls distant from bases cannot use channels from partitions 
of smaller cluster sizes. Such calls may in fact do more poorly than in FCA. When both 
overlapping coverage and reuse partitioning are used, they can complement one another. 
Such a combination will be discussed in this section. 

The system layout is shown in Figure 17.1. The cell radius, r, is defined as the distance 
from a base to a vertex of its own cell. The coverage of a base is the area in which users can 
establish a link with acceptable signal quality with that base. This area can be modeled by a 
circle with the center at the corresponding base station. The coverage radius, R, is defined 
as the distance from a base to its coverage boundary. 

The coverage of a base is overlapped with coverages of neighboring bases. Therefore,
calls can potentially access one, two, three or even more base stations depending on the
platform location and the ratio of the coverage radius to the cell radius (R/r). For the
interesting range of R/r, there are three kinds of regions in which calls have potential
access to one, two or three base stations. These regions in Figure 17.2 are denoted $A_1$, $A_2$
and $A_3$, respectively. $A_1$ is the nonoverlapping region while both $A_2$ and $A_3$ are overlapping
regions. We consider (without loss of generality) the cell radius, r, to be normalized to
unity. The coverage radius, R, is determined by the requirement of the link quality. Once
R is found, the percentage of a cell that belongs to region $A_1$, $A_2$ or $A_3$ can be calculated.
They are denoted by $p_1$, $p_2$ and $p_3$, respectively. These relationships are discussed later in
the section.

Figure 17.1 System layout for overlapping coverage.

Figure 17.2 Three kinds of regions in the coverage of a base.

Reuse partitioning that has two partitions of channels, denoted a and b, is used. Systems
with more partitions can be considered similarly. Channels of these two partitions are
referred to as a- and b-type channels, respectively. These two partitions are used with
different cluster sizes, $N_a$ and $N_b$, to allocate channels to base stations. Channels of type a
are equally divided into $N_a$ groups each with $C_a$ channels. Similarly, b-type channels are
equally divided into $N_b$ groups each with $C_b$ channels. Every base station is assigned
one group of channels from each partition in such a way that co-channel interference is
minimized. As a result, there is a total of $C_a + C_b$ channels available at a base. Let $C_T$
denote the total number of available channels and f denote the fraction of channels that are
assigned to partition a. So,

$$f = N_a C_a / C_T \quad \text{and} \quad 1 - f = N_b C_b / C_T \tag{1}$$

Since $N_a$ and $N_b$ are different, $N_a$ is assumed to be the smaller one. Consequently a-type
channels must be used in a smaller area. Channels of type a are intended for use by users
who can access only one base, which corresponds to users in region $A_1$. Channels of type
b can be used by all users. Figure 17.3 shows the channel assignment and the available
channel groups in a specific region for such a system with $N_a = 3$ and $N_b = 7$. The three
groups of a-type channels are labeled $a_1$, $a_2$ and $a_3$, and the seven groups of b-type channels
are labeled $b_1, b_2, \ldots, b_7$. Because of overlapping coverage, users in region $A_2$ can access
two groups of channels from partition b. Similarly, users in $A_3$ can access three groups of
b-type channels.
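
As a small worked example of Equation (1), the sketch below computes the group sizes $C_a$ and $C_b$ from $C_T$, f, $N_a$ and $N_b$. The function name is hypothetical, and the value f = 1/2 used in the demo is an assumption chosen only because it yields integer group sizes; the text does not specify f.

```python
from fractions import Fraction


def channels_per_group(c_total, f, n_a, n_b):
    """Solve f = N_a*C_a/C_T and 1 - f = N_b*C_b/C_T for (C_a, C_b)."""
    f = Fraction(f)
    c_a = f * c_total / n_a
    c_b = (1 - f) * c_total / n_b
    if c_a.denominator != 1 or c_b.denominator != 1:
        raise ValueError("partition does not split into whole channels")
    return int(c_a), int(c_b)


if __name__ == "__main__":
    c_a, c_b = channels_per_group(180, Fraction(1, 2), 3, 9)
    print(c_a, c_b, c_a + c_b)  # channels per base: C_a + C_b
```

Exact rational arithmetic is used so that an infeasible split is reported instead of being silently rounded.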

Figure 17.3 The relationship between channel groups and various regions for $N_a = 3$ and
$N_b = 7$.

Priority to handoff calls is given at each base, by allocating $C_{ha}$ of the $C_a$ a-type channels
to handoffs. Similarly, $C_{hb}$ of the $C_b$ b-type channels are reserved at a base. Specific channels
are not reserved, only the numbers, $C_{ha}$ and $C_{hb}$. As a result, new calls that arise in region $A_1$
can access $C_a - C_{ha}$ a-type channels and $C_b - C_{hb}$ b-type channels at the corresponding
base. New calls that arise in region $A_2$ or $A_3$ can access $C_b - C_{hb}$ channels of b-type at
each potential base. New calls that arise in region $A_1$ are served by b-type channels only
when no a-type channel is available. When more than one base has a channel available to
serve a new call that arises in region $A_2$ or $A_3$, the one with the best link quality is chosen.
Most likely, it is the nearest one under uniform propagation and flat terrain conditions.
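
The admission rule for new calls described above can be written out as a short function. This is a sketch under stated assumptions: the per-base occupancy is passed as a dictionary, candidate bases are already ordered by link quality (best first), and the function name and return convention are illustrative.

```python
def admit_new_call(region, bases, c_a, c_b, c_ha, c_hb):
    """Return (channel_type, base_index) for an admitted new call, or None
    if the call is blocked.

    region: 1, 2 or 3 (regions A1, A2, A3).
    bases:  candidate bases ordered by link quality, best first; each is a
            dict with the numbers of a- and b-type channels in use.
    """
    if region == 1:
        base = bases[0]
        # a-type channels first, leaving C_ha of them for handoffs.
        if base["a"] < c_a - c_ha:
            return ("a", 0)
        # Fall back to b-type channels, leaving C_hb for handoffs.
        if base["b"] < c_b - c_hb:
            return ("b", 0)
        return None
    # Regions A2 and A3: only b-type channels, at the best available base.
    for idx, base in enumerate(bases):
        if base["b"] < c_b - c_hb:
            return ("b", idx)
    return None
```

Handoff arrivals would follow the same structure but compare against the full group sizes $C_a$ and $C_b$, since the reserved channels are available to them.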

The possible types of handoffs are shown in Figure 17.3. The received signal power is
inversely proportional to the distance raised to an exponent, $\gamma$, which is called the path loss
exponent. The coverage radius, R, is determined such that the worst case SIR of a-type
calls is equal to the worst case SIR of b-type calls. Therefore, the desired R is the one that
satisfies $\mathrm{SIR}_a(R, N_a, \gamma) = \mathrm{SIR}_b(R, N_b, \gamma)$. Mobile users are assumed to be uniformly
distributed throughout the service area. Call arrivals follow a Poisson point process with
a rate $\lambda_n$ per cell, and this rate is independent of the number of calls in progress. Session
duration, T, is a random variable with a negative exponential probability density function
with mean $\bar{T}$ ($= \mu^{-1}$). An a (or b)-type call resides within the pertaining region during the
dwell time $T_a$ (or $T_b$).

These are random variables with a negative exponential distribution of mean $\bar{T}_a$ ($= \mu_a^{-1}$)
(or $\bar{T}_b$ ($= \mu_b^{-1}$)). $\bar{T}_a$ and $\bar{T}_b$ are assumed to be proportional to the radii of the corresponding
areas. Although region $A_1$ is not circular, an equivalent radius (which is the radius of a
circle that has the same area as $A_1$) can be used. System modeling is based on using two
variables to specify the state of a base. One is the number of a-type channels in use;
the other is the number of b-type channels in use. A complete state representation for the
whole system will be a string of base station states with two variables for each base. This 
state representation can keep track of all events that occur in the system. Unfortunately,
the huge number of system states precludes using this approach for most cases of interest.
A simplified approach is to decouple a base from others using average teletraffic demands
related to neighboring bases. This is similar to the approach used in References [13, 14].
As a result, the state, s, is characterized for a given base by the pair (a(s), b(s)), where a(s) is the
number of a-type channels in use in state s, and b(s) is the number of b-type channels in use
in state s, with constraints $0 \le a(s) \le C_a$ and $0 \le b(s) \le C_b$. The state probabilities, P(s),
in statistical equilibrium are needed for determining the performance measures of interest. 
To calculate state probabilities, the state transitions and the corresponding transition rates 
must be identified and calculated. Owing to overlapping coverage and reuse partitioning, 
state transitions result from six driving processes: 

(1) new call arrivals; 

(2) call completions; 

(3) handoff departures of a type; calls that use a-type channels are a-type calls and those
that use b-type channels are b-type calls. An a-type call initiates a handoff (a-type
handoff) when it leaves region $A_1$. An a-type handoff may be an inter-base or an
intra-base handoff depending on which base continues its service (see Figure 17.3);

(4) handoff arrivals of a type;

(5) handoff departures of b type; a b-type call initiates a handoff (b-type handoff) only
when it leaves the coverage of the serving base. A b-type handoff that enters region $A_1$
of a neighboring base (out of the coverage of the serving base) can access channels
from both a-type and b-type (a-type channels first) at that target base. A b-type
handoff that enters region $A_2$ can access all channels of b-type at each potential
target base; and

(6) handoff arrivals of b type.

The transition rates from a current state s to next state $s_n$ due to these driving processes are
denoted by $r_n(s, s_n)$, $r_c(s, s_n)$, $r_{da}(s, s_n)$, $r_{ha}(s, s_n)$, $r_{db}(s, s_n)$ and $r_{hb}(s, s_n)$, respectively.
The details of the state transition rate derivation and of solving the system of state probability
equations are based on the general principles discussed in Chapter 6. Some specific details
can also be found in References [7, 8, 13, 14-16] and especially [17].
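
To illustrate why the decoupled per-base model is used, the sketch below enumerates the per-base states (a(s), b(s)) and contrasts their number with the size of the full system state space. The group sizes $C_a$ = 30 and $C_b$ = 10 in the demo are illustrative assumptions, as are the function names.

```python
def base_states(c_a, c_b):
    """All per-base states (a, b) with 0 <= a <= C_a and 0 <= b <= C_b."""
    return [(a, b) for a in range(c_a + 1) for b in range(c_b + 1)]


def system_state_count(c_a, c_b, n_bases):
    """Number of states if every base is tracked jointly."""
    return ((c_a + 1) * (c_b + 1)) ** n_bases


if __name__ == "__main__":
    print(len(base_states(30, 10)))        # states of one decoupled base
    print(system_state_count(30, 10, 7))   # joint states of a 7-base cluster
```

The per-base count grows only with the product of the group sizes, whereas the joint count grows exponentially with the number of bases, which is what makes the full representation impractical.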

Some examples of performance results with $C_T = 180$, $N_a = 3$, $N_b = 9$, $\bar{T} = 100$ s,
$\bar{T}_b = 60$ s, $C_{ha} = 0$, $C_{hb} = 0$ and $\gamma = 4$, are shown in Figures 17.4-17.6.



17.2 IMBEDDED MICROCELL IN CDMA MACROCELL NETWORK 

Combining a macrocell and microcell with a two-tier overlaying and underlaying structure
for a CDMA system is becoming an increasingly popular trend in 4G wireless networks.
However, the cochannel interference from a microcell to macrocell and vice versa in this 
two-tier structure is different from that in the homogeneous structure (i.e. consisting of only 
macrocells or only microcells). 

Figure 17.4 Overall blocking probability for FCA, coverage overlap and coverage overlap
with reuse partitioning.

Figure 17.5 Forced termination probability for FCA, coverage overlap, and coverage overlap
with reuse partitioning.

Figure 17.6 Blocking probability in areas $A_1$, $A_2$ and $A_3$ for FCA, coverage overlap and
coverage overlap with reuse partitioning.

Figure 17.7 Macrocell/microcell overlayed/underlayed structure.



The possibility of embedding a CDMA microcell into a TDMA macrocell is discussed 
in References [18, 19], and a CDMA microcell into a CDMA macrocell in Wu et al. [20]. 
Figure 17.7 presents the geometry of the overlayed and underlayed CDMA macrocell (radius 
R)/CDMA microcell (radius r) structure, where a microcell is located at distance d_1 from 
the BS of the central macrocell. Propagation loss before the breakpoint (bp) is assumed to be 
proportional to d^2; otherwise it is proportional to d^4, where d is the distance from 
the transmitter. The received power is represented as 

$$
P_r =
\begin{cases}
\dfrac{(\lambda/4\pi)^2 P_t}{d^2}, & d < bp = \dfrac{4\pi h_m h_b}{\lambda} \\[2ex]
\dfrac{(h_m h_b)^2 P_t}{d^4}, & \text{otherwise}
\end{cases}
\tag{17.1}
$$

where P_t is the power of the transmitter, h_m and h_b are the heights of the mobile and BS 
antennas, respectively, λ is the wavelength and bp is the breakpoint [17-20]. The MS will 
access the cell BS with the strongest pilot signal. After an MS decides to access a certain 
cell BS, the cell BS verifies whether the uplink signal-to-interference ratio (C/I) from the 
mobile is less than a threshold (C/I)_tu. If the C/I is less than the threshold (C/I)_tu, the 
MS is inhibited and blocked. In other words, for a call to be accepted we should have 

$$
\frac{C}{I} = \frac{P_{rj}}{\sum_{i=1,\, i \neq j}^{M} P_{ri} + \sum_{k=1}^{N} P'_{tk}\,\beta'_{k}}
\ge \left(\frac{C}{I}\right)_{tu}
\tag{17.2}
$$

where P_rj is the power received from the desired jth MS by a dedicated BS, P'_tk is the power 
transmitted from the kth MS belonging to other BSs, and β'_k is the path loss from the kth MS 
to the dedicated BS. The first and second terms in the denominator represent the interference 
from other MSs in the same BS and from MSs belonging to other BSs, respectively. Thermal 
noise is rather small relative to the cells' interference, so it is neglected. If P_r is increased, 
the interference from other cells is relatively smaller so that the C/I requirement for newly 
active users can be satisfied. 
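
The two-slope propagation model of Equation (17.1) and the admission rule of Equation (17.2) 
can be sketched in a few lines of Python. This is a minimal illustration, not the simulator 
of Wu et al. [20]: the function names, the argument layout and the convention that the 
threshold is passed in dB are assumptions made here. 

```python
import math


def breakpoint_distance(h_m, h_b, wavelength):
    """Breakpoint bp = 4*pi*h_m*h_b/lambda of Equation (17.1), in metres."""
    return 4.0 * math.pi * h_m * h_b / wavelength


def received_power(p_t, d, h_m, h_b, wavelength):
    """Two-slope received power of Equation (17.1)."""
    if d < breakpoint_distance(h_m, h_b, wavelength):
        # Before the breakpoint: inverse-square law.
        return (wavelength / (4.0 * math.pi)) ** 2 * p_t / d ** 2
    # Beyond the breakpoint: fourth-power law.
    return (h_m * h_b) ** 2 * p_t / d ** 4


def uplink_call_accepted(p_rj, same_cell_rx, other_cell_tx, other_cell_loss,
                         threshold_db):
    """Admission test of Equation (17.2); thermal noise is neglected.

    same_cell_rx holds the received powers of the other MSs of the same BS
    (the desired MS excluded); other_cell_tx and other_cell_loss hold the
    transmit powers and path losses of MSs that belong to other BSs.
    """
    interference = sum(same_cell_rx) + sum(
        p * b for p, b in zip(other_cell_tx, other_cell_loss))
    return 10.0 * math.log10(p_rj / interference) >= threshold_db
```

The two branches of `received_power` meet at the breakpoint, which is why bp is defined as 
4πh_m h_b/λ. 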



17.2.1 Macrocell and microcell link budget 

For the handoff analysis we want to find the point where the strength of the pilot signal 
emitted from the microcell BS is equal to that from the macrocell BS at the microcell 
boundary. The microcell boundary is assumed here to be distant from the macrocell bp, so the 
fourth-power law of Equation (17.1) applies. The distance from the microcell BS to the 
center of the microcell is x. Therefore, the pilot power (emitted from the microcell BS) 
received at boundary point D shown in Figure 17.7 is represented as 

$$
P_{rsD} = \frac{(h_m h_{bs})^2 \alpha_s P_{ts}}{(r - x)^4}
\tag{17.3}
$$

where α_s is the power fraction of the pilot signal for microcells. Similarly, the pilot power 
received from the macrocell BS at boundary point D is represented by 

$$
P_{rlD} = \frac{(h_m h_{bl})^2 \alpha_l P_{tl}}{(d_1 - r)^4}
\tag{17.4}
$$

where α_l is the ratio of pilot power to the BS's downlink power and d_1 is the distance 
between the centers of the macrocell and the microcell. The above equations follow from 
Equation (17.1). The subscripts s and l denote the microcell (small cell) and the macrocell 
(large cell), respectively. Since the strengths of the pilot signals emitted from the 
microcell BS and the macrocell BS are equal at point D (i.e. P_rsD = P_rlD), the transmission 
powers of the two BSs must satisfy the following formula: 

$$
\frac{P_{ts}}{P_{tl}} = \frac{h_{bl}^2 (r - x)^4 \alpha_l}{h_{bs}^2 (d_1 - r)^4 \alpha_s}
\tag{17.5}
$$

The same holds for point E, which is at distance r + x from the microcell BS, as shown in 
Figure 17.7. Thus, we obtain 

$$
P_{rsE} = \frac{(h_m h_{bs})^2 \alpha_s P_{ts}}{(r + x)^4}
\quad \text{and} \quad
P_{rlE} = \frac{(h_m h_{bl})^2 \alpha_l P_{tl}}{(d_1 + r)^4}
\tag{17.6}
$$

Equating P_rsE and P_rlE yields 

$$
\frac{P_{ts}}{P_{tl}} = \frac{h_{bl}^2 (r + x)^4 \alpha_l}{h_{bs}^2 (d_1 + r)^4 \alpha_s}
\tag{17.7}
$$

Dividing Equation (17.5) by Equation (17.7) gives x = r^2/d_1, which means that the 
microcell BS must be shifted from the cell center by a distance x. As d_1 gets smaller, the 
value of x gets larger. This causes an asymmetric environment and makes the site 
engineering more difficult. Substituting x = r^2/d_1 into Equation (17.5) gives the power 
ratio 

$$
\frac{P_{ts}}{P_{tl}} = \frac{h_{bl}^2 r^4 \alpha_l}{h_{bs}^2 d_1^4 \alpha_s}
\tag{17.8}
$$
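
As a quick numerical check of the link budget, the short Python sketch below computes the 
offset x = r^2/d_1 and the power ratio of Equation (17.8), and then confirms that the two 
pilots are equally strong at both boundary points D and E. The function names and the 
example values (d_1 = 5000 m, h_m = 1.5 m, pilot fractions of 0.2) are illustrative 
assumptions; only the antenna heights of 6 and 45 m and r = 1000 m appear in the text. 

```python
def microcell_bs_offset(r, d1):
    """Offset x = r^2/d1 of the microcell BS from the microcell centre."""
    return r ** 2 / d1


def pilot_power(h_m, h_b, alpha, p_t, distance):
    """Pilot power beyond the breakpoint, as in Equations (17.3)-(17.6)."""
    return (h_m * h_b) ** 2 * alpha * p_t / distance ** 4


def power_ratio(h_bs, h_bl, alpha_s, alpha_l, r, d1):
    """Transmit power ratio P_ts/P_tl of Equation (17.8)."""
    return (h_bl ** 2 * r ** 4 * alpha_l) / (h_bs ** 2 * d1 ** 4 * alpha_s)


# Hypothetical example values (see the lead-in above).
r, d1, h_m, h_bs, h_bl, alpha = 1000.0, 5000.0, 1.5, 6.0, 45.0, 0.2
x = microcell_bs_offset(r, d1)
p_tl = 1.0
p_ts = p_tl * power_ratio(h_bs, h_bl, alpha, alpha, r, d1)

# Pilot strengths at the near boundary point D and the far boundary point E.
micro_at_d = pilot_power(h_m, h_bs, alpha, p_ts, r - x)
macro_at_d = pilot_power(h_m, h_bl, alpha, p_tl, d1 - r)
micro_at_e = pilot_power(h_m, h_bs, alpha, p_ts, r + x)
macro_at_e = pilot_power(h_m, h_bl, alpha, p_tl, d1 + r)
```

With these values the microcell BS sits 200 m from the cell center, and each pair 
(`micro_at_d`, `macro_at_d`) and (`micro_at_e`, `macro_at_e`) agrees to rounding error. 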



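The downlink C/Is for the central macrocell and for the microcell can be expressed as [1]: 

$$
\left(\frac{C}{I}\right)_l =
\frac{P_{tl}(1 - \alpha_l)\beta / N}
{\left[1 - \frac{1 - \alpha_l}{N}\right] P_{tl}\beta + P_{ts}\beta' + \sum_{k=1}^{6} P_{tl}\phi_k}
\tag{17.9}
$$

$$
\left(\frac{C}{I}\right)_s =
\frac{P_{ts}(1 - \alpha_s)\beta' / M}
{\left[1 - \frac{1 - \alpha_s}{M}\right] P_{ts}\beta' + P_{tl}\beta + \sum_{k=1}^{6} P_{tl}\phi_k}
\tag{17.10}
$$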



where N and M denote the number of active users in the central macrocell BS and the microcell 
BS, β, β' and φ are the path losses for the macrocell, the microcell and an adjacent 
macrocell, respectively, and the third term in the denominator represents the interference 
from the six adjacent macrocells. The above formulas are the decision rules for the downlink 
to accept a newly active MS. For a given threshold C/I they determine the maximum N and M 
(capacity). The uplink C/Is measured by the macrocell and microcell BSs are derived as 

$$
\left(\frac{C}{I}\right)_l =
\frac{P_r}{(N - 1)P_r + \sum_{i=1}^{M} P'_r \left(\frac{r_i}{d_i}\right)^4
\left(\frac{h_{bl}}{h_{bs}}\right)^2 + I_{ll}}
\tag{17.11}
$$

and 

$$
\left(\frac{C}{I}\right)_s =
\frac{P'_r}{(M - 1)P'_r + \sum_{i=1}^{N} P_r \left(\frac{r_i}{d_i}\right)^4
\left(\frac{h_{bs}}{h_{bl}}\right)^2 + I_{ls}}
\tag{17.12}
$$

respectively, where P_r and P'_r are the mobile powers received by the macrocell BS and the 
microcell BS, respectively, and M and N denote the number of active mobiles in each microcell 
and macrocell, respectively. The first and second terms in the denominator represent the 
interference from other MSs in the same BS and from all MSs in the other kind of BS, and 
r_i and d_i are the distances from the ith mobile to its own BS and to the other kind of BS, 
respectively. Finally, the third term I_ll (or I_ls) represents the interference from 
adjacent macrocells to the central macrocell BS (or microcell BS). The above formulas are 
the decision rules for the uplink to accept a newly active MS. To find an expression for I_ll 
(or I_ls), N mobile units are assumed to be uniformly distributed in an equivalent disc of 
radius R, as shown in Figure 17.8. The density of mobile units is ρ = N/(πR^2) [1, 20]. 

Figure 17.8 Cell geometry for calculation of interference. 

The power received at the BS from each mobile is assumed here to be P_r, with a path 
loss proportional to the fourth power of distance. The interference caused by the mobiles in 
an adjacent macrocell is 

$$
P(d) = \frac{2 N P_r}{\pi R^2} \int_0^R \int_0^{\pi}
\frac{r^5 \, d\theta \, dr}{(d^2 + r^2 - 2 d r \cos\theta)^2}
\tag{17.13}
$$

where r is the distance between the mobile and the adjacent macrocell BS, 
x = √(d^2 + r^2 - 2dr cos θ) is the distance between the mobile in the adjacent macrocell and 
the interfered BS, and d is the distance between the two BSs. Therefore, the interference to 
the central macrocell BS caused by the six adjacent macrocells can be calculated as 

$$
I_{ll} = \sum_{i=1}^{6} P(d_i = 2R) = 0.378 N P_r
\tag{17.14}
$$

Similarly, the interference to the microcell BS caused by the six adjacent macrocells is 
expressed as 

$$
I_{ls} = \sum_{i=1}^{6} P(d_i) \left(\frac{h_{bs}}{h_{bl}}\right)^2 = i_s N P_r
\tag{17.15}
$$

where 

$$
i_s = 2 \left(\frac{h_{bs}}{h_{bl}}\right)^2 \sum_{i=1}^{6}
\left[ 2 \frac{d_i^2}{R^2} \ln\frac{d_i^2}{d_i^2 - R^2}
- \frac{4 d_i^4 - 6 d_i^2 R^2 + R^4}{2 (d_i^2 - R^2)^2} \right]
\tag{17.16}
$$

depending on the location of the microcell. 
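
The closed form inside the sum of Equation (17.16) is the integral of Equation (17.13) 
evaluated analytically. The sketch below, written for illustration only, integrates 
Equation (17.13) with a plain midpoint rule and compares the result with one bracketed 
summand of Equation (17.16) (height ratio omitted); both are normalized by N P_r. The 
function names and grid sizes are assumptions. The sketch checks only that the two 
expressions agree; it does not try to reproduce the constant quoted in Equation (17.14), 
which presumably depends on how the hexagonal cell is mapped onto the equivalent disc. 

```python
import math


def interference_numeric(d, R, n_r=400, n_theta=400):
    """P(d)/(N*P_r) from Equation (17.13), by a midpoint rule."""
    dr = R / n_r
    dtheta = math.pi / n_theta
    total = 0.0
    for i in range(n_r):
        r = (i + 0.5) * dr
        for k in range(n_theta):
            theta = (k + 0.5) * dtheta
            x_sq = d * d + r * r - 2.0 * d * r * math.cos(theta)
            total += r ** 5 / x_sq ** 2
    return 2.0 / (math.pi * R ** 2) * total * dr * dtheta


def interference_closed_form(d, R):
    """P(d)/(N*P_r) as one summand of Equation (17.16), times the factor 2."""
    d2, R2 = d * d, R * R
    log_term = 2.0 * (d2 / R2) * math.log(d2 / (d2 - R2))
    rational = (4.0 * d2 * d2 - 6.0 * d2 * R2 + R2 * R2) / (2.0 * (d2 - R2) ** 2)
    return 2.0 * (log_term - rational)
```

For any d > R, for example `interference_numeric(3.0, 1.0)` against 
`interference_closed_form(3.0, 1.0)`, the two values agree closely, which supports the 
reconstruction of both equations. 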



17.2.2 Performance example 

A detailed analysis of the above system by simulation is given in Wu et al. [20]. Four 
different power control schemes are analyzed: 

(1) both the macrocells and microcells use signal-strength-based power control for the 
uplink and there is no power control for the downlink; 

(2) scheme 2(a) [or scheme 2(b)]: both the macrocells and microcells use signal-strength- 
based power control for the uplink, and downlink power control is also applied for 
the microcell (or for both macrocells and microcells); 

(3) both the macrocells and microcells use C/I power control for the uplink and 
there is no power control for the downlink; 

(4) both the macrocells and microcells apply C/I power control for the uplink, 
but only the microcell uses power control for the downlink. 

In addition, the antenna heights of macrocells and microcells are 45 and 6 m, respectively. 
The radius of the microcell is r = 1000 m, the radius of the macrocell is R, and R = 10r. 
The bps are 342 m for the microcell and 2569 m for the macrocell. 

The arrival process for users in a cell is Poisson with a mean arrival rate 
of λ, and the holding time for a call is exponentially distributed with a mean of 1/μ. 
The required E_b/N_0 for the uplink and downlink in a CDMA system is around 7 and 5 dB, 
respectively, if the bit error rate is to be less than 10^-3. The major performance measure 
we are concerned with is the capacity. Under the maximum loading condition, the background 
noise is very small relative to the interference, so that C/I is the only index to survey. 
The processing gain for a CDMA system is defined as the ratio of bandwidth to data rate, 
that is, about 21 dB (i.e. 1.2288 MHz/9.6 kb/s), so that the threshold (C/I)_tu is -14 dB 
for the uplink and the threshold (C/I)_td is -16 dB for the downlink. For this set of 
parameters the blocking probabilities for the two cells are given in Figures 17.9 and 17.10. 
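
The C/I thresholds follow from the required E_b/N_0 and the processing gain: in dB, the 
threshold is the required E_b/N_0 minus the processing gain. The short Python sketch below 
reproduces this arithmetic for the quoted bandwidth and data rate; the function names are 
illustrative assumptions. 

```python
import math


def processing_gain_db(bandwidth_hz, data_rate_bps):
    """Processing gain (bandwidth over data rate) in dB."""
    return 10.0 * math.log10(bandwidth_hz / data_rate_bps)


def c_over_i_threshold_db(eb_n0_db, bandwidth_hz, data_rate_bps):
    """C/I threshold in dB: required Eb/N0 minus the processing gain."""
    return eb_n0_db - processing_gain_db(bandwidth_hz, data_rate_bps)


gain = processing_gain_db(1.2288e6, 9.6e3)               # about 21.07 dB
uplink = c_over_i_threshold_db(7.0, 1.2288e6, 9.6e3)     # about -14.07 dB
downlink = c_over_i_threshold_db(5.0, 1.2288e6, 9.6e3)   # about -16.07 dB
```

The negative thresholds reflect the fact that a CDMA receiver operates with the desired 
signal well below the total interference before despreading. 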



17.3 MULTITIER WIRELESS CELLULAR NETWORKS 



In this section we further extend the previous problem to a general cell-design methodology 
in a multitier wireless cellular network. Multitier networks may be a solution when there is a 
multitude of traffic types with drastically different parameters and/or different requirements, 
such as different mobility parameters or quality-of-service requirements. In such situations, 
it may be cost-effective to build a multitude of cellular infrastructures, each serving a 
particular traffic type. The network resources (channels) are then partitioned among the 
multitude of tiers. In general terms, we are interested in quantifying the cost reduction 
due to the multitier network design, as opposed to a single-tier network. 4G networks are 
expected to serve different mobility platforms and support multimedia applications through 
a newly deployed infrastructure based on the multitier approach. 

The cellular systems for 4G wireless multimedia networks will rely on cells that 
are smaller than those used today. In particular, in the proposed microcellular systems, 
the cell radius can shrink to as small as 50 m (compared with the 0.5-10 mile radius range 
for today's macrocellular systems). Smaller cell radii are also possible for systems with 
smaller coverage, such as pico- and nanocells in a local-area environment. The size of the 
cells is closely related to the expected speed of the mobiles that the system is to support. 
In other words the faster the mobiles move, the larger the cells should be. This will keep 
the complexity of handoffs at a manageable level. The cell size is also dependent on the 
expected system load (Erlangs per area unit). The larger the system load, the smaller the cell 
should be. This is the direct outcome of the limited wireless resources assigned to handle 
cell traffic. Smaller cells result in channels being reused a greater number of times and thus 
greater total system capacity. Finally, the cell size also depends on the required QoS level. 
This includes the probability of call blocking, call dropping or call completion. In general, 
the more stringent the QoS, the smaller the cells need to be, since smaller cells increase the 
total system capacity. Unfortunately, smaller cells require more base stations, and hence 
higher costs. 

In this section, we discuss the multitier concept to show how to optimize the design of a 
system with differing traffic requirements and mobile characteristics. We examine how to 
lay out a multitier cellular system, given a total number of channels, the area to be covered, 
the average speed of mobiles in a tier, call arrival and duration statistics for each tier, and a 
constraint on the QoS (i.e. blocking and dropping probabilities). We discuss how to design 
a multitier cellular system in terms of the number of tier-i cells (e.g. of macrocells and 
microcells in a two-tier system) and the number of channels allocated to each tier so that 
the total system cost is minimized. 

17.3.1 The network model 

For each tier, the total area of the system (coverage area) is partitioned into cells. All cells 
of the same tier are of equal size. The network resources are also partitioned among the 
network tiers. Channels allocated to a particular tier are then reused based on the reuse 
factor determined for the mobiles of that tier (i.e. within each tier, channels are divided into 
channel sets). One such set is then allocated to each cell of that tier. For simplicity, FCA 
for the allocation of channels both among the tiers and within a tier is used. 

In this section, connection-oriented traffic is considered. Each tier is identified by several 
parameters: call arrival rate, call duration time, call data rate, average speed of mobiles, 
performance factors and QoS factors. Additional assumptions are that: 

• handoffs and new calls are served from the same pool of available channels which means 
that, for a given tier, the probability of blocking equals the probability of dropping; 

• the speed of each class of mobiles is constant for each tier; 

• new call arrivals and call terminations are independent of the handoff traffic; 

• the traffic generation is spatially uniformly distributed; 

• call arrivals follow a Poisson process; 

• the spatial and time distribution of call arrivals are the same over all cells in a given tier, 
but can be different from tier to tier; 

• call duration for a given tier is exponentially distributed, and may vary from tier to tier. 

In the presentation the following notation is used [21]: 

• λ_i^0, rate of tier i calls initiated per unit area [calls/(s m^2)]; 

• λ_i, call initiation rate in a tier i cell (calls/s); 

• 1/μ_i, average call duration of a tier i call (s/call); 

• h_i, mean number of handoffs for a tier i call (handoffs/call); 

• γ_i, the relative cost of a tier i base station to the cost of a tier S base station (γ_S = 1); 

• A, total area to be covered (m^2); 

• S, number of tiers; 

• C, total number of available network channels; 

• C_i, number of channels allocated to tier i; 

• N_i, number of channels allocated to a tier i cell; 

• f_i, number of tier i cells in the frequency reuse cluster; 

• m_i, number of tier i cells contained in a tier (i - 1) cell; tier 0 defines the total 
coverage area, so that m_1 is, by definition, the total number of tier 1 cells; 

• R_i, radius of a tier i cell (m); 

• V_i, average speed of tier i mobile users (m/s); 

• P_B(i), actual blocking probability for a tier i cell; 

• P_D(i), actual dropping probability for a tier i cell; 

• P_B^max(i), the maximum acceptable blocking probability for tier i calls; 

• P_D^max(i), the maximum acceptable dropping probability for tier i calls; 

• P_L(i), loss probability of tier i; 

• PLT, overall system loss probability, weighted by the amount of traffic of each tier; 

• PLT_max, the maximum acceptable weighted system loss probability; 

• TSC, total system cost; 

• TSC_max, maximum allowable total system cost (an input design parameter). 

Cost optimization aims to minimize the cost of the multitier infrastructure. In our 
model, we assume that the major part of the total system cost is the cost of base station 
deployment, which is proportional to the total number of base stations. Thus, 
the total system cost, TSC, is: 

$$
TSC = m_1\{\gamma_1 + m_2[\gamma_2 + m_3(\cdots)]\}
= \sum_{i=1}^{S} \gamma_i \prod_{j=1}^{i} m_j, \quad S \ge 1
\tag{17.17}
$$

In the above equation, we assume that cells are arbitrarily split so that the location of 
tier i base stations is independent of the location of the base stations of the other tiers. 
In a more structured splitting, cells of tier i form rings within a cell of tier (i - 1), as 
shown for the two-tier case in Figure 17.11. Thus, if a base station already exists in a cell 
for tier i, the cost of placing an additional base station for a higher tier is negligible. 
Therefore, for example, in the case of a two-tier network the total cost can be reduced by 
the cost of the tier 2 base stations that share a site with a tier 1 base station. In such a 
case, the total cost is 

$$
TSC = \sum_{i=1}^{S} \gamma_i \left[ \prod_{j=1}^{i} m_j - \prod_{j=1}^{i-1} m_j \right]
= \gamma_1 m_1 + \sum_{i=2}^{S} \gamma_i (m_i - 1) \prod_{j=1}^{i-1} m_j, \quad S > 1
\tag{17.18}
$$

where the second product is taken as zero for i = 1. 

Figure 17.11 Cell splitting in multitier network. 
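
The two cost models can be compared directly. The following Python sketch evaluates the sum 
of Equation (17.17) and the right-hand form of Equation (17.18); the function names and the 
list-based interface (γ_i and m_i passed as lists ordered from tier 1 upward) are 
assumptions made for illustration. 

```python
from math import prod


def tsc_independent(gammas, ms):
    """Equation (17.17): every tier i cell has its own base station."""
    return sum(g * prod(ms[: i + 1]) for i, g in enumerate(gammas))


def tsc_structured(gammas, ms):
    """Equation (17.18): a base station co-located with a lower tier is free."""
    total = gammas[0] * ms[0]
    for i in range(1, len(gammas)):
        total += gammas[i] * (ms[i] - 1) * prod(ms[:i])
    return total


# Two tiers, 4 tier 1 cells, 7 tier 2 cells per tier 1 cell.
cost_independent = tsc_independent([10.0, 1.0], [4, 7])   # 10*4 + 1*28 = 68.0
cost_structured = tsc_structured([10.0, 1.0], [4, 7])     # 10*4 + 1*6*4 = 64.0
```

The difference between the two results is the cost of the tier 2 base stations that 
coincide with tier 1 sites (here 4 stations at unit cost). 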

It will be further assumed that the set of all channels available to tier i, C_i, is 
equally divided among the f_i cells in the frequency reuse cluster, which means there is no 
channel sharing between the cells. There is no channel sharing among the tiers either. It is 
also assumed that no channels are put aside for handling handoffs. Therefore, the blocking 
and dropping probabilities are equal, and will be referred to as the probability of loss, 
P_L(i); that is, 

$$
\forall i, \quad P_L(i) = P_B(i) = P_D(i) \quad \text{and} \quad
P_L^{max}(i) = \min\{P_B^{max}(i), P_D^{max}(i)\}
$$

The overall system loss probability, PLT, is given by: 

$$
PLT = \sum_{i=1}^{S} \frac{\lambda_i^0}{\sum_{j=1}^{S} \lambda_j^0} P_L(i)
\tag{17.19}
$$

The layout design problem is to minimize TSC when the total number of available channels 
is at most C, subject to the following QoS constraints: PLT ≤ PLT_max and 
P_L(i) ≤ P_L^max(i). In other words: find {m_i} minimizing TSC with Σ_{i=1}^{S} C_i ≤ C, 
PLT ≤ PLT_max and P_L(i) ≤ P_L^max(i) [21]. 

The average number of handoffs, h_i, that a call of tier i will undergo during its 
lifetime is 

$$
h_i = \frac{(3 + 2\sqrt{3}) V_i}{9 \mu_i R_i}, \quad i = 1, 2, \ldots, S \quad [22]
$$

Now assume that there are a total of M_i cells in tier i (M_i = Π_{j=1}^{i} m_j). Then the 
total average number of handoffs per second in all the cells is h_i λ_i M_i. The total 
arrival rate of handoff and initial calls in all tier i cells is λ_i M_i (h_i + 1), and the 
average total (handoff and initial call) rate per cell is λ_i^total = λ_i (h_i + 1). It is 
assumed that the total arrival process to a cell is still Markovian, with the average rate 
λ_i^total. Since the area of a hexagon with radius R is 3√3R^2/2, we obtain 
λ_i = 3√3λ_i^0 R_i^2/2. This leads to 

$$
\lambda_i^{total} = \lambda_i^0 R_i \,
\frac{(2 + \sqrt{3}) V_i + 3\sqrt{3} \mu_i R_i}{2 \mu_i}
\tag{17.20}
$$

The average call termination rate is [22] μ_i^total = μ_i (1 + 9h_i). We can use the Erlang-B 
formula (see Chapter 6) extended to M/G/c/c systems. The probability of loss of a tier i call 
is therefore 

$$
P_L(i) = \frac{\left\{ [\lambda_i (1 + h_i)] / [\mu_i (1 + 9 h_i)] \right\}^{N_i} / N_i!}
{\sum_{j=0}^{N_i} \left\{ [\lambda_i (1 + h_i)] / [\mu_i (1 + 9 h_i)] \right\}^{j} / j!},
\quad i = 1, 2, \ldots, S
\tag{17.21}
$$

where λ_i^total, μ_i^total, λ_i and h_i are given above, and N_i = ⌊C_i/f_i⌋, 
i = 1, 2, ..., S. 
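
The chain from cell radius to loss probability can be sketched as follows. This is a minimal 
Python illustration of Equations (17.20) and (17.21), assuming consistent units (for 
example km, minutes and km/min); the function names are hypothetical, and the Erlang-B value 
is computed with the standard numerically stable recursion rather than with explicit 
factorials. 

```python
import math

SQRT3 = math.sqrt(3.0)


def mean_handoffs(v, mu, radius):
    """Mean number of handoffs per call, h_i = (3 + 2*sqrt(3))*V_i/(9*mu_i*R_i)."""
    return (3.0 + 2.0 * SQRT3) * v / (9.0 * mu * radius)


def cell_call_rate(lambda0, radius):
    """New-call rate in one hexagonal cell, lambda_i = 3*sqrt(3)*lambda_i^0*R_i^2/2."""
    return 1.5 * SQRT3 * lambda0 * radius ** 2


def erlang_b(offered_load, n_channels):
    """Erlang-B loss probability via B(n) = A*B(n-1)/(n + A*B(n-1)), B(0) = 1."""
    b = 1.0
    for n in range(1, n_channels + 1):
        b = offered_load * b / (n + offered_load * b)
    return b


def tier_loss_probability(lambda0, mu, v, radius, channels, reuse):
    """Loss probability P_L(i) of Equation (17.21) with N_i = floor(C_i/f_i)."""
    h = mean_handoffs(v, mu, radius)
    lam = cell_call_rate(lambda0, radius)
    load = lam * (1.0 + h) / (mu * (1.0 + 9.0 * h))
    return erlang_b(load, channels // reuse)
```

The recursion gives the same value as the ratio in Equation (17.21) but avoids overflow of 
the factorial and power terms when N_i is large. 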

For a two-tier system (S = 2), the cost function simplifies to TSC = γ_1 m_1 + 
γ_2 m_1 (m_2 - 1). The parameters m_1, m_2 and R_2, as functions of the area A and the tier 1 
cell radius R_1, can be expressed as 

$$
m_1 = \lceil A / A_1 \rceil = \left\lceil \frac{2\sqrt{3} A}{9 R_1^2} \right\rceil
\tag{17.22}
$$

where A_1 is the area of a tier 1 cell. We assume that both tier 1 and tier 2 cells are 
hexagonally shaped and that tier 2 cells are obtained by suitably splitting the tier 1 cells. 
Each tier 1 cell will contain k layers of tier 2 cells. Figure 17.11 shows an example of a 
tier 1 cell which contains three layers (0, 1 and 2). From the geometry of a hexagon, the 
number of tier 2 cells in a tier 1 cell is given by m_2 = 1 + 6 + ... + 6k = 1 + 3k(k + 1), 
k = 0, 1, ..., where (k + 1) denotes the number of 'circular layers' in the cell splitting. 
Also, the radius of a tier 2 cell is given by R_2 = R_1/√m_2. 

The constrained optimization problem outlined above consists of finding optimal values 
of R_1, R_2, C_1, C_2, m_1 and m_2, which are the system design parameters. The optimal 
values will be referred to as R_1*, R_2*, C_1*, C_2*, m_1* and m_2*. The parameters m_1, 
m_2, C_1 and C_2 are integers, while R_1 and R_2 are continuous variables. However, we only 
need to consider a discrete subset of the possible values of R_1. From Equation (17.22), it 
is sufficient to consider just those values of R_1 for which R_1 = √(2√3A/(9l)), where 
l = m_min1, ..., m_max1; m_min1 and m_max1 are the lower and upper bounds, respectively, of 
the number of tier 1 cells in the system. Thus, R_1 can only assume one of 
m_max1 - m_min1 + 1 values. 

The lower bound, m_min1, is usually assumed to be equal to 1, while the upper bound, 
m_max1, can be estimated from TSC_max, which is given to the designer, as m_max1 = 
⌈TSC_max/γ_1⌉. The following search procedure is used in Ganz et al. [21] to find the design 
parameters. 



    TSC* = TSC_max 
    m_max1 = ⌈TSC_max/γ_1⌉ 
    a_1 = 3√3λ_1^0/2 
    a_2 = 3√3λ_2^0/2 
    b_1 = (3 + 2√3)V_1/(9μ_1) 
    b_2 = (3 + 2√3)V_2/(9μ_2) 
    c = √(2√3A/9) 
    while (m_min1 ≤ m_1 ≤ m_max1) do 
        R_1 = c/√m_1 
        λ_1 = a_1 R_1^2 
        h_1 = b_1/R_1 
        while (1 ≤ C_1 ≤ C) do 
            N_1 = ⌊C_1/f_1⌋ 
            N_2 = ⌊(C - C_1)/f_2⌋ 
            P_L(1) = ({[λ_1(1 + h_1)]/[μ_1(1 + 9h_1)]}^N_1 / N_1!) 
                     / Σ_{j=0}^{N_1} {[λ_1(1 + h_1)]/[μ_1(1 + 9h_1)]}^j / j! 
            for (k = 1; ; k++) do 
                m_2 = 1 + 3k(k + 1) 
                TSC = γ_1 m_1 + γ_2 m_1 (m_2 - 1) 
                if (TSC > TSC*) then break 
                R_2 = R_1/√m_2 
                λ_2 = a_2 R_2^2 
                h_2 = b_2/R_2 
                P_L(2) = ({[λ_2(1 + h_2)]/[μ_2(1 + 9h_2)]}^N_2 / N_2!) 
                         / Σ_{j=0}^{N_2} {[λ_2(1 + h_2)]/[μ_2(1 + 9h_2)]}^j / j! 
                PLT = [λ_1^0 P_L(1) + λ_2^0 P_L(2)]/(λ_1^0 + λ_2^0) 
                if (PLT ≤ PLT_max) and [P_L(1) ≤ P_L^max(1)] and [P_L(2) ≤ P_L^max(2)] then break 
            end for 
            if (TSC < TSC*) then 
                TSC* = TSC 
                m_1* = m_1 
                m_2* = m_2 
                R_1* = R_1 
                R_2* = R_1/√m_2 
                C_1* = C_1 
                C_2* = C - C_1 
            end if 
        end while 
    end while 

The outputs are: TSC*, m_1*, m_2*, R_1*, R_2*, C_1* and C_2*. 

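The search procedure can be turned into a short runnable program. The Python sketch below is 
one possible reading of the procedure, not the code of Ganz et al. [21]: the function names, 
the tuple-based interface, the traffic-weighted form assumed for PLT and the early skip when 
tier 1 alone violates its loss bound are all assumptions made here. Units must be consistent 
(for example km, minutes and km/min). 

```python
import math

SQRT3 = math.sqrt(3.0)


def erlang_b(load, n):
    """Erlang-B loss for an offered load (Erlangs) on n channels."""
    b = 1.0
    for k in range(1, n + 1):
        b = load * b / (k + load * b)
    return b


def tier_loss(lam0, mu, v, radius, n):
    """Loss probability of Equation (17.21) for one tier."""
    h = (3.0 + 2.0 * SQRT3) * v / (9.0 * mu * radius)  # mean handoffs per call
    lam = 1.5 * SQRT3 * lam0 * radius ** 2             # new calls per cell
    return erlang_b(lam * (1.0 + h) / (mu * (1.0 + 9.0 * h)), n)


def design_two_tier(area, channels, lam0, mu, v, gamma, f,
                    tsc_max, plt_max, pl_max, m_min1=1):
    """Exhaustive search for the cheapest feasible two-tier layout.

    lam0, mu, v, gamma, f and pl_max are (tier 1, tier 2) pairs.
    Returns a dict, or None if no layout cheaper than tsc_max is feasible.
    """
    best, best_cost = None, tsc_max
    c = math.sqrt(2.0 * SQRT3 * area / 9.0)
    m_max1 = math.ceil(tsc_max / gamma[0])
    for m1 in range(m_min1, m_max1 + 1):
        r1 = c / math.sqrt(m1)
        for c1 in range(1, channels + 1):
            n1, n2 = c1 // f[0], (channels - c1) // f[1]
            pl1 = tier_loss(lam0[0], mu[0], v[0], r1, n1)
            if pl1 > pl_max[0]:
                continue  # shortcut (an assumption): no choice of m2 can help
            k = 1
            while True:
                m2 = 1 + 3 * k * (k + 1)
                cost = gamma[0] * m1 + gamma[1] * m1 * (m2 - 1)
                if cost > best_cost:
                    break
                r2 = r1 / math.sqrt(m2)
                pl2 = tier_loss(lam0[1], mu[1], v[1], r2, n2)
                plt = (lam0[0] * pl1 + lam0[1] * pl2) / (lam0[0] + lam0[1])
                if plt <= plt_max and pl2 <= pl_max[1]:
                    if cost < best_cost:
                        best_cost = cost
                        best = {"TSC": cost, "m1": m1, "m2": m2, "R1": r1,
                                "R2": r2, "C1": c1, "C2": channels - c1}
                    break
                k += 1
    return best
```

A call such as `design_two_tier(10.0, 30, (0.2, 2.0), (0.33, 0.33), (0.5, 0.05), (10.0, 1.0), 
(3, 3), 500.0, 0.01, (0.01, 0.01))` returns the cheapest layout found for a small 
hypothetical system. Because the cost grows with both m_1 and m_2, the `cost > best_cost` 
test prunes most of the search space once a first feasible layout is known. 
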
17.3.2 Performance example 

To generate numerical results, a system with the parameters shown in Table 17.1 is used 
[21]. The performance of the two-tier system is compared with that of a one-tier system. To 
obtain results for a one-tier system the optimization algorithm with R_1 = R_2 was run. This 
results in both tiers sharing the same cells and m_2 = 1. The total cost is then computed as 
TSC = m_1 γ_1. 



Table 17.1 Example system parameters 

Parameter       Value/range     Units 
A               100             km^2 
C               90              Channels 
S               2 
λ_1^0           0.2-3.0         Calls/(min km^2) 
λ_2^0           5.0-40.0        Calls/(min km^2) 
μ_1, μ_2        0.33            Calls/min 
γ_1             10.0            $ (in 1000s)/base 
γ_2             1.0             $ (in 1000s)/base 
V_1             30-540          km/h 
V_2             1.5-12.0        km/h 
f_1, f_2        3 
TSC_max         10000-20000     $ (in 1000s) 
PLT_max         0.01 
P_L^max(1)      0.01 
P_L^max(2)      0.01 

Figure 17.12 Comparison of the total system costs between the one-tier and two-tier 
systems, λ_1^0 = 1 [calls/(min km^2)] and λ_2^0 = 20 [calls/(min km^2)]; horizontal axis: 
average tier 2 mobile speed (km/h). 

Figure 17.12 depicts the total system cost as a function of the tier 2 mobile speed, with 
the tier 1 mobile speed fixed, for the one-tier and two-tier systems. The tier 1 mobile 
speeds considered are 30, 90, 180, 270, 360 and 540 km/h. 

The main conclusion from Figure 17.12 and many other similar runs [21] is that, for the 
parameter ranges used in this study, the two-tier system outperforms the single-tier system 
for all the values of the slower and faster mobile speeds. 

17.4 LOCAL MULTIPOINT DISTRIBUTION SERVICE 

Wireless systems can establish area-wide coverage with the deployment of a single base 
station. The local multipoint distribution service (LMDS) offers a wireless method of access 
to broadband interactive services. The system architecture is considered point-to-multipoint 
since a centralized hub, or base station, simultaneously communicates with many fixed 
subscribers in the vicinity of the hub. Multiple hubs are required to provide coverage over 
areas larger than a single cell. Because of the fragile propagation environment at 28 GHz, 
LMDS systems have small cells with a coverage radius on the order of a few kilometers. 
Digital LMDS systems can flexibly allocate bandwidth across a wide range of bi-directional 
broadband services including telephony and high-speed data access. 

Multiple LMDS hubs are arranged in a cellular fashion to reuse the frequency spectrum 
many times in the service area. Complete frequency reuse in each cell of the system is 
attempted with alternating polarization in either adjacent cells or adjacent hub antenna 
sectors within the same cell. Subscriber antennas are highly directional with roughly a 
9 inch diameter (30-35 dBi) to provide additional isolation from transmissions in adjacent 
cells and to reduce the received amount of multipath propagation that may cause signal 
degradation. Since cells are small and the entire spectrum is reused many times, the overall 
system capacity is quite high, and backhaul requirements can be large. Backhaul networks 
will probably be a combination of fiber-optics and point-to-point radio links. 

The system capacity comes mainly from the huge radio frequency (RF) bandwidth available: 
block A, 1150 MHz (27.500-28.350, 29.100-29.250 and 31.075-31.225 GHz); and block 
B, 150 MHz (31.000-31.075 and 31.225-31.300 GHz). For the purpose of frequency planning 
for two-way usage it is essential to solve the problem of LMDS spectrum partitioning 
for the upstream and downstream. The standard duplexing options, frequency-division duplex 
(FDD) and time-division duplex (TDD), are applicable in LMDS too and will not be 
discussed here in any more detail. Instead, as a wireless point-to-multipoint system, the 
deployment of LMDS will be assessed by the basic parameters of cell size and capacity. 
Obviously, an operator would want to cover as large an area as possible with a minimum 
number of cell sites, hence maximizing cell size. The modulation and coding, and the ensuing 
E_b/N_0, are factors in determining cell size. However, because of high rain attenuation 
at LMDS frequencies, there is a major trade-off between cell size and system availability 
determined by the rain expectancy for a given geographical area. In most of the United 
States, for the aim of providing 'wireless fiber' availability of 0.9999-0.99999, 
the cells would only be between 0.3 and 2 miles [23]. Consequently, the deployment of 
LMDS will extensively involve multicell scenarios. In the following we review the methods 
of optimizing system capacity in these scenarios through frequency reuse. The main 
problem related to frequency reuse is the interference between different segments of the 
system. Therefore, patterns that create bounded and predictable values of S/I are essential 
to device deployment. Frequency reuse and capacity in interference-limited systems have 
been discussed in previous sections for mobile cellular and personal communications services 
(PCS) systems. Although the basic methods and principles also apply to fixed systems, there 
are several major differences in the treatment of fixed broadband wireless systems such as 
LMDS. 

Unlike mobile cellular, in LMDS the subscriber antennae are highly directional and 
point toward one specific base station. This, together with the nonmoving nature of the 
subscriber, results in much lower link dynamics. The channel can mostly be described 
as Rician (with a strong main ray) rather than Rayleigh, as in mobile cellular. Since the 
subscriber employs directive antennae, it is compelled to communicate with only one base 
station, which excludes the use of macro diversity (or cell diversity) - a very beneficial 
method with mobile cellular, but one that complicates the interference and frequency reuse 
analysis. 

Also, mobile cellular service was originally intended for voice; therefore, it is designed 
for symmetric loading, while broadband wireless services generally have more downstream 
traffic than upstream. This reflects in the main issue of concern for interference study, which 
in PCS and mobile cellular is upstream interference because the base station receiver has 
the hard task of receiving from a multitude of mobiles transmitting through nondirectional 
antennae and suffering different fast fading. 

In LMDS the upstream is usually lower-capacity and employs lower-order modulation, 
which offers better immunity. Also, the slower fading environment allows a closed-loop 
transmission power control system to operate relatively accurately. Consequently, in the 
upstream most subscribers transmit at a power lower than nominal, unlike the downstream 
transmission. Also, the narrow beam of their antennae is an interference-limiting factor. Therefore, 
downstream interference is the most problematic, as opposed to the mobile cellular case. 

Another parallel with mobile cellular refers to cellular patterns. Owing to the mobile 
nature of its subscribers, a mobile cellular system has to provide service from the start to a 
whole area (town, region), with a large number of adjacent cells. The number of sectors is 
relatively low (2-4); otherwise, the mobile would go through frequent handoffs. 

In LMDS, in the long run, owing to the small cell size, the ever increasing hunger for 
bandwidth and the availability of radio and networking technologies, deployment is also 
expected to be blanket coverage. However, the economics of LMDS do not require this from 
the beginning. Operators will probably first offer the service in clusters of business clients 
with high data bandwidth demands and with the financial readiness for the new service, and 
later will gradually expand the service to larger areas. Second, the sectorization will be in 
denser patterns determined by the increasing demand for bandwidth and not limited by the 
requirement for handoff. The narrower sectors employing higher antenna gain in the base 
station also provide for larger cell size. 

Frequency reuse in one cell is illustrated in Figure 17.13 with three examples of frequency 
reuse by sectorization. The first step in frequency planning is to assume the division of the 
available spectrum into subbands, so that adjacent sectors operate on different subbands. 
Also, the sector structure has to be designed as a function of the capacity required and the 
S/I specification of the modem used. The need to divide into subbands is a function of 
base station antenna quality, specifically the steepness of the rolloff from the main lobe to 
the sidelobes in the horizontal antenna pattern. If the antenna sidelobes roll off very 
steeply after the main lobe, it is possible to reuse the frequency every two sectors, 
resulting in patterns of the type A, B, A, B, ..., as in Figure 17.13(b). In this context the 
reuse factor, F_R, is the number of times the whole band is used in the cell, which for the 
simplified regular patterns is the number of sectors divided by the number of subbands; in 
this case F_R = 3. In a more conservative deployment the frequency is reused only in 
back-to-back sectors, as in Figure 17.13(a), which has six sectors with reuse pattern 
A, B, C, A, B, C. Figure 17.13(c) shows a higher-capacity cell where the pattern A, B, C, ..., 
is repeated in 12 sectors, resulting in F_R = 4. Obviously, to keep the equipment cost low, 
we want a low number of sectors, that is, schemes in which the spectrum is divided into as 
few subbands as possible (two is best). 
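
For the regular patterns discussed above, the reuse factor is simple arithmetic. The small 
Python sketch below tabulates it for the three cells of Figure 17.13; the function name and 
the dictionary layout are illustrative assumptions, and the six-sector count for 
Figure 17.13(b) is inferred from F_R = 3 with two subbands. 

```python
def reuse_factor(num_sectors, num_subbands):
    """Reuse factor F_R for a regular pattern: sectors divided by subbands."""
    return num_sectors / num_subbands


# (number of sectors, number of subbands) for each panel of Figure 17.13.
patterns = {"17.13(a)": (6, 3), "17.13(b)": (6, 2), "17.13(c)": (12, 3)}
factors = {name: reuse_factor(*cfg) for name, cfg in patterns.items()}
# factors == {"17.13(a)": 2.0, "17.13(b)": 3.0, "17.13(c)": 4.0}
```

For a fixed number of sectors, halving the number of subbands doubles the reuse factor, 
which is why two subbands is the preferred case when the antenna rolloff allows it. 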

With present antenna technology at 28 GHz it is possible to achieve sidelobes under 
-33 dB, but the radiation pattern of the deployed antenna is different. The sidelobes are 
significantly higher because of scattering effects such as diffraction, reflection or dispersion 
caused by foliage. The very narrow beamwidth of the subscriber antenna (which at 28 GHz 
can reasonably be made 3° or lower) helps limit the scattering effect. Obviously, the sum of 
such effects depends on the millimeter-wave effects in the specific buildings and terrain in 
the area, and has to be estimated through simulation and measurements for each particular 
case. Another uncertain factor in the estimation of S/I is the fading encountered by the direct 
signal. As a starting assessment, we shall consider the above-mentioned effects accountable 
for increasing the equivalent antenna sidelobe radiation to -25 dB. 

Based on this, the following estimations of S/I are only indicative. For each particular 
deployment the worst case has to be estimated considering the particular conditions. 



17.4.1 Interference estimations 

To allow A, B, C, . . . , type sectorization, in Figure 17.13(a) and (c) the base station antenna 
sidelobe is $\alpha = -25$ dB at an angle more than $2.5\,B_{3dB}$ from boresight, where $B_{3dB}$ is 
the $-3$ dB beamwidth (the main lobe is between $\pm 0.5\,B_{3dB}$). In Figure 17.13(a) and (b), 
$B_{3dB} = 60^\circ$, while in Figure 17.13(c) $B_{3dB} = 30^\circ$. In Figure 17.13(b), better-quality 
antennas are considered, which would achieve the same sidelobe rejection at $1.5\,B_{3dB}$, driving 
frequency reuse higher for the same number of sectors. 

Figure 17.13 Cells with reuse factors of 2, 3 and 4. 

17.4.2 Alternating polarization 

Figure 17.14(a) shows how a high reuse factor of 6 can be achieved in 12 sectors by 
alternating the polarity in sectors. The lines' orientations show the polarization, horizontal 
(H) or vertical (V). The amount of discrimination that can be achieved depends on the 
environment. Although the antenna technology may provide for polarization discrimination 
of 30-40 dB, we shall consider that the combination of depolarization effects raises the 
cross-polarization level to $\rho = -7$ dB. 

The interference is reduced, since the same frequency at the same polarization recurs only in 
the fourth sector. The problem is that a deployment has to allow for gradual sectorization: 
start by deploying minimum equipment, then split into more sectors as demand grows. 
However, if alternating polarization is employed in sectors, the operator would have to visit 
the subscriber sites in order to reorient the antennas. A more conservative approach would 
be to set two large areas of different polarity, as in Figure 17.14(b). This does not reduce the 
close-in sidelobes but reduces overall interference and also helps in the multiple-cell design. 

Figure 17.14 Twelve-sector cells with cross-polarization. 



The following is an approximation for the S/I: 

$$\frac{S}{I} = \frac{S}{\sum_i I_i + N_R} = \frac{S}{\sum_i \alpha_i S_i + \sum_j \rho_j \alpha_j S_j + N_R} \qquad (17.23)$$

where $S_i$ are the transmission powers in other sectors using the same subband, the 
interferers, and $N_R$ is the receiver input noise. Later the case will be considered where all 
transmission powers in sectors are equal, $S_i = S$, and $N_R$ is neglected. $N_1$ and $N_2$ are 
the number of sectors with the same polarization and the cross-polarized ones, respectively. 
As well, the worst case values $\alpha$ and $\rho$ will be taken for the sidelobe gains and cross- 
polarization: 

$$\frac{S}{I} = \frac{1}{\alpha (N_1 + N_2 \rho)} \qquad (17.24)$$

Thus, in a first approximation [24], by applying Equation (17.24), the S/I level (or 
co-channel interference, CCI) at the subscriber receiver is given in Table 17.2. The type of 
modulation and the resulting modem S/I specification have to be specified with sufficient 
margin. A modem receiver operates in a complex environment of challenges, of which the 
S/I or CCI is only one. Other impairments such as equalizer errors, adjacent channel 
interference, phase noise and intermodulation induced by the RF chain limit the S/I with 
which the modem can work in the real operating environment. The above technique can be 
extended to a multiple-cell scenario. Details can be found in Roman [24]. 

Table 17.2 The S/I level at the subscriber receiver 

Configuration        S/I (dB) 
Figure 17.13(a)      25 
Figure 17.13(b)      22 
Figure 17.13(c)      20 
Figure 17.14(a)      22 
Figure 17.14(b)      25 
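
Equation (17.24) is easy to evaluate numerically. The sketch below is illustrative only: the function and parameter names are hypothetical, and the interferer counts used in the example (one, two and three other sectors sharing the subband for Figure 17.13(a), (b) and (c), with no cross-polarized interferers) are an assumption read off the reuse patterns rather than values stated in the text. Under that assumption the results agree with the first three entries of Table 17.2 after rounding.

```python
import math


def db_to_linear(value_db):
    """Convert a power ratio in dB to a linear ratio."""
    return 10 ** (value_db / 10)


def si_db(alpha_db, n_same, n_cross=0, rho_db=0.0):
    """S/I in dB from Equation (17.24): S/I = 1 / (alpha * (N1 + N2 * rho)).

    alpha_db: worst-case sidelobe gain in dB (e.g. -25)
    n_same:   N1, interfering sectors with the same polarization
    n_cross:  N2, interfering cross-polarized sectors
    rho_db:   worst-case cross-polarization level in dB (e.g. -7)
    """
    alpha = db_to_linear(alpha_db)
    rho = db_to_linear(rho_db)
    interference = alpha * (n_same + n_cross * rho)
    if interference <= 0:
        raise ValueError("at least one interferer is required")
    return -10 * math.log10(interference)


# Assumed interferer counts for the patterns of Figure 17.13(a), (b), (c).
for label, n_same in [("17.13(a)", 1), ("17.13(b)", 2), ("17.13(c)", 3)]:
    print(label, round(si_db(-25, n_same), 1))
# 17.13(a) 25.0
# 17.13(b) 22.0
# 17.13(c) 20.2
```

For cross-polarized layouts the same function can be called with `n_cross` and `rho_db=-7`; the counts $N_1$ and $N_2$ then have to be taken from the actual sector layout.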



17.5 SELF-ORGANIZATION IN 4G NETWORKS 
17.5.1 Motivation 

Self-organization is an emerging principle that will be used to organize 4G cellular networks 
[25-40]. It is a functionality that allows the network to detect changes, make intelligent 
decisions based upon these inputs, and then implement the appropriate action, either minimizing 
or maximizing the effect of the changes. Figure 17.15 illustrates a multitier scenario 
where numerous self-organizing technologies potentially could be applied. 

Frequency planning discussed so far in this chapter was performed by choosing a suitable 
reuse pattern. Individual frequencies are then assigned to different base stations according 
to propagation predictions based on terrain and clutter databases. 




Figure 17.15 Multitier scenario with self-organising technologies. (Figure labels: self-organization, radio resource management, base station bunching; global cell, macrocells, microcells, picocells.) 

The need to move away from this type of frequency planning has been expressed in the 
literature [28] as well as being emphasized by ETSI in the selection criteria for the UMTS 
air-interface technique. The main reason for this departure is the need for very small 
cell sizes in urban areas with highly varying morphology, making traditional frequency 
planning difficult. Another reason lies in the difficulty associated with the addition of new 
base stations to the network, which currently requires extensive reconfiguration. In view 
of these two arguments, a desirable solution would require the use of unconfigured base 
transceiver stations (BTSs) at all sites; these BTSs are installed without a predefined set of 
parameters and select their operating characteristics on the basis of information obtained 
from runtime data. For instance, they may operate at all the available carriers and select 
their operating frequencies to minimize mutual interference with other BTSs [28]. 

The increasing demand for data services means that the next generation of communi- 
cation networks must be able to support a wide variety of services. The systems must be 
location- and situation-aware, and must take advantage of this information to dynamically 
configure themselves in a distributed fashion [29]. 

There will be no central control strategy in 4G, and all devices will be able to adapt to 
changes imposed by the environment. The devices are intelligent and clearly employ some 
form of self-organization [30, 31]. So far in our previous discussion, coverage and capacity 
have been the two most important factors in cellular planning. To have good coverage in both 
rural and urban environments is important so as to enable customers to use their terminals 
wherever they go. Coverage gaps mean loss of revenue and can also lead to customers 
moving to a different operator (which they believe is covering this area better). On the other 
hand, some areas may not be economically viable to cover from the operator's point of view 
due to a low population density. Other locations, for instance a sports stadium, may only 
require coverage at certain times of the day or even week. 

Capacity is equally important. Without adequate capacity, users will not be able to 
enter the network even though there might be suitable coverage in the area. Providing the 
correct capacity in the correct location is essential to minimize the amount of infrastructure, 
while ensuring a high utilization of the hardware that has actually been installed. Based on 
the traffic distribution over the duration of a day reported in Lam et al. [32], an average 
transceiver utilization of only 35 % has been estimated. Improving this utilization is therefore 
of great interest. 

A flexible architecture is essential to enable the wide variety of services and terminals 
expected in 4G to co-exist seamlessly in the same bandwidth. In addition, future upgrades 
and reconfigurations should require minimum effort and cost. The initial investment, the 
running costs and the cost of future upgrades are expected to be the three most important 
components in determining the total cost of the system. 



17.5.2 Networks self-organizing technologies 

Capacity, both in terms of bandwidth and hardware, will always be limited in a practical 
communication system. When a cell becomes congested, different actions are possible. 
The cell could borrow resources, bandwidth or hardware, from a neighboring cell. It could 
also make a service handover request to a neighbor in order to minimize the congestion. 
Thirdly, a service handover request could be made to a cell in a layer above or below in 
the hierarchical cell structure. Finally, the cell could try to reduce the path loss to the mobile 
terminal to minimize the impact of other-cell interference. If neighboring cells are unable 
to 'assist' the congested cell, the options left for the cell are to degrade the users' service 
quality (if it is interference limited) or to try and influence the users' behavior. This can 
be achieved through service pricing strategies. The pricing scheme can be regarded as a 
protection mechanism for the network. Since it cannot create capacity, only utilise what it 
already has, it needs to force the users to adapt their behavioral pattern until the network is 
upgraded or there is more capacity available. Self-organizing technologies fall into one of 
these categories. 

17.5.2.1 Bunching of base stations 

In a micro- and picocellular environment there will be severe fluctuations in traffic demand, 
user mobility and traffic types. This highly complex environment will require advanced 
radio resource management (RRM) algorithms and it will be beneficial to have a central 
intelligent unit that can maximize the resource utilization. The bunch concept has been 
proposed as a means to deal with this issue. It involves a central unit (CU) that controls a set 
of remote antennas or base stations (which have very little intelligence). The central units 
will deal with all decisions on channel allocation, service request and handover. Algorithms 
for layers 1 and 2 (such as power control) may be controlled by the remote unit itself. The 
bunch concept can be viewed simply as a very advanced base station with a number of 
small antennas for remote sensing. The central unit will therefore have complete control 
over all the traffic in its coverage area and will be able to maximize the resource utilization 
for the current traffic. This provides opportunities for uplink diversity and avoids intercell 
handovers in its coverage area. The bunch approach will typically be deployed in city 
centres, large buildings or even a single building floor. 

17.5.2.2 Dynamic charging 

The operators need to encourage users to utilize the network more efficiently, something that 
can be achieved through a well thought-out pricing strategy. Pricing becomes particularly 
important for data services such as e-mail and file transfer as these may require considerable 
resources but may not be time-critical. A large portion of e-mails (which are not time- 
critical) could, for example, be sent during off-peak hours, hence improving the resource 
utilization. In this area two main approaches are used: the user utility method and the 
maximum revenue method. 

User utility algorithms assume that the user associates a value to each service level that 
can be obtained. The service level is often referred to as the user's utility function and it 
can be interpreted as the amount the user is willing to pay for a given quality of service. 
It is assumed that the user acts 'selfishly', always trying to maximize their own utility (or 
service). The whole point with a pricing strategy is to enable the operator to predict how 
users will react to it, something which is not trivial. Current proposals do not try to determine 
the exact user's utility function, but rather to postulate a utility function which is based on 
the characteristics of the application or service. Two prime examples are voice and data 
services, which exhibit very different characteristics. Although speech applications are very 
sensitive to time delays, they are relatively insensitive to data errors. Similarly, although 
data services are relatively insensitive to time delays, they are very sensitive to data errors. 

Figure 17.16 An intelligent-relaying overlay. 

The maximum revenue method suggested in Bharghavan et al. [36] is based on letting 
the network optimize its revenue by allocating resources to users in a manner which is 
beneficial for the network. The two main principles are to maximize the resources allocated 
to static flows and to minimize the variance in resources allocated to mobile flows. These 
rules are based on the assumption that, whereas a static user's preference is for maximum 
data rate, mobile users are more concerned with the variance in service quality as they move 
from cell to cell. The actual revenue model is based on a 4-tuple, $\langle A, T, C_a, F \rangle$, namely 
admission fee, termination credit, adaptation credit and revenue function. The network uses 
these parameters to optimize its revenue. Maximum revenue is calculated according to the 
following rules: (1) the flow pays an admission fee once it is granted its minimum requested 
resource allocation; (2) if a flow is prematurely terminated by the network, the latter pays 
the flow a termination credit; (3) the network pays the flow an adaptation credit if the 
resource allocation is changed during the transmission, regardless of whether the allocation 
is increased or reduced; (4) the flow pays a positive but decreasing marginal revenue for 
each extra unit granted by the network. The flow does not pay for resources allocated above 
its maximum requested resource allocation. 
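
The four rules can be turned into a small bookkeeping routine. The following Python sketch is only one possible reading of them: the class and function names are hypothetical, the allocation history is assumed to be a list of resource units granted per accounting interval, and the marginal revenue is assumed to be charged once per interval for each unit above the minimum request. None of these details are specified in the text.

```python
from dataclasses import dataclass
from typing import Callable, Sequence


@dataclass(frozen=True)
class RevenueModel:
    admission_fee: float                      # A
    termination_credit: float                 # T
    adaptation_credit: float                  # C_a
    marginal_revenue: Callable[[int], float]  # F: revenue for the k-th extra unit


def flow_revenue(model: RevenueModel, min_units: int, max_units: int,
                 allocations: Sequence[int], terminated_by_network: bool) -> float:
    """Net revenue the network earns from one flow (illustrative assumptions)."""
    if not allocations or allocations[0] < min_units:
        return 0.0                            # flow was never admitted
    revenue = model.admission_fee             # rule (1)
    previous = None
    for units in allocations:
        if previous is not None and units != previous:
            revenue -= model.adaptation_credit        # rule (3), either direction
        # Rule (4): pay only for units above the minimum, capped at the maximum.
        billable_extra = min(units, max_units) - min_units
        for k in range(1, billable_extra + 1):
            revenue += model.marginal_revenue(k)
        previous = units
    if terminated_by_network:
        revenue -= model.termination_credit           # rule (2)
    return revenue


# Hypothetical numbers: F(k) = 2 / k is positive and decreasing in k.
model = RevenueModel(admission_fee=10.0, termination_credit=5.0,
                     adaptation_credit=1.0, marginal_revenue=lambda k: 2.0 / k)
print(flow_revenue(model, 2, 4, [2, 5, 3], terminated_by_network=False))  # 13.0
print(flow_revenue(model, 2, 4, [2, 5, 3], terminated_by_network=True))   # 8.0
```

In the example the flow is granted 5 units in the second interval but pays for only 2 extra units, since its maximum request is 4.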

Intelligent relaying is a technique that can minimize the amount of planning and the 
number of base stations required in a cellular network. A network employing intelligent 
relaying includes mobiles that are capable of passing data directly to other mobiles, rather 
than directly to a base station, as for a conventional cellular network. In order to plan a 
network incorporating intelligent relaying, it is convenient to consider each mobile as a 
'virtual cell', acting as a base station at its center. The coverage area of this virtual cell, as 
seen in Figure 17.16, can be varied according to the circumstances, as the mobile changes 
its transmit power and according to the mobility of the user. The mobile will set the radius 
of its virtual cell according to the number of other mobiles in the vicinity available to relay 
data; the size of the virtual cell will be minimized to improve frequency reuse. 

Context awareness in 4G can be illustrated by a scenario in which devices such as 
personal digital assistants (PDAs) are able to communicate at short range with a 
number of other devices. These devices might be a fax machine, a computer, a mobile 
phone, a printer or a photocopier. Assuming all these devices have a low-power radio, 
the PDA would be able to engage any of the devices. The task could be printing documents, 
downloading documents, faxing a message or uploading data for storage. To the user, the 
interdevice communication will be transparent. The user will only be concerned with the task 
they are performing, not the devices they are connected with. This sort of communication 
between electronic devices can be achieved through transmission of beacon signals, which 
provide relevant information such as device capability and identification. As manufacturers 
throughout the world are incorporating the Bluetooth technology into their products, this 
concept is about to materialize [41]. 

The context or situation awareness concept can also be exploited in cellular networks. 
In current cellular systems, base stations transmit information on their broadcast control 
channel, which can be used to implement this idea. Assuming this information includes 
$I = [\mathrm{id}, x_{\mathrm{lat}}, y_{\mathrm{long}}, T_x]$ (base station identity, position in latitude and longitude, broadcast 
control channel transmit power), then this would enable the network to reconfigure its base 
station coverage areas when a base station is removed or added to the network. 

Dynamic cell sizing is another emerging technique that has received considerable interest 
in the literature [40]. By dynamically adjusting the coverage areas of the cells, optimum net- 
work performance can be achieved under any traffic conditions. When a single cell is heavily 
occupied, whilst the surrounding cells are lightly loaded, it is possible with this scheme to 
reduce the cell size of the loaded cell and to increase the size of the surrounding cells. In 
this manner, more users can be accommodated in the centre cell. This can be implemented 
in such a way that the base station controls its attachment area by increasing or decreasing 
its beacon transmit power, hence increasing or decreasing the area in which mobiles will 
connect to the base station. Under congestion the cell will contract so as to limit its service 
area and reduce the inter-base-station interference. This will enable it to serve more users 
closer to the base station. In the limit, the cell is so small that the interference contribution of 
neighboring base stations can be ignored and maximum capacity is achieved. In light traffic 
conditions it will expand and hence improve the network coverage with cells overlapping 
the same area. The cell to which the user connects will therefore be a function not only of 
the path loss between the base station and the user but also of the beacon transmit power. 
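
The dependence of cell attachment on both path loss and beacon power can be illustrated with a toy selection rule. The sketch below assumes, as a simplification not stated in the text, that a mobile attaches to the base station whose beacon it receives most strongly (beacon transmit power minus path loss, in dB); the function name and all numbers are hypothetical.

```python
from typing import Dict


def select_cell(beacon_tx_dbm: Dict[str, float], path_loss_db: Dict[str, float]) -> str:
    """Return the base station whose beacon is received most strongly."""
    return max(beacon_tx_dbm, key=lambda bs: beacon_tx_dbm[bs] - path_loss_db[bs])


# A mobile slightly closer to the centre cell than to its neighbour.
path_loss_db = {"centre": 100.0, "neighbour": 104.0}

# Equal beacon powers: the mobile attaches to the centre cell.
print(select_cell({"centre": 30.0, "neighbour": 30.0}, path_loss_db))  # centre

# The congested centre cell lowers its beacon power by 6 dB and so contracts:
# the same mobile now attaches to the neighbouring cell.
print(select_cell({"centre": 24.0, "neighbour": 30.0}, path_loss_db))  # neighbour
```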

Intelligent handover (IH) techniques make handover decisions more intelligent by also considering 
parameters such as resource utilization. A fast-moving mobile user who is currently served 
by a cell may run into problems in the next cell because it is fully congested. An intelligent 
handover algorithm would be able to recognize this problem and try to solve it by handing 
the user to the microcellular layer. The user will stay in this layer until the blocked cell 
has been passed, upon which it will be handed back to the macro layer. Similarly, if there 
is no coverage on the user's home network in the area in which they are moving, then the 
IH algorithm should seek to maintain the connection by handing over to a competitor's 
network or a private network offering capacity to external users. Performance evaluation of 
the above techniques can be found in Spilling et al. [42]. 

18 

Network Management 



18.1 THE SIMPLE NETWORK MANAGEMENT PROTOCOL 

Back in 1988, the simple network management protocol (SNMP) was designed to provide 
an easily implemented, low-overhead foundation for multivendor network management of 
different network resources. The SNMP specification: (1) defines a protocol for exchang- 
ing information between one or more management systems and a number of agents; (2) 
provides a framework for formatting and storing management information; and (3) defines 
a number of general-purpose management information variables, or objects. The model of 
network management that is used for SNMP includes the following key elements: manage- 
ment station, management agent, management information base and network management 
protocol. 

The management station will have, at least: (1) a set of management applications for 
data analysis, fault recovery, and so on; (2) an interface by which the network manager 
may monitor and control the network by communicating with the managed elements of the 
network; (3) a protocol by which the management station and managed entities exchange 
control and management information; and (4) a database holding at least a summary of the 
management information maintained at each of the managed elements in the network. 
Only the last two elements are the subject of SNMP standardization. 

The management agent is a piece of SNMP software in key platforms, such as hosts, 
bridges, routers and hubs, by which they may be managed from a management station. 
The management agent responds to requests for information from a management station, 
responds to requests for actions from the management station, and may asynchronously 
provide the management station with important but unsolicited information. In order to 
manage the resources in a network, these resources are represented as objects. Each object 
is essentially a data variable that represents one aspect of the managed system. The collection 
of objects is referred to as a management information base (MIB). The MIB functions as 
a collection of access points at the agent for the management station; the agent software 
maintains the MIB. These objects are standardized across systems of a particular class (e.g. 
bridges all support the same management objects). A management station performs the 
monitoring function by retrieving the value of MIB objects. A management station can 
cause an action to take place at an agent or can change the configuration settings of an agent 
by modifying the value of specific variables. The management station and agents are linked 
by a network management protocol, which includes the following key capabilities: 

(1) Get - used by the management station to retrieve the values of objects at the agent. 

(2) Set - used by the management station to set the values of objects at the agent. 

(3) Trap - used by an agent to notify the management station of significant events. 

SNMP was designed to be an application-level protocol that is part of the TCP/IP protocol 
suite and typically operates over the user datagram protocol (UDP), although it may also 
operate over TCP. A manager process controls access to the central MIB at the management 
station and provides an interface to the network manager. The manager process achieves 
network management by using SNMP, which is implemented on top of UDP and IP. 

Each agent must also implement SNMP, UDP and IP. In addition, there is an agent 
process that interprets the SNMP messages and controls remote access to the agent's MIB. 
For an agent device that supports other applications, such as FTP, TCP as well as UDP is 
required. From a management station, three types of SNMP messages are issued on behalf of 
a management application: GetRequest, GetNextRequest and SetRequest. The first two are 
variations of the get function. All three messages are acknowledged by the agent in the form 
of a GetResponse message, which is passed up to the management application. In addition, 
an agent may issue a trap message in response to an event that affects the MIB and the 
underlying managed resources. SNMP relies on UDP, which is a connectionless protocol, 
and SNMP is itself connectionless. No ongoing connections are maintained between a 
management station and its agents. Instead, each exchange is a separate transaction between 
a management station and an agent. 
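
The request/response behaviour described above can be mimicked with a small in-memory model. The Python sketch below is not an SNMP implementation: there is no BER encoding, no UDP transport and no community check, and the class and method names are hypothetical. It only illustrates how an agent process might answer GetRequest, GetNextRequest and SetRequest against its MIB with a GetResponse. The error codes are the SNMPv1 error-status values; which code a real agent returns in each situation is not modelled precisely here.

```python
from dataclasses import dataclass
from enum import IntEnum
from typing import Any, Dict, Iterable, List, Tuple

Oid = Tuple[int, ...]  # object identifier as a tuple of integers


class ErrorStatus(IntEnum):
    NO_ERROR = 0
    TOO_BIG = 1
    NO_SUCH_NAME = 2
    BAD_VALUE = 3
    READ_ONLY = 4
    GEN_ERR = 5


@dataclass
class GetResponse:
    request_id: int
    error_status: ErrorStatus
    bindings: List[Tuple[Oid, Any]]


class ToyAgent:
    """Hypothetical agent process holding its MIB as a dictionary."""

    def __init__(self, mib: Dict[Oid, Any], read_only: Iterable[Oid] = ()):
        self.mib = dict(mib)
        self.read_only = set(read_only)

    def _reply(self, request_id, status, oid, value=None):
        return GetResponse(request_id, status, [(oid, value)])

    def get_request(self, request_id: int, oid: Oid) -> GetResponse:
        if oid not in self.mib:
            return self._reply(request_id, ErrorStatus.NO_SUCH_NAME, oid)
        return self._reply(request_id, ErrorStatus.NO_ERROR, oid, self.mib[oid])

    def get_next_request(self, request_id: int, oid: Oid) -> GetResponse:
        # Tuples compare lexicographically, which matches OID ordering.
        later = [candidate for candidate in self.mib if candidate > oid]
        if not later:
            return self._reply(request_id, ErrorStatus.NO_SUCH_NAME, oid)
        nxt = min(later)
        return self._reply(request_id, ErrorStatus.NO_ERROR, nxt, self.mib[nxt])

    def set_request(self, request_id: int, oid: Oid, value: Any) -> GetResponse:
        if oid not in self.mib:
            return self._reply(request_id, ErrorStatus.NO_SUCH_NAME, oid)
        if oid in self.read_only:
            return self._reply(request_id, ErrorStatus.READ_ONLY, oid)
        self.mib[oid] = value
        return self._reply(request_id, ErrorStatus.NO_ERROR, oid, value)


SYS_DESCR = (1, 3, 6, 1, 2, 1, 1, 1, 0)  # MIB-II sysDescr.0
SYS_NAME = (1, 3, 6, 1, 2, 1, 1, 5, 0)   # MIB-II sysName.0
agent = ToyAgent({SYS_DESCR: "toy router", SYS_NAME: "node-1"}, read_only=[SYS_DESCR])

# GetNextRequest walks the MIB without knowing the exact instance names.
print(agent.get_next_request(1, (1, 3, 6, 1, 2, 1, 1)).bindings)
print(agent.set_request(2, SYS_NAME, "node-2").error_status.name)  # NO_ERROR
```

Each call is an independent transaction, mirroring the connectionless exchange described in the text; the `request_id` is simply echoed so that a manager could match responses to outstanding requests.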

The trap-directed polling technique is used if a management station is responsible for a 
large number of agents, and if each agent maintains a large number of objects. In this case 
it becomes impractical for the management station to regularly poll all agents for all of 
their readable object data. Instead, at initialization time, and perhaps at infrequent intervals, 
such as once a day, a management station can poll all the agents it knows of for some 
key information, such as interface characteristics, and perhaps some baseline performance 
statistics, such as average number of packets sent and received over each interface over 
a given period of time. Once this baseline is established, the management station refrains 
from polling. Instead, each agent is responsible for notifying the management station of 
any unusual event. Examples are an agent crash and reboot, the failure of a link, or 
an overload condition as defined by the packet load crossing some threshold. These events 
are communicated in SNMP messages known as traps. 

Once a management station is alerted to an exception condition, it may choose to take 
some action. At this point, the management station may direct polls to the agent reporting 
the event and perhaps to some nearby agents in order to diagnose any problem and to 
gain more specific information about the exception condition. Bearing in mind that traps 
are communicated via UDP and are therefore delivered unreliably, a management station 
may still wish to poll agents at infrequent intervals. All these options should be used in such a way as 
to provide reliable network management with minimum overhead traffic generated in the 
network. 
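
The trap-directed polling procedure can be summarized as a small control loop. The sketch below is a hedged illustration only: agents are modelled as plain Python callables returning a dictionary of counters, and the class name, method names and the `safety_poll` fallback are hypothetical constructs, not part of SNMP.

```python
from typing import Callable, Dict, Iterable, List

PollFn = Callable[[], Dict[str, int]]  # stands in for polling one agent


class TrapDirectedManager:
    def __init__(self, agents: Dict[str, PollFn]):
        self.agents = agents
        self.poll_log: List[str] = []  # records which agents were polled
        # Baseline poll of every known agent at initialization time.
        self.baseline = {name: self._poll(name) for name in agents}

    def _poll(self, name: str) -> Dict[str, int]:
        self.poll_log.append(name)
        return self.agents[name]()

    def on_trap(self, source: str, neighbours: Iterable[str] = ()):
        """Directed polls: the reporting agent and, optionally, nearby agents."""
        return {name: self._poll(name) for name in [source, *neighbours]}

    def safety_poll(self):
        """Infrequent full poll, since traps sent over UDP may be lost."""
        return {name: self._poll(name) for name in self.agents}


agents = {
    "a": lambda: {"packets_in": 10},
    "b": lambda: {"packets_in": 900},
    "c": lambda: {"packets_in": 12},
}
manager = TrapDirectedManager(agents)
print(manager.poll_log)              # ['a', 'b', 'c']
manager.on_trap("b", neighbours=["c"])
print(manager.poll_log[3:])          # ['b', 'c']
```

Between the baseline poll and the trap, the manager generates no polling traffic at all, which is the point of the technique.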

The use of SNMP requires that all agents, as well as management stations, must support 
UDP and IP. This limits direct management to such devices and excludes other devices, such 
as some bridges and modems, that do not support any part of the TCP/IP protocol suite. 
Furthermore, there may be numerous small systems (personal computers, workstations, 
programmable controllers) that do implement TCP/IP to support their applications, but 
for which it is not desirable to add the additional burden of SNMP, agent logic and MIB 
maintenance. 

To accommodate devices that do not implement SNMP, the concept of proxy was de- 
veloped. In this scheme an SNMP agent acts as a proxy for one or more other devices; that 
is, the SNMP agent acts on behalf of the proxied devices. Figure 18.1 indicates the type of 
protocol architecture that is often involved. The management station sends queries concern- 
ing a device to its proxy agent. The proxy agent converts each query into the management 
protocol that is used by the device. When a reply to a query is received by the agent, it 
passes that reply back to the management station. Similarly, if an event notification of some 
sort from the device is transmitted to the proxy, the proxy sends that on to the management 
station in the form of a trap message. The format of the SNMPv1 message is given in 
Figure 18.1. More data for versions v2-v3 and any additional detail on SNMP protocols 
can be found in References [1-6]. 

In a traditional centralized network management scheme, one host in the configuration 
has the role of a network management station; there may possibly be one or two other 
management stations in a backup role. The remainder of the devices on the network contain 
agent software and an MIB, to allow monitoring and control from the management station. 
As networks grow in size and traffic load, such a centralized system is unworkable. Too 
much burden is placed on the management station, and there is too much traffic, with reports 
from every single agent having to wend their way across the entire network to headquarters. 
In such circumstances, a decentralized, distributed approach works better. In a decentralized 
network management scheme, there may be multiple top-level management stations, which 
might be referred to as management servers. Each such server might directly manage a 
portion of the total pool of agents. However, for many of the agents, the management server 
delegates responsibility to an intermediate manager. The intermediate manager plays the 
role of manager to monitor and control the agents under its responsibility. It also plays 
an agent role to provide information and accept control from a higher-level management 
server. This type of architecture spreads the processing burden and reduces total network 
traffic and will be discussed in more detail in the next section. 

The common management information protocol (CMIP) [8, 9] was proposed as a standard 
to supersede SNMP. The standards are exhaustive in their specifications and include almost 
every possible detail for network management functions. CMIP was specified to work over 
the OSI [Chapter 1, 10] protocol stack. The drawbacks of CMIP are: (1) it is complex to 
use and implement; (2) it requires many resources to execute; (3) it has high overhead; and 
(4) few networks use the OSI protocol suites. 

The LAN/MAN management protocol (LMMP) [8] was developed as a network management 
solution for LANs. LMMP was built over the IEEE 802 logical link control (LLC) layer. 
Therefore, it is independent of the network layer protocol. Functionally, it is equivalent to 
common management information service over IEEE 802 LLC (CMOL). The advantages 
of LMMP are that it is easier to implement and protocol-independent. The disadvantages 
are that LMMP messages cannot go beyond a LAN boundary because LMMP does not use 
any network layer facilities. LMMP agents are also not dynamically extensible. 

Figure 18.1 (a) Position of SNMP in the protocol stack; (b) proxy protocol architecture; (c) 
SNMP formats (reproduced by permission of IEEE [47]); (d) SNMP message 
and PDU fields (reproduced by permission of IEEE [47]). 

(d) 

Field              Description 

Version            SNMP version; RFC 1157 is version 1 

Community          A pairing of an SNMP agent with some arbitrary set of SNMP application 
                   entities. The name of the community functions as a password to authenticate 
                   the SNMP message 

Request-id         Used to distinguish among outstanding requests by providing each request 
                   with a unique ID 

Error-status       Used to indicate that an exception occurred while processing a request. 
                   Values are: noError (0), tooBig (1), noSuchName (2), badValue (3), 
                   readOnly (4), genErr (5) 

Error-index        When error-status is nonzero, error-index may provide additional 
                   information by indicating which variable in a list caused the exception. A 
                   variable is an instance of a managed object 

Variable-bindings  A list of variable names and corresponding values. In some cases (e.g. 
                   GetRequest-PDU), the values are null 

Enterprise         Type of object generating trap; based on sysObjectID 

Agent-addr         Address of object generating trap 

Generic-trap       Generic trap type. Values are: coldStart (0), warmStart (1), linkDown (2), 
                   linkUp (3), authenticationFailure (4), egpNeighborLoss (5), 
                   enterpriseSpecific (6) 

Specific-trap      Specific trap code 

Time-stamp         Time elapsed between the last (re)initialization of the network 
                   entity and the generation of the trap; contains the value of sysUpTime 

Non-repeaters      Indicates how many listed variables are to return just one value each 

Max-repetitions    Indicates number of values to be returned for each of the remaining variables 

The telecommunications management network (TMN) [11] was built over the OSI ref- 
erence model, but it includes support for signaling system number 7, TCP/IP, ISDN, X.25 
and 802.3 LAN-based networks. TMN has some advantages over the existing standards. 
For instance, TMN supports a larger number of protocol suites, incorporates features from 
existing management protocols, has a wider domain of acceptance, and is 'future proof'. 
The disadvantages are that large amounts of processing are required, and agents are not 
extensible at run time. 



18.2 DISTRIBUTED NETWORK MANAGEMENT 

As already indicated in Section 18.1, centralized network management systems (NMSs) 
are clients to management agents (servers) residing permanently in each managed network 
element (NE). Although adequate for most practical management applications, the limita- 
tions of SNMP, such as the potential processing and traffic bottleneck at the NMS, have 
been recognized for many years. 

The inefficiency can be reduced by distributing network management functions to a 
hierarchy of mid-level managers, each responsible for managing a portion of the entire 
network, as in SNMPv2. Subnetworks can be managed in parallel, reducing the traffic and
processing burden on the highest level NMS. Typically, the distribution of network man- 
agement functions and the organization of managers are fairly static. It has been observed 
that decentralizing network management functions may achieve several benefits [12-22]. 

(1) network traffic and processing load in the NMS can be both reduced by performing 
data processing closer to the NEs; 

(2) scalability to large networks is improved; 

(3) searches can be performed closer to the data, improving speed and efficiency; and 

(4) distributed network management is inherently more robust without depending on 
continuous communications between the NMS and NEs. 

In general in this context, the network management approaches will be classified as shown 
in Figure 18.2. In Figure 18.2(a), the client-server (CS) model represents the traditional 
SNMP paradigm where a centralized NMS polls a network of network elements. The 
communications between the NMS (client) and agents (server) is characterized by pairs 
of query-response messages for every interaction. Figure 18.2(b) represents a hierarchical 
static (HS) approach modeled as mid-level managers, each managing a separate subnetwork 
of network elements. 

A two-level hierarchy is considered here, although multiple hierarchical layers may be 
possible. Each subnetwork is managed in a client-server manner and mid-level managers 
may communicate with a centralized high-level NMS as needed. 

Figure 18.2 Centralized and distributed network management approaches: (a) client-server;
(b) hierarchical static; (c) weak mobility; and (d) strong mobility.

In the weak mobility (WM) approach, the NMS distributes code to specific NEs where the
code is executed, as shown in Figure 18.2(c). After performing its specific task, the code will 
typically report results back to the NMS and expire at the NE. During execution, the code 
does not have a capability for autonomous migration to other NEs. In the strong mobility 
(SM) approach, the NMS dispatches one or more agents to carry out a specific task, as shown 
in Figure 18.2(d). Agents have the capability to autonomously travel (execution state and 
code) among different NEs to complete their tasks. The route may be predetermined or 
chosen dynamically, depending on the results at each NE. 

18.3 MOBILE AGENT-BASED NETWORK MANAGEMENT 

As already discussed in Chapters 1 and 16, 4G itself will become a competitive mar- 
ketplace with a diversity of vendors, operators and customers. In this environment, users 
will be able to choose from a wide range of network services, which provide different 
bandwidth requirements and various QoS, and which will operate under different pricing 
schemes. Technical and market forces are driving the evolution of 4G toward the traded 
resource service architecture (TRSA) model, whereby network resources such as band- 
width, buffer space and computational power will be traded in the same manner as existing 
commodities. 

In such complicated environments, users, whether buyers or sellers, need tools that facil- 
itate expertise brokering activities such as buying or selling the right products, at the right 
price and at the right time. The brokering process will be guided by user preferences, which 
need to be processed according to some customized logic. The mobile agent technology 
seems to be a feasible option for dealing with these issues [23-34]. In other words, ad hoc
software sent across the network may allow network resources to be used more effectively,
as they can be directly controlled at the application level. 

In order to provide each customer with the opportunity of implementing his/her own 
policy, trading costs with service quality according to his/her own preferences and capacities, 
network service must be open, flexible and configurable upon demand. A first step toward 
the realization of a programmable network is that of using an agent-based approach, in order 
to obtain a faster time scale of service deployment. A major incentive for an agent-based 
approach is that policies can be implemented dynamically, allowing for a resource-state-based
admission and reservation strategy. Agents are used to discover the resources
available inside the network and claim resources on behalf of customers according to some 
'figures of merit' [25], which represent tradeoffs between bandwidth claimed and loss risk 
incurred due to high utilization. Different customers may pay for resources in a different 
way, negotiating the costs for obtaining a certain 'figure of merit'. Agents are able to 
trigger adaptation of applications inside the network on behalf of customers. This allows 
for an immediate response to resource shortages, decreases the amount of useless data 
transported and reduces the signaling overhead. Mobile agents [26] provide the highest 
possible degree of flexibility and can carry application-specific knowledge into the network 
to locations where it is needed, following an approach similar to the one shown in some 
other programmable network projects discussed in Chapter 16 [23, 27]. 

As already indicated in Section 18.2, by adopting mobile code technology in network 
management it is possible to overcome some limitations of the centralized management 
approach. In general, such technology is based on the idea of reversing the logic according 
to which the data produced by network devices are periodically transferred to the central
management station. Management applications can then be moved to the network devices,
thus performing some micromanagement operations locally, and reducing the workload
for the management station and the traffic overhead in the network. 

The code on demand paradigm, used in this concept, relies on a client which can dynamically
download and execute code modules from a remote code server. The client needs no
code installed in advance, except for the runtime system that supports these mechanisms.
Java applets are a very common example of this type of technology. The use of an
approach based on code on demand increases the flexibility of the system, and keeps
agents simple and small at the same time.

The remote evaluation paradigm is based on the idea that a client sends a procedure to be
executed on a remote host, where it runs to completion; the results are then returned
to the client host that sent the code. This paradigm addresses the bandwidth waste that
occurs in a centralized system when micromanagement operations are needed. In a
traditional system, whatever processing has to be performed on the data of a device, the
data have to be moved from the SNMP agent to the NMS, where they are processed.
Conversely, thanks to the mechanism of remote evaluation, the actions to be performed can
be developed and sent to the remote device, where they are executed without generating
traffic in the network (except the initial traffic for sending the code to be executed). A
typical example is the computation of the so-called health functions [24]. In general, by
health function we mean an expression consisting of several elementary management
variables. Each variable provides a particular measure concerning the device to be
controlled. By using a technology based on remote evaluation, we can compute these
functions directly on the devices, and only when necessary. The code that performs such
computations is sent by the NMS and dynamically executed. This approach yields what is
called 'semantic compression of data'. In general the manager of a network is not
interested in the single value of an MIB variable, but in aggregate values containing
higher-level 'information'. We can, therefore, develop a system in which the manager writes
its management functions (or they might be already available), and then invokes their remote
execution on specific devices, when necessary.
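The idea of computing a health function on the device and returning only the aggregate can be sketched as follows. This is a minimal illustration, not the mechanism of any particular platform: the variable names (`in_octets_per_s`, `out_octets_per_s`, `if_speed_bps`), the choice of link utilization as the health function, and the `evaluate_remotely` helper are all assumptions made for the example.

```python
from typing import Callable, Dict

MibSnapshot = Dict[str, float]


def link_utilization(mib: MibSnapshot) -> float:
    """Hypothetical health function: fraction of the link capacity in use.

    It combines three elementary management variables into one aggregate value.
    """
    bits_per_s = 8.0 * (mib["in_octets_per_s"] + mib["out_octets_per_s"])
    return bits_per_s / mib["if_speed_bps"]


def evaluate_remotely(health_fn: Callable[[MibSnapshot], float],
                      local_mib: MibSnapshot) -> float:
    """Stands in for execution on the device: the raw variables stay local
    and only the single aggregate result travels back to the NMS."""
    return health_fn(local_mib)


device_mib = {
    "in_octets_per_s": 250_000.0,
    "out_octets_per_s": 125_000.0,
    "if_speed_bps": 10_000_000.0,
}
utilization = evaluate_remotely(link_utilization, device_mib)
print(f"utilization reported to the NMS: {utilization:.2f}")
```

In this sketch three raw values are reduced to one number before anything crosses the network, which is the 'semantic compression' effect described above.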

The mobile agent paradigm adds further benefits to those obtained with remote
evaluation. With remote evaluation, the action on a device is always expressly started by the NMS.
Conversely, in the case of mobile agents, the ability to store the state during the movements 
allows applications to be developed in which the agent moves from one device to another, 
performing the management functions required. In addition, the agent can be delegated the 
task of deciding where and when to migrate according to its current state, thus reducing the 
interaction with the NMS, and making processing more distributed. 



18.3.1 Mobile agent platform 

Different platforms have been designed for the development and the management of mobile 
agents that give all the primitives needed for their creation, execution, communication, 
migration, etc. [24, 32, 35, 36]. The following agents are used: 

The browser agent (BA) collects some MIB variables from a set of nodes specified by
the user. Both the variables of the MIB tree to be collected and the servers to be visited
are selected through an appropriate dialog box of the application 'SNMP-monitoring'.
After being started, the agent reaches the first node to be visited, opens a local SNMP
communication session, builds a request PDU containing the MIB variables to be retrieved,
waits for the reply from the SNMP daemon, and saves the data obtained in an internal
structure. Then, if other nodes need to be visited, it reaches them and repeats the procedure
mentioned above. Otherwise, it returns to the platform from which it was launched,
where the results of the search are presented.
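The itinerary followed by a browser agent can be sketched roughly as below. The `NETWORK` dictionary is a stand-in for the SNMP daemons on the nodes, and the class and node names are hypothetical; a real agent platform would migrate code and state between hosts rather than loop inside one process.

```python
from typing import Dict, List

# Stand-in for the SNMP daemons: node name -> MIB variables readable locally.
NETWORK: Dict[str, Dict[str, int]] = {
    "node-a": {"sysUpTime": 1200, "ifInOctets": 5000},
    "node-b": {"sysUpTime": 300, "ifInOctets": 800},
}


class BrowserAgent:
    """Visits a list of nodes and collects the requested MIB variables."""

    def __init__(self, itinerary: List[str], variables: List[str]) -> None:
        self.itinerary = itinerary
        self.variables = variables
        # Internal structure carried by the agent from node to node.
        self.results: Dict[str, Dict[str, int]] = {}

    def run(self) -> Dict[str, Dict[str, int]]:
        for node in self.itinerary:
            local_mib = NETWORK[node]  # local session with the node's daemon
            self.results[node] = {name: local_mib[name] for name in self.variables}
        # After the last node the agent returns to its launching platform.
        return self.results


agent = BrowserAgent(["node-a", "node-b"], ["sysUpTime"])
collected = agent.run()
print(collected)
```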

The daemon agent (DA) monitors a 'health function' defined by the user. To start
the agent, the user must provide the function to be computed and the network node where
the computation is to be done. The agent then moves to that node, where it
records the value of the function; if the value is higher than a fixed threshold (defined by
the user), a notification message is sent to the server from which the agent departed.
Both the BA and the DA interact directly with the SNMP daemon present in the
different nodes.
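The threshold test performed by a daemon agent can be illustrated with a small sketch. The health function `error_ratio`, its variable names and the list of snapshots are assumptions chosen for the example; the returned list stands in for the notification messages sent back to the launching server.

```python
from typing import Callable, Dict, Iterable, List

MibSnapshot = Dict[str, float]


def error_ratio(mib: MibSnapshot) -> float:
    """Hypothetical health function: share of received packets that were in error."""
    return mib["in_errors"] / mib["in_packets"]


def daemon_agent(health_fn: Callable[[MibSnapshot], float],
                 snapshots: Iterable[MibSnapshot],
                 threshold: float) -> List[float]:
    """Evaluate the health function on each local reading and keep the
    values that exceed the user-defined threshold (one notification each)."""
    notifications: List[float] = []
    for mib in snapshots:
        value = health_fn(mib)
        if value > threshold:
            notifications.append(value)
    return notifications


readings = [
    {"in_errors": 1.0, "in_packets": 100.0},
    {"in_errors": 10.0, "in_packets": 100.0},
]
alerts = daemon_agent(error_ratio, readings, threshold=0.05)
print(alerts)
```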

The messenger agent (MA), during its migration through the nodes of the network, 
interacts with other agents for collecting specific information produced by them. During 
the configuration, the agents to be contacted and the servers where they have to be searched 
need to be selected and (if necessary) also the number of times the agent has to contact 
such agents. Thus, the MA performs operations at a higher abstraction level than the mere 
retrieval of MIB variables. In fact, since DAs can perform the computation of any function 
on the different nodes of the network, the messenger allows collection of such information, 
thus obtaining a general description about the state of the network. 

The verifier agent does not perform an actual network management action. Its task is that 
of collecting important information, which might be useful for further operations of network 
management, from remote network devices to the starting host. It visits the nodes selected 
during the configuration, and computes a function whose purpose is the evaluation of the 
presence of a specific property of the system visited (for example, a software version, or the 
available space on disk, or the verification of some log files, etc.). The verifier agent then 
reports to the server, from which it departed, the list of the nodes that satisfy this property. 



18.3.2 Mobile agents in multioperator networks 

In 4G networks, many types of providers will usually be involved in order to complete
the end-to-end service offering. Specifically, the SP (service provider) is responsible for
the definition and delivery of the service characteristics, while the NP (network provider)
provides the network infrastructure (i.e. high-speed network). In such an arrangement, 
the SP is essentially a customer of the NP, while the SP provides the service to its own 
customers or end-users (usually multiple customers with small to medium size). As a means 
of competition, many different NPs offer access to a remote CP (content provider) which 
provides multimedia services such as voice, data and video. Similarly, the SP is capable of 
accessing many different NPs to request service. The various network operators (providers) 
will then be competing to sell their network links to clients through a representative agent 
host, the network access broker (NAB)/the internetwork access broker (INAB). 

A scenario that involves three competing NPs and two SPs is illustrated in Figure 18.3. 
The three networks are owned by three different operators, and each one of them is respon- 
sible for resource allocation and pricing strategies within its own environment. 

Moreover, the three networks have some interconnection points with each other, therefore 
allowing traffic to flow among the different networks, as expected in an open marketplace. 
Figure 18.3 Network model for agent brokering environment.

The SP is informed periodically about the cost changes associated with each of those 
interconnection points. As a means of competition, all three networks offer the same access 
to the remote CP. The SP serving a particular client is responsible for identifying the 
best/optimal connection path (routing) per request of the customer. 

In Chapter 7, QoS routing with multiconstraint has been shown to be an NP-hard problem 
[34]. For this reason, in this section we additionally consider a genetic algorithm (GA), 
which presents a good method to handle multiconstraint optimization problems and does 
not depend on the properties of the problem itself [37, 38]. The remaining issue is how the 
underlying agent architecture may interact with the genetic algorithm itself. It is assumed that 
an SP may host a particular kind of agent [named broker agent (BrkA)] which is in charge 
of identifying the optimal path to manage a specific connection request. The interaction 
between a BrkA and the algorithm may occur according to the following strategies: 

(1) The BrkA is able to execute the algorithm at run time upon the request from the PC
client.

(2) The BrkA sends a request to a network node where the genetic algorithm can be
executed. The optimal path is then sent back to the BrkA, which activates the setup
procedure.

(3) A set of optimal paths for different pairs of PC client and CPs is stored in a database
(possibly distributed), which is accessed by the BrkA to retrieve the most convenient
path to satisfy the specific request. Once the connection is established, the
genetic algorithm can be re-executed in order to identify a more convenient path, if
one exists.

If network performance and reliability change for some reason, monitoring agents
distributed in the system will promptly react and pass the new data to the genetic algorithm
so that it can recompute the optimal paths.

18.3.3 Integration of routing algorithm and mobile agents 

When network conditions change, the SP always wants to find a good route for its customers
in real time. That is, with certain QoS constraints, the SP wants to find a cost-reasonable route
for its customers. The routing algorithm in the SP needs to know:

(1) traffic from SP to CP; 

(2) QoS requirements/constraints (i.e. time delay constraint from SP to CP); 

(3) connectivity of nodes of NPs;

(4) bandwidth available for each link between nodes; 

(5) cost for the traffic to pass through each link; and 

(6) time delay for the traffic to pass through each link. 

The use of mobile agents provides the complementary underlying structure in order to 
obtain this information in a distributed and efficient manner. In Section 18.3.1, we provided 
a brief description of how the agents can be used to deal with the collection of information 
[31, 33] about the state of the network and the monitoring of health functions that can 
be defined by the SP. The definition and creation of BAs and DAs serve the purpose of 
collecting specific variables from a set of nodes (e.g. NABs and INABs) specified by the 
SP. Then, MAs, during their migration through specific nodes of the network, interact with 
the other agents for collecting specific information produced by them and, therefore, obtain 
a general description about the state of the network. Moreover due to the mechanisms of 
remote evaluation and mobile agents described in the previous sections, the actions to be 
performed can be developed by the BrkA of a specific SP and sent to other remote devices 
where they will be executed on behalf of the BrkA, therefore limiting the generated traffic in 
the network and distributing the computational load and effort. Once the data are collected 
the routing algorithms can be executed. 

A number of these algorithms were discussed in Chapter 6. For this reason in this section
we additionally discuss only the GA. In general, a GA is a stochastic search process
over the solution space that emulates biological selection and reproduction. It has been
used to solve various optimization problems [38]. In this section, we discuss a genetic-based
algorithm in order to address the problem described in the previous sections. In the following,
we describe the different features and phases of the algorithm [35].

Encoding is used to map the search space of the specific problem into the space where 
GA can operate. In the literature, related work in using GA as an optimization tool to 
solve the famous Traveling Salesman Problem [37] has been reported. There are several 
encoding approaches mentioned in the literature, such as adjacency representation, ordinal 
representation and path representation. Some encoding approaches are not suitable for GA 
operation (crossover and mutation), and some other encoding approaches are inefficient 
in searching the solution space. In this section, the path representation approach is used
to encode a route because it is easy to implement. That is, all the nodes that the route
passes through are listed in sequence. In order to encode the route in a fixed data structure, 
'zeroes' are filled into the empty space of the code. 
For instance, in a scenario with 10 nodes in addition to the CP and SP, an array with 10
elements can be used to represent the route. When the route passes through fewer than
10 nodes, the unused elements are set to zero. For example, [1 2 3 4 0 0 0 0 0 0] is
used to represent the SP-1-2-3-4-CP route.
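The path representation with zero padding can be sketched in a few lines. The function names and the constant `MAX_NODES` are illustrative; the array length of 10 follows the example in the text, and node identifiers are assumed to be non-zero integers.

```python
from typing import List

MAX_NODES = 10  # fixed array length, as in the 10-node example


def encode_route(nodes: List[int], length: int = MAX_NODES) -> List[int]:
    """List the intermediate nodes in visiting order and pad with zeroes."""
    if len(nodes) > length:
        raise ValueError("route has more nodes than the fixed array can hold")
    return list(nodes) + [0] * (length - len(nodes))


def decode_route(code: List[int]) -> List[int]:
    """Recover the node sequence by dropping the zero padding."""
    return [node for node in code if node != 0]


code = encode_route([1, 2, 3, 4])  # the SP-1-2-3-4-CP route
print(code)
print(decode_route(code))
```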

Population initialization is used to start GA computation. In the population initialization 
process, the number of nodes the route will pass through, which node will be in the route 
and the sequence of the nodes of the route can be randomly determined. However, there will 
be some solutions that may violate constraints of delay or interconnectivity. The 'penalty 
method' is used to deal with these constraints, as follows: 

(1) for those links that do not exist, a very large delay value is assigned; and 

(2) for those routes that violate the delay constraint, a penalty to their cost is added. 

In the algorithm, the following expression is used to evaluate the weighted cost of those
illegal routes:

$$C'(r) = C(r)\left[\alpha + \frac{D(r)}{D_{\max}}\right]$$

where $C'(r)$ is the weighted cost of route $r$, $C(r)$ is the function that evaluates the total cost
of the links that the route may pass through, $D(r)$ is the function that evaluates the time
delay of the route, $D_{\max}$ is the upper bound of the time delay constraint, and $\alpha$ is the penalty
constant (in the algorithm it is set to 2).
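A small sketch of the penalty method follows. The value of `NO_LINK_DELAY`, the dictionary layout of the delay table and the function names are assumptions for illustration; returning the unmodified cost for routes that respect the delay bound is also an assumption, since the text defines the weighted cost only for illegal routes.

```python
from typing import Dict, Hashable, Sequence, Tuple

Link = Tuple[Hashable, Hashable]
NO_LINK_DELAY = 1e6  # stand-in for the "very large delay" of a missing link


def route_delay(path: Sequence[Hashable], delay: Dict[Link, float]) -> float:
    """Sum the link delays along the path; missing links get a very large delay."""
    return sum(delay.get(link, NO_LINK_DELAY) for link in zip(path, path[1:]))


def weighted_cost(cost: float, delay: float, d_max: float, alpha: float = 2.0) -> float:
    """Penalized cost: cost * (alpha + delay / d_max) when the delay bound is violated."""
    if delay <= d_max:
        return cost  # assumption: legal routes keep their plain cost
    return cost * (alpha + delay / d_max)


delays = {("SP", 1): 20.0, (1, "CP"): 30.0}
print(route_delay(["SP", 1, "CP"], delays))  # existing links only
print(route_delay(["SP", 2, "CP"], delays))  # both links are missing
print(weighted_cost(10.0, 150.0, 100.0))     # delay bound violated
```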

The fitness F(r) of the solutions is proportional to their survivability during the algorithm 
computation. The selection operation, defined below, will use these values to keep 'good' 
solutions and discard the 'bad' solutions. For simplicity, we can use the cost of each route 
as the fitness of each solution. In order to prevent those solutions with very low fitness from 
being discarded by selection operation of GA at the first several computation loops of GA, 
the value of fitness of those 'bad' solutions is increased. In this way, some 'bad' solutions 
still have chances to survive at the beginning of the evolution of GA and the 'good' parts in 
them will have chances to transfer to new generations. For convenience, we normalize the 
fitness of solutions to [0,1] by the following expression:

$$F(r) = \frac{[C_{\max} - C(r)]\,C_{\min}}{(C_{\max} - C_{\min})\,C(r)}$$

where $C_{\max}$ and $C_{\min}$ are the maximum and minimum cost of routes in each generation of
population, respectively.
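The normalization can be checked numerically with the sketch below, which maps the cheapest route of a generation to fitness 1 and the most expensive one to fitness 0. The function name is illustrative, costs are assumed to be strictly positive, and the handling of a generation in which all costs are equal is an assumption that the text does not cover.

```python
from typing import List


def normalized_fitness(costs: List[float]) -> List[float]:
    """F(r) = (C_max - C(r)) * C_min / ((C_max - C_min) * C(r)) for each route cost."""
    c_max, c_min = max(costs), min(costs)
    if c_max == c_min:
        # Assumption: when every route costs the same, treat all as equally fit.
        return [1.0] * len(costs)
    return [(c_max - c) * c_min / ((c_max - c_min) * c) for c in costs]


fitness = normalized_fitness([10.0, 20.0, 40.0])
print(fitness)
```

Because of the factor $C_{\min}/C(r)$, the mapping is not linear in the cost: a route that costs twice the minimum already falls to a fitness of one third in this example.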

The selection operation keeps 'good' solutions while discarding 'bad' ones.
This operation tries to simulate 'natural selection' in real life. Readers with a background
in communications theory will easily recognize the similarities between GA and the Viterbi
algorithm (VA) used to approximate maximum likelihood (ML) demodulators.

Two selection operators are used in the algorithm. The first one is based on the 'fitness 
proportional model'. It is also called 'Monte Carlo selection'. The algorithm is as follows: 

(1) add the fitness of all solutions; 

(2) randomly generate a number between zero and the sum of fitness; this number is 
called the pointer of the Monte Carlo wheel; and 

(3) add the fitness of each solution one by one until the value is greater than the pointer.
The last solution added is then selected.

Using this algorithm, the higher the fitness value, the bigger the chance of that solution
being chosen. The second selection operator used in the algorithm is the 'best solution
reservation'. The best solution in the population will always survive and several duplicated
copies will be generated for mutation operation. In this way, the GA will always converge
to a certain 'good' solution. Moreover, there will be a good chance to find a better solution
on the basis of the best solution of each generation.
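The two selection operators can be sketched as follows. The function names are illustrative, the number of reserved copies (`copies=3`) is an assumption since the text only says 'several', and the fallback used when the pointer lands exactly on the total is a guard added for the example.

```python
import random
from typing import List, Optional


def roulette_select(fitness: List[float], rng: Optional[random.Random] = None) -> int:
    """Fitness-proportional ('Monte Carlo wheel') selection; returns an index."""
    if rng is None:
        rng = random.Random()
    total = sum(fitness)                     # step (1): add the fitness of all solutions
    pointer = rng.uniform(0.0, total)        # step (2): pointer of the wheel
    running = 0.0
    for index, value in enumerate(fitness):  # step (3): accumulate until past the pointer
        running += value
        if running > pointer:
            return index
    # Only reached if the pointer equals the total; fall back to the fittest solution.
    return max(range(len(fitness)), key=lambda i: fitness[i])


def reserve_best(population: List[List[int]], fitness: List[float],
                 copies: int = 3) -> List[List[int]]:
    """'Best solution reservation': duplicate the best solution for later mutation."""
    best = population[max(range(len(fitness)), key=lambda i: fitness[i])]
    return [list(best) for _ in range(copies)]


rng = random.Random(0)
counts = [0, 0]
for _ in range(2000):
    counts[roulette_select([0.9, 0.1], rng)] += 1
print(counts)  # the first solution is chosen far more often than the second
```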

Crossover operation enables any pair of solutions in the population to have a chance
to exchange part of their solution information with each other. Therefore, the 'good' parts
of different solutions may be combined to create a new, better solution. The two
original solutions are the 'parents' of the new solutions generated. There are many crossover
operators designed to solve different problems. In this section, we use the traditional one-point
crossover method. That is, we choose a certain point of the array and exchange the parts
of the two arrays that follow the cross point to generate two new solutions. However, because
the number of nodes the route may pass through is not fixed, it is difficult to determine a fixed
crossover point. In Papavassiliou et al. [35] the crossover point is determined by
$\lfloor (A + B)/4 \rfloor$, where A and B are the numbers of nodes that the two routes pass through,
respectively, and the operator $\lfloor \cdot \rfloor$ rounds down to the nearest integer. For example,
suppose that before the crossover operation there are two routes: [1 2 3 4 5 0 0 0 0 0] and
[6 7 8 0 0 0 0 0 0 0]. According to this procedure ($\lfloor (A + B)/4 \rfloor = \lfloor (5 + 3)/4 \rfloor = 2$),
after the crossover we get [1 2 8 0 0 0 0 0 0 0] and [6 7 3 4 5 0 0 0 0 0].
Note that only part of the population will experience the crossover operation; this fraction
is called the crossover rate.
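The crossover rule can be reproduced with a short sketch that uses the two routes from the example above. The function name is illustrative; routes are assumed to be zero-padded arrays of equal length.

```python
from typing import List, Tuple


def one_point_crossover(parent_a: List[int], parent_b: List[int]) -> Tuple[List[int], List[int]]:
    """Swap the tails of two zero-padded routes at point floor((A + B) / 4)."""
    a = sum(1 for node in parent_a if node != 0)  # nodes used by the first route
    b = sum(1 for node in parent_b if node != 0)  # nodes used by the second route
    point = (a + b) // 4
    child_a = parent_a[:point] + parent_b[point:]
    child_b = parent_b[:point] + parent_a[point:]
    return child_a, child_b


first, second = one_point_crossover(
    [1, 2, 3, 4, 5, 0, 0, 0, 0, 0],
    [6, 7, 8, 0, 0, 0, 0, 0, 0, 0],
)
print(first)
print(second)
```

A child produced this way may contain a zero before a non-zero node or a repeated node when the parents overlap, which is why a repair step is applied afterwards.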

The mutation operation randomly chooses a solution in the population and then slightly
changes it to generate a new solution. In this way, there is a chance to find a better solution
that cannot be found by the crossover operation alone. In the algorithm, four mutation operators
are used:

(1) randomly delete a node from a route; 

(2) randomly add a node into a route; 

(3) randomly delete two nodes from a route; and 

(4) randomly add two nodes into a route. 

Only part of the population will experience the mutation operation (this is characterized by
the mutation rate). In order to enhance the local searching ability, the copies of the best
solution of each generation are all treated by the mutation operator. In this way, the GA may
find a better solution that is 'close' to the best solution of each generation.
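The four mutation operators can be sketched as below. Several details are assumptions made for the example: the operator is picked uniformly at random, at least one intermediate node is always kept, and added nodes are drawn only from nodes not yet on the route (the algorithm in the text instead allows duplicates and removes them in the repair step).

```python
import random
from typing import List


def mutate(route: List[int], all_nodes: List[int], rng: random.Random,
           length: int = 10) -> List[int]:
    """Apply one of the four operators: delete or add one or two nodes."""
    nodes = [node for node in route if node != 0]
    operator = rng.choice(["delete_one", "add_one", "delete_two", "add_two"])
    repetitions = 1 if operator.endswith("one") else 2
    for _ in range(repetitions):
        if operator.startswith("delete"):
            if len(nodes) > 1:  # assumption: keep at least one intermediate node
                nodes.pop(rng.randrange(len(nodes)))
        else:
            unused = [node for node in all_nodes if node not in nodes]
            if unused and len(nodes) < length:
                nodes.insert(rng.randrange(len(nodes) + 1), rng.choice(unused))
    return nodes + [0] * (length - len(nodes))  # restore the zero padding


mutated = mutate([1, 2, 3, 4, 0, 0, 0, 0, 0, 0], list(range(1, 11)), random.Random(1))
print(mutated)
```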

The repair operation is needed because the crossover and mutation operations may generate
an illegal representation of a route, in which duplicated elements (nodes) appear in the same
route. In the algorithm, those duplicated nodes that would bring high cost to the route are
deleted. For example, there may be a route like [1 2 3 4 5 3 7 0 0 0], where node '3' is
duplicated. We can evaluate the cost of strings (2 3 4), (5 3 7) and the delay of strings (2 3 4),
(5 3 7). If the weighted cost of string (2 3 4) is less than that of (5 3 7), node 3 in (5 3 7) will be
discarded.
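The repair step can be sketched as follows. The cost of a string is supplied by the caller through `segment_cost`; the toy cost used here (sum of absolute differences between neighbouring node numbers) is purely hypothetical and only serves to make the example from the text reproducible.

```python
from typing import Callable, List, Sequence


def repair(route: List[int], segment_cost: Callable[[Sequence[int]], float],
           length: int = 10) -> List[int]:
    """Remove duplicated nodes, dropping the copy whose surrounding string
    (predecessor, node, successor) has the highest cost."""
    nodes = [node for node in route if node != 0]
    changed = True
    while changed:
        changed = False
        for node in nodes:
            positions = [i for i, n in enumerate(nodes) if n == node]
            if len(positions) > 1:
                worst = max(positions,
                            key=lambda i: segment_cost(nodes[max(i - 1, 0): i + 2]))
                del nodes[worst]
                changed = True
                break  # restart the scan on the shortened route
    return nodes + [0] * (length - len(nodes))


def toy_cost(segment: Sequence[int]) -> float:
    """Hypothetical string cost, used only for this illustration."""
    return float(sum(abs(a - b) for a, b in zip(segment, segment[1:])))


repaired = repair([1, 2, 3, 4, 5, 3, 7, 0, 0, 0], toy_cost)
print(repaired)
```

With this toy cost the string (2 3 4) costs 2 and (5 3 7) costs 6, so the second copy of node 3 is the one removed.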

The computation efficiency can be improved if the computation process is stopped once,
after a minimum number of trials (MinTrails), the algorithm has found a feasible solution
and has made no further improvement for a specific period of time. In the algorithm,
the 'improvement' is represented by the average cost change rate of the best solution
of a certain generation. This change rate is evaluated by the following expression:

$$\frac{\Delta C}{\Delta t} = \Delta R(k) = \frac{C(i) - C(i+1)}{k - i}$$

where we assume that the cost of the best solution of that generation changes at the $i$th step and
ChangeRate($k$) = $\Delta R(k)$ is the average change rate of cost at the $k$th step ($k > i$). Once this
value is less than a certain lower bound MinChangeRate, the GA computation may be stopped.
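One way to read the stopping rule is sketched below. It rests on an interpretation: $i$ is taken to be the step at which the best cost most recently changed, and the check for a feasible solution is left out. The names `best_costs`, `min_trails` and `min_change_rate` are illustrative stand-ins for the history of best costs, MinTrails and MinChangeRate.

```python
from typing import List


def change_rate(best_costs: List[float]) -> float:
    """Average change rate (C(i) - C(i+1)) / (k - i) at the latest step k.

    best_costs[g] is the cost of the best solution at step g; i is the step at
    which the best cost most recently changed (an interpretation of the text).
    """
    k = len(best_costs) - 1
    for i in range(k - 1, -1, -1):
        if best_costs[i] != best_costs[i + 1]:
            return (best_costs[i] - best_costs[i + 1]) / (k - i)
    return 0.0  # the best cost has never changed


def should_stop(best_costs: List[float], min_trails: int, min_change_rate: float) -> bool:
    """Stop only after min_trails steps and once the change rate is below the bound."""
    if len(best_costs) < min_trails:
        return False
    return change_rate(best_costs) < min_change_rate


history = [50.0, 40.0, 40.0, 40.0, 40.0, 40.0]
print(change_rate(history))
print(should_stop(history, min_trails=5, min_change_rate=2.5))
```

The longer the best cost stays unchanged, the larger $k - i$ becomes and the smaller the rate, so the rule eventually fires once the search stalls.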

The updating process may be improved by assuming that the price of each link and the
congestion of the network will change gradually. So when new traffic comes, the SP will
recompute routes for its customers. During the dynamic operation of the system, in order
to improve the efficiency of the algorithm, the results of the last computation can be partly
reused. One possibility is to mix certain 'training genes' into the initial population of the
new route computation. However, this may lead to premature discards and prevent the GA from
finding better solutions. Instead of mixing the past solutions into the initial population of the GA,
they may be mixed into the population after, for example, 70% of the MinTrails GA loops. In this
way, we can still take advantage of the results of the last computation and prevent premature
discards at the same time. If the network conditions change smoothly, we can take advantage
of the best solution of the last computation. If the network conditions change dramatically at
a certain time, so that the optimal route may differ totally from the past solution, the GA will not
benefit from mixing the past best solution into the population during the
GA computation. The flow chart in Figure 18.4 summarizes the operation of the algorithm.
The integration of the mobile agents into the routing algorithm is presented in Figure 18.5.

As shown in the figure, BrkAs, MAs, BAs and DAs migrate among different
network elements to implement the proposed routing algorithm. Once the PC client needs
a connection to the CP, an MA will be sent from the PC client to the SP containing information
about the upper bound of the setup time delay of the connection and the corresponding QoS
requirements. After receiving the MA from the PC client, the SP creates a BrkA to deal with
this connection requirement.

This BrkA creates MAs containing source and destination information, as well as QoS
requirements, and multicasts the agents to each NAB that it is connected with. Then the
BrkA in the SP waits for the agents from the NABs to obtain the routing solution according to the
scheme depicted in Figure 18.6.

Figure 18.6 Algorithm for BrkAs in SP to choose a route for its client.

As seen in the flow chart, in order to control the connection setup time, a timer is used
to determine the deadline of the route searching procedure. If the BrkA receives an MA
from a NAB with a satisfactory routing solution before the expiration of the timer, the route
searching process stops and this solution is selected. Otherwise, when the timer expires
the agent chooses the best route among the route candidates found until that time. Each
NAB also creates a BrkA to deal with the connection when it receives the MAs with the
corresponding connection request from the SP. Then, three kinds of agents are used to
implement the routing algorithm as follows.

A browser agent will be created and sent to nodes inside the individual private network 
that the NAB belongs to. These agents will collect resource information such as available 
bandwidth, delay of the link, price of the link, etc. In a similar way, the BrkA in each NAB 
will also send out BAs to INABs to see if it can take advantage of network resources from 
other NPs. 

A daemon agent containing the GA code and resource-related information will be created 
after collecting the necessary resource information, to implement the routing algorithm 
described in detail in the previous section. Instead of executing the algorithm in each NAB, 
the BrkA sends DAs to the most suitable nodes inside its private network (e.g. nodes with 
enough computation resources such as CPU, memory, etc.). In this way, we can balance the
computation load among nodes in the private networks, if needed. 

A messenger agent will be used to bring results back to the BrkA from DAs after the 
genetic-based route computation. This agent will be forwarded to the BrkA in the SP. 
Performance results for the algorithm can be found in Papavassiliou et al. [35]. 
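The timer-based choice made by the BrkA in the SP, described earlier in connection with Figure 18.6, can be sketched as follows. Modelling a 'satisfactory' solution as one whose cost does not exceed a threshold is an assumption, as are the tuple layout of a candidate and the route labels; arrival times stand in for the moments at which MAs return from the NABs.

```python
from typing import Iterable, Optional, Tuple

Candidate = Tuple[float, str, float]  # (arrival time, route label, route cost)


def choose_route(arrivals: Iterable[Candidate], deadline: float,
                 satisfactory_cost: float) -> Optional[Candidate]:
    """Accept the first satisfactory route that arrives before the timer expires;
    otherwise return the cheapest candidate received by the deadline, if any."""
    best: Optional[Candidate] = None
    for candidate in sorted(arrivals):  # process candidates in order of arrival
        arrival, _, cost = candidate
        if arrival > deadline:
            break  # timer expired
        if cost <= satisfactory_cost:
            return candidate  # search stops, this solution is selected
        if best is None or cost < best[2]:
            best = candidate
    return best


replies = [(1.0, "via NAB1", 30.0), (2.0, "via NAB2", 12.0), (3.0, "via NAB3", 5.0)]
print(choose_route(replies, deadline=5.0, satisfactory_cost=15.0))
```

In this example the reply via NAB2 is accepted even though a cheaper route arrives later, which reflects the trade-off between connection setup time and route cost.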



18.4 AD HOC NETWORK MANAGEMENT 

We start by identifying some of the properties of ad hoc networks that make them difficult 
to manage. 

18.4.1 Heterogeneous environments 

First of all, nodes of an ad hoc network can range in complexity from simple sensors 
located in the field to fully functional computers such as laptops. An implication of this 
diversity is that not all nodes will be able to contribute equally to the management task. For 
instance, it is likely that sensors and small personal digital assistant (PDA)-type devices will 
contribute minimally to the task of management, while more powerful machines will need 
to take on responsibilities such as collecting data before forwarding it to the management 
station, tracking other mobiles in the neighborhood as they move, etc. Thus, the management 
protocol needs to function in very heterogeneous environments. 

18.4.2 Time varying topology 

One mission of a network management protocol is to present the topology of the network 
to the network manager. In wireline networks, this is a very simple task because changes 
to the topology are very infrequent (e.g. a new node gets added, failure of a node, or 
addition/deletion of a subnetwork, etc.). In mobile networks, on the other hand, the topology 
changes very frequently because the nodes move about constantly. Thus, the management
station needs to collect connectivity information from nodes periodically. An implication
of this is an increased message overhead in collecting topology information. 

18.4.3 Energy constraints 

Most nodes in ad hoc networks run on batteries. Thus, we need to ensure that the network 
management overhead is kept to a minimum so that energy is conserved. A radio consumes
different amounts of energy when transmitting and when receiving a packet. In addition, the CPU
expends energy in processing these packets. Thus, we need to reduce the number of packets 
transmitted/received/processed at each node. This requirement is contradictory to the need 
for topology update messages previously discussed. 

18.4.4 Network partitioning 

Energy constraints and mobility can result in the network becoming partitioned frequently. 
For instance, nodes may power themselves off to conserve energy resulting in partitions, 
or a node may move out of transmission range of other nodes. Similarly, a node may die 
when its battery runs out of power. In all these cases, the partitioned subnetworks need to 
continue running independently, and the management protocol must be robust enough to 
adapt. For instance, when the network gets partitioned, the management protocol entities 
must quickly learn that the partition has occurred and reconfigure the subnetwork(s) to 
function autonomously. Furthermore, when partitions merge, the management protocol 
must be capable of updating the network view without too much of an overhead. 

18.4.5 Variation of signal quality 

Signal quality can vary quite dramatically in wireless environments. Thus, fading and 
jamming may result in a link going down periodically. An effect of this is that the network 
topology from a graph theoretic point of view changes. However, the physical layout of 
the network may not change at all. The management protocol must be able to distinguish 
this case from the case when node moves cause topology changes, because in the case of 
changing link quality/connectivity, it may not be necessary to exchange topology update 
messages at all. In order to be able to do this, the management protocol entity (which resides 
at the application layer) must be able to query the physical layer. This obviously violates 
the layering concept of OSI, but it results in enormous savings. 

18.4.6 Eavesdropping 

Ad hoc networks are frequently set up in hostile environments (e.g. battlesite networks) 
and are therefore subject to eavesdropping, destruction and possibly penetration (i.e. a 
node is captured and used to snoop). Thus, the management protocol needs to incorporate 
encryption as well as sophisticated authentication procedures. 

18.4.7 Ad hoc network management protocol functions 

In this section we discuss the main functions of the ad hoc network management protocol
(ANMP). Data collection is a necessary function in ANMP where the management entities
collect node and network information from the network. SNMP specifies a large list of
information items that can be collected from each node. However, this list does not include 
some crucial data items that are specific to the ad hoc environment like the status of the 
battery power (expected remaining lifetime), link quality, location (longitude and latitude), 
speed and direction, etc. All this information needs to be collected as (and when) it changes 
'significantly'. For example, the location of a mobile node changes continuously, but there 
is little point in updating it constantly because the overhead in message complexity is high. 
The best solution is to update this information when some other aspect of the node changes. 
For instance, if the node's connectivity changes as a result of the motion, then we may need 
to update its location. 

One problem that arises in ad hoc networks in relation to data collection is the message 
overhead. Ad hoc networks have limited bandwidth (whose quality is variable), and we 
must ensure that the process of management does not consume significant amounts of this 
resource. Since network management runs at the application layer, the simplest way to 
implement data collection (at the manager station) is to poll each node individually. This 
method, unfortunately, results in a very high message overhead. A more efficient method 
of data collection is to use a spanning tree rooted in the manager station. 
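
The difference in overhead between the two approaches can be illustrated with a rough message count. The sketch below is only an illustration under stated assumptions: every hop costs one message, the spanning tree is a breadth-first tree rooted at the manager, and each node merges the replies of its children into a single message (perfect aggregation). The topology `adj` and the function names are hypothetical and do not come from ANMP.

```python
from collections import deque

def bfs_depths(adj, root):
    """Return the hop distance from root to every reachable node."""
    depth = {root: 0}
    queue = deque([root])
    while queue:
        u = queue.popleft()
        for v in adj[u]:
            if v not in depth:
                depth[v] = depth[u] + 1
                queue.append(v)
    return depth

def polling_cost(adj, manager):
    """Individual polling: request and reply each travel d hops per node."""
    depth = bfs_depths(adj, manager)
    return sum(2 * d for d in depth.values())

def spanning_tree_cost(adj, manager):
    """Tree collection (assumed perfect aggregation): one request and one
    merged reply per tree edge, and a tree on n nodes has n - 1 edges."""
    depth = bfs_depths(adj, manager)
    return 2 * (len(depth) - 1)

# Hypothetical 6-node topology; node 0 is the manager station.
adj = {0: [1], 1: [0, 2, 3], 2: [1, 4], 3: [1], 4: [2, 5], 5: [4]}
print(polling_cost(adj, 0), spanning_tree_cost(adj, 0))
```

In this toy topology polling costs 24 messages against 10 for the tree, and the gap widens as nodes lie further from the manager.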

Configuration/fault management is needed because nodes in ad hoc networks die, move 
or power themselves off to save energy. In all of these cases, the network topology changes, 
and the manager station needs to know the fate of these nodes. In cases when a node 
is unavailable for the reasons just stated, the manager records that fact in its database. 
However, even in the case when a manager knows that a node is dead, the entry for that 
node is not removed from the database because the node may be resurrected (e.g. we put 
in a new battery) or, keeping in mind that ad hoc networks are temporary, we may need 
a complete history of the network's behavior to effect a redesign of protocols, evaluate 
security breaches, etc. 

New nodes may join a network periodically, and these nodes must be incorporated seam- 
lessly into the network. A network may also be partitioned periodically. In this case, we 
need to ensure that each partition selects its own manager. However, when these parti- 
tions merge, one common manager needs to be chosen. Manager selection must be done 
based on the hardware and software capabilities of nodes and the available battery power. 
We may also have geographically coexisting but independent networks. An example is a 
battlefield, where a naval unit may be physically collocated with an infantry unit, each 
of which is using its own ad hoc network. In this case, the two networks may decide to 
be managed together or continue being managed independently but exchange information 
(such as link quality, presence of jamming, etc.) with each other as an aid to better de- 
ploy the network resources. It is also possible that a node may belong to two different 
networks and be managed by two (or more) managers. An example is a disaster relief 
model, where a police officer may remain on a police (secure) ad hoc network but si- 
multaneously be connected (and managed) to an ad hoc network of medical relief teams. 
In 4G, management protocols for ad hoc networks must be able to operate in all these
scenarios.

Security management deals with security threats [42-53, 55]. Ad hoc networks are very 
vulnerable to security threats because the nodes (e.g. the unmanned nodes) can easily 
be tampered with, and signals can be intercepted, jammed or faked. Current protocols, 
such as SNMPv3 [41, 56, 62, 63], do provide us with some mechanisms to guard against 
eavesdropping and replay attacks using secure unicast, which is not efficient for all incoming 
network architectures.

The design of ANMP should address the issues raised above. It should also be compatible
with SNMP. This is necessary because: (1) SNMP is a widely used management protocol
today; and (2) ad hoc networks can be viewed as extensions of today's networks that are
used to cover areas lacking a fixed infrastructure. In operation, it is quite likely that an
ad hoc network would be connected to a local area wireline network (using a gateway). In
such cases, the ANMP manager should be designed to be viewed either as a peer of the SNMP
manager (which is managing the wireline network) or as an agent of the SNMP manager.
This flexibility is a major strength of ANMP. Obviously ANMP can operate in isolated 
ad hoc networks as well. In the next section, we provide an overview of the possible design 
choices in ANMP with the following constraints [39]: 

(1) the PDU structure used is identical to SNMP's PDU structure; 

(2) UDP is the transport protocol used for transmitting ANMP messages; 

(3) lost data is not retransmitted by ANMP because information is periodically updated 
anyway. Furthermore, if the application sitting on top of ANMP wishes to obtain the 
lost information, it can request the ANMP manager to solicit that information again. 

18.4.8 ANMP architecture 

In order to have a protocol that is message-efficient, a hierarchical model for data collection 
is appropriate, since intermediate levels of the hierarchy can collect data (possibly producing 
a digest) before forwarding it to upper layers of the hierarchy. A problem, however, with 
utilizing a hierarchical approach in ad hoc networks is the cost of maintaining a hierarchy in 
the face of node mobility. A good tradeoff is to use a three-level hierarchical architecture for 
ANMP. Figure 18.7 illustrates this architecture. The lowest level of this architecture consists 
of individual managed nodes called agents. Several agents (that are close to one another) 
are grouped into clusters and are managed by a cluster head (the nodes with squares around 
them in the figure). The cluster heads in turn are managed by the network manager. It is 
important to emphasize that: (a) clustering for management (in ANMP) is very different 
from clustering for routing, as we discuss in Chapter 13; and (b) a manager is frequently 
more than one hop away from the cluster heads (Figure 18.7 is a logical view and not a 
physical view). 




Figure 18.7 ANMP's hierarchical architecture.

The structure of the clusters is dynamic. Thus, as nodes move about, the number and
composition of the clusters change. Similarly, the nodes serving as cluster heads also change 
over time. Different algorithms for forming and maintaining clusters will be discussed below. 
The clusters should have the following properties: 

(1) The clusters are neither too large nor too small. The message overhead of collecting 
data within large clusters will be high. Likewise, if we have very small clusters, then 
there will be many cluster heads, all of which will be controlled by the manager. Thus, 
the message overhead in transactions between the cluster heads and the manager is 
high. 

(2) The clusters are formed such that node mobility does not result in frequent recom- 
putation of the clusters. This property is necessary if we are to reduce the message 
overhead of maintaining clusters. 

(3) Sometimes nodes move out of one cluster and into another but are not incorporated 
into the new cluster immediately. This is because cluster maintenance algorithms 
only run periodically and not continuously. The effect of this is that some percentage 
of nodes may be unmanaged by cluster heads for short periods of time. This is not 
really a problem (except for the message overhead in data collection) because these 
nodes are still in communication with the overall manager, and they can be directly 
managed by the manager. 

It is important to make a distinction between the use of clusters for management vs clusters 
for routing. Clustering in ANMP is used to logically divide the network into a three-level 
hierarchy in order to reduce the message overhead of network management. Since ANMP 
is an application-layer protocol, ANMP presupposes the existence of an underlying routing 
protocol. Thus, the manager node can always reach any of the nodes in the ad hoc network 
(so long as they both lie in the same partition) and can manage them directly. Clustering 
simply introduces intermediate nodes called cluster heads for the purpose of reducing the 
message overhead. Thus, clustering algorithms used for management serve a weaker and 
different objective when compared with clustering algorithms used for routing. 

In cluster-based routing [40], neighboring nodes form clusters and select a cluster head. 
Cluster heads have the responsibility of routing packets sent to nodes outside the cluster. It 
is easy to see that clustering here serves a fundamental goal of maintaining routes. Finally, 
we note that if the underlying routing protocol is cluster-based, ANMP could simply use 
these clusters for management as well. However, if the routing protocol is not cluster-based 
[41], the two clustering algorithms described here form clusters and rely on routing support
to exchange control messages. 

Graph-based clustering is described first as a basic concept. After that the maintenance 
algorithm that deals with node mobility after clusters have been formed will be discussed. 
A node in the graph represents a mobile host in the network. There is an undirected link 
between two nodes if they are within transmission range of each other. For the purpose of 
clustering we assume the following:

(1) each node in the network has a unique ID;

(2) each node in the network maintains a list of its one-hop neighbors (recall that ANMP
runs at the application layer, and therefore it can obtain this information from the
network or MAC layer);

(3) messages sent by a node are received correctly by all its neighbors within a finite
amount of time.

Figure 18.8 Clusters formed using graphical clustering.

In the algorithm, the node with minimum ID among its neighbors (which have not joined 
any other cluster) forms a cluster and becomes the cluster head. Upon hearing from a cluster 
head, each node that has not yet joined any cluster declares that it has joined a cluster. If 
any node has been included in more than one cluster, then all but one of the cluster heads prune
the node entry from their list when they do not hear any information from that node. 
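
The formation rule can be sketched as follows. This is a sequential emulation written for illustration, not the distributed protocol itself: it visits nodes in increasing ID order, which guarantees that any unclustered node reached is the minimum-ID node among its unclustered neighbors. Because the emulation is sequential, no node ends up in two clusters, so the pruning step is not needed here. The function name and the example topology are hypothetical.

```python
def min_id_clustering(adj):
    """Sequential emulation of minimum-ID cluster formation.

    adj maps each node ID to the IDs of its one-hop neighbors.
    Returns a dict mapping each cluster head to its sorted member list.
    """
    clustered = set()
    clusters = {}
    for node in sorted(adj):
        if node in clustered:
            continue
        # All smaller IDs are already clustered, so `node` is the minimum-ID
        # node among its unclustered neighbors and becomes a cluster head.
        members = [node] + [n for n in adj[node] if n not in clustered]
        clustered.update(members)
        clusters[node] = sorted(members)
    return clusters

# Hypothetical 6-node topology (not the one drawn in the book's figure).
adj = {1: [2, 3], 2: [1, 4], 3: [1, 5], 4: [2, 5, 6], 5: [3, 4], 6: [4]}
print(min_id_clustering(adj))
```

Every member is one hop from its cluster head by construction, which matches the property stated for newly formed clusters.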

Figure 18.8 illustrates cluster formation in a simple ad hoc network. The node with the
minimum ID forms the cluster and becomes the cluster head. Here node 1 is the minimum
ID node among its neighbors; therefore, it forms a cluster C1. Node 4 does not initiate
cluster formation because it is not the minimum ID node among its neighbors. When node 2
broadcasts a message that it has joined cluster C1, node 4 realizes that it is now the minimum
ID node among its remaining neighbors. Since the cluster formation runs in a distributed way,
it is possible that, by the time node 4 receives the broadcast message from node 2, node 3 has
already initiated cluster formation, and node 5 also sends a message that it has joined some
cluster C3. Thus, when node 4 starts cluster formation, it only includes the remaining nodes
among its neighbors and forms C2.

Because the node with minimum ID considers only its one-hop neighbors while forming 
the cluster, the nodes in a cluster are one hop away from the cluster head and at most 
two hops away from any other cluster mate when the cluster is formed. The information 
maintained by each node after clusters are formed is: 

• a neighbor list - a list of nodes that are one hop away; 

• a cluster list - a list of all nodes that are its cluster mates; 

• a ping counter - a counter that indicates the time steps elapsed since it last heard from 
its cluster head. 

Cluster maintenance for graph-based clustering deals with the changes in the network
topology. As a result, the clusters and cluster membership have to be updated. Changes in
cluster membership are triggered when a node moves out of a cluster (and into another)
or when the cluster head itself moves out of a cluster (or, relatively speaking, cluster
members move away from the cluster head). Figure 18.9 presents some illustrative cases.
Figure 18.9(a) shows a situation where nodes move about but are still connected to at least
one of their cluster mates (see Figure 18.8). Here, there is no need to recompute clusters.
Figure 18.9(b) shows two scenarios: (1) when a node moves across the cluster boundary;
and (2) when the cluster head gets disconnected. It can be seen that node 4 gets disconnected
from all the members of its previous cluster. Since node 4 is two hops away from the cluster
head of cluster C1, it sends a join request to node 2. On receiving this request from node 4,
node 2 adds node 4 to its cluster list and broadcasts the list to all the members. Meanwhile,
nodes 10 and 11 discover that their cluster head has moved away, and they initiate cluster
formation and form a new cluster C4. The previous example indicates an important property
of the maintenance algorithm.

Figure 18.9 Effect of node mobility on clusters.

When new clusters are formed, all nodes in the cluster are one hop away from the cluster 
head. However, as nodes move about, we allow nodes to join clusters even if they are two
hops away from the cluster head of an existing cluster. This flexibility drastically reduces
the message overhead of the algorithm. 



18.4.8.1 Performance of cluster maintenance algorithm

If a node is connected to at least one node in its cluster list, it is still considered to be part 
of that cluster. If a node detects that it has no links to any of its previous cluster mates, it 
either forms a new cluster by itself or joins another cluster. The algorithm should be able to 
differentiate between node movements within the cluster and movements across the cluster 
boundary. Four types of events that a node can detect as a result of mobility are identified. A 
node can detect: (1) a new neighbor, who is also a cluster mate; (2) that a previous neighbor 
and cluster mate has moved; (3) that it was previously directly connected to the cluster head 
but is no longer directly connected, or it was previously not directly connected but is now 
directly connected; (4) a new neighbor, who wants to join the cluster. 

At every fixed interval, called a time step, each node locally creates a list of events that 
it has observed and sends it to its cluster head. The cluster head collects these events and
recomputes the cluster list. If there is any change in the cluster membership, the cluster head 
broadcasts the new list. Thus, whenever a node moves out of a cluster or joins a cluster, 
the message exchange is restricted to within that cluster. In order to minimize the number 
of cluster changes, a node is allowed to join a cluster if the cluster head is two hops away. 
The restriction of two hops (as opposed to, say, three hops) has been enforced to avoid the 
creation of big clusters. 

In such a division of the network, the cluster head plays an important role. That is why a 
major event that can occur is the movement of the cluster head. To determine if the cluster 
head has moved away, a ping counter is used at each node. This counter gets incremented 
at every time step. If the counter at the cluster head crosses some threshold value, then the 
cluster head sends a ping message to all its cluster mates, indicating that it is still alive. If 
the cluster mates do not hear a ping after their ping counters cross a threshold, they assume 
that the cluster head is either dead or has moved out. Once a node detects such an event, it 
cannot be certain about its cluster list. New cluster(s) are formed with one-hop neighbors 
in the same way as the clusters were initially formed. It is easy to see that the frequency at 
which these pings are sent plays an important role in maintaining the clusters. If the ping
frequency is low, then between consecutive pings, some nodes may be unmanaged by
a cluster head (i.e. they do not belong to any cluster). Unfortunately, even though a high 
frequency of pings minimizes the number of nodes unmanaged by cluster heads, it results 
in a higher message overhead. 
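
The ping-counter mechanism can be sketched as a small time-stepped model. This is a minimal sketch under assumptions that the text does not fix: the cluster head pings every `ping_interval` time steps, a member resets its counter on every ping it hears, and a member declares the head lost once its counter exceeds `threshold`. The class and function names are hypothetical.

```python
class ClusterHead:
    def __init__(self, ping_interval):
        self.ping_interval = ping_interval
        self.counter = 0

    def tick(self):
        """Advance one time step; return True when a ping is sent."""
        self.counter += 1
        if self.counter >= self.ping_interval:
            self.counter = 0
            return True
        return False

class Member:
    def __init__(self, threshold):
        self.threshold = threshold
        self.ping_counter = 0

    def on_ping(self):
        self.ping_counter = 0

    def tick(self):
        """Advance one time step; return True if the head is presumed lost."""
        self.ping_counter += 1
        return self.ping_counter > self.threshold

def detection_step(head_leaves_at, ping_interval, threshold, steps):
    """Return the time step at which the member gives up on the head,
    or None if that never happens within `steps` time steps."""
    head, member = ClusterHead(ping_interval), Member(threshold)
    for t in range(1, steps + 1):
        if t < head_leaves_at and head.tick():
            member.on_ping()          # head still in range and pinging
        elif member.tick():
            return t                  # counter crossed the threshold
    return None

print(detection_step(head_leaves_at=7, ping_interval=3, threshold=4, steps=20))
```

With these illustrative values the head leaves at step 7 and the member only reacts at step 11; during the intervening steps the member is unmanaged, which is the cost of a low ping frequency described above.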

One method of reducing the number of ping messages while simultaneously keeping
the fraction of nodes unmanaged by cluster heads small is to exploit information available
at the MAC layer. The MAC layer in wireless networks periodically transmits a beacon to
announce itself to its neighbors. Thus, the MAC layer keeps an updated list of its one-hop
neighbors. If nodes transmit this list to their neighbors, changes in the cluster membership
can be detected quickly. If the cluster head moves away, its departure will be noticed by the
MAC layer of its one-hop neighbors. These nodes can act on this information to quickly
re-form clusters.

Another characteristic of ad hoc networks is partitioning. If a subnetwork gets partitioned
from the main network, it is treated as any other cluster because the protocol does not require
any information exchange between clusters. If a single node gets partitioned, it forms a
cluster by itself. However, when it gets reconnected, it tries to join another cluster because
clusters that are too small or too large are both inefficient from the point of view of
network management.

Figure 18.10 Message overhead and percentage of nodes unmanaged by cluster heads.

To study the performance of this clustering algorithm, an ad hoc network was simulated
in Chen et al. [39] in which 30 nodes move randomly within a 1500 × 1500 unit box
(for 60 nodes, the area of the playground was twice that, and so on). Each node selects a
direction uniformly at random in the range [0, 360] degrees. It moves along that direction
for a length of time that is exponentially distributed. At the end of this time, it recomputes
the direction and traveling time. When a node hits the edge of the box, it wraps around and
emerges from the opposite edge. In the simulation, the same transmission power was used
for all the nodes in the network and the ad hoc network was represented as an undirected
graph with links between two nodes if they are within transmission range of each other. The
average speed of nodes is 1-50 units/s in different runs. The transmission range of a node
is fixed at 450 units. Each reported value is an average of 10 readings, and the simulation time is
1000 s. Finally, a packet loss probability of $10^{-3}$ was assumed in all simulations.
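
The mobility model described above can be sketched in a few lines. This is an illustrative reconstruction, not the simulator of Chen et al. [39]: the mean travel time, the one-second step, the fixed per-run speed and the use of plain (non-wrapped) Euclidean distance for links are all assumptions, and packet loss is not modeled.

```python
import math
import random

BOX = 1500.0        # side of the square area (30-node case in the text)
TX_RANGE = 450.0    # transmission range (from the text)
MEAN_TRAVEL = 20.0  # mean travel time in seconds: an assumed value

def new_leg(rng):
    """Draw a heading uniformly in [0, 2*pi) and an exponential travel time."""
    return rng.uniform(0.0, 2.0 * math.pi), rng.expovariate(1.0 / MEAN_TRAVEL)

def move(node, speed, dt, rng):
    """Advance one node by dt seconds, wrapping around the box edges."""
    if node["time_left"] <= 0.0:
        node["heading"], node["time_left"] = new_leg(rng)
    node["x"] = (node["x"] + speed * math.cos(node["heading"]) * dt) % BOX
    node["y"] = (node["y"] + speed * math.sin(node["heading"]) * dt) % BOX
    node["time_left"] -= dt

def links(positions, tx_range=TX_RANGE):
    """Undirected links between nodes within range (plain Euclidean distance)."""
    return {(i, j)
            for i, (xi, yi) in enumerate(positions)
            for j, (xj, yj) in enumerate(positions)
            if i < j and math.hypot(xi - xj, yi - yj) <= tx_range}

def simulate(n_nodes=30, speed=10.0, seconds=100, seed=0):
    """Run the mobility model and return the final node positions."""
    rng = random.Random(seed)
    nodes = [{"x": rng.uniform(0, BOX), "y": rng.uniform(0, BOX),
              "heading": 0.0, "time_left": 0.0} for _ in range(n_nodes)]
    for _ in range(seconds):
        for node in nodes:
            move(node, speed, 1.0, rng)
    return [(n["x"], n["y"]) for n in nodes]

positions = simulate()
print(len(links(positions)), "links among", len(positions), "nodes")
```

Recomputing `links` at every time step and comparing consecutive link sets would give the topology changes that drive cluster maintenance.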

Figure 18.10 (upper part) shows the message overhead of the protocol for the case when
only pings are used to detect topology changes (graph on the left) and when pings along with
MAC-layer information are used (graph on the right). The x-axis indicates the speed in units
per second, and the y-axis shows the number of messages exchanged per second for cluster
maintenance. Each curve in the graph depicts the message overhead incurred for a different
ping interval, ranging from no ping (equivalent to infinite time steps) down to one ping
message every time step. The message overhead increases almost linearly
with speed. The message overhead when MAC-layer information is used shows
similar behavior, but more messages are exchanged. This is because
nodes do not wait for the ping message from the cluster head: the clustering algorithm
is triggered as soon as the nodes detect cluster head movement.

The same figure (lower part) shows the percentage of nodes unmanaged by cluster
heads. The plot on the left is for the case when only pings are used to detect topology
changes, while the graph on the right is for the case when pings as well as MAC-layer
information are used to detect topology changes. It can be seen from the graph that at
higher speeds the percentage of nodes unmanaged by cluster heads goes up. This is mainly
because of frequent disconnections and partitions generated in the network. Interestingly,
the percentage of nodes unmanaged by cluster heads is as high as 50-70 % when only
pings are used, but stays below 10 % when MAC-layer connectivity information is used as well.

19

Network Information Theory 



Information theory has made a significant contribution to the development of communica- 
tion theory and practice. This is especially true in the domain of physical layer including 
channel capacity issues, coding and modulation. Unfortunately, information theory has 
not yet made a comparable mark in the field of communication networks, which is to- 
day the center of activity and attention in most information technology areas. The principal 
reason for this is twofold. First, by focusing on the classical point-to-point, source-channel- 
destination model of communication, information theory has ignored the bursty nature of 
real sources. In advanced networks, source burstiness is the central phenomenon that un- 
derlies the process of resource sharing for communication. Secondly, by focusing on the 
asymptotic limits of the tradeoff between accuracy and rate of communication, information 
theory ignored the role of delay as a parameter that may affect this tradeoff. In networking, 
delay is a fundamental quantity, not only as a performance measure, but also as a parameter 
that may control and affect the fundamental limits of the rate-accuracy tradeoff. 

A comprehensive survey of information theory contributions to the study of different 
network layers is given in Ephremides and Hajek [1]. The effective capacity of communi- 
cations links including time-varying channel capacity, adaptive coding and modulation and 
queueing has already been discussed in Chapters 4 and 8. In this chapter we additionally
discuss the effective capacity of advanced cellular networks, the transport capacity of wireless
networks and network coding.



19.1 EFFECTIVE CAPACITY OF ADVANCED CELLULAR NETWORKS 

In this section we discuss effective capacity and capacity losses in an advanced CDMA 
network due to imperfections in the operation of the system components. In addition to the 
standard WCDMA technology, both base stations and mobile units use antenna beamforming
and self-steering to track the incoming (and transmitted) signal direction. By using high
directivity antennas and antenna pointer tracking, the level of multiple access interference
(MAI) and the required transmitted power are reduced. In order to exploit the available
propagation diversity, signals arriving from different directions (azimuth $\psi$, elevation $\varphi$)
and delay $\tau$ are combined in a three-dimensional $(\psi, \varphi, \tau)$ RAKE receiver. This is expected
to significantly improve the system performance. In this section we present a systematic
mathematical framework for capacity evaluation of such a CDMA network in the presence 
of implementation imperfections and a fading channel. The theory is general and some 
examples of a practical set of channel and system parameters are used as illustrations. As 
an example, it was shown that, in the case of voice applications and a two-dimensional 
(4 antennas x 4 multipaths) RAKE receiver, up to 90 % of the system capacity can be lost 
due to the system imperfections. Further elaboration of these results, including extensive 
numerical analysis based on the offered analytical framework, would provide enough back- 
ground for understanding of possible evolution of advanced W-CDMA and MC CDMA 
towards the fourth generation of mobile cellular communication networks. 

The physical layer of the third generation of mobile communication system (3G) is based 
on wideband CDMA. The CDMA capacity analysis is covered in a number of papers and 
recently has become a subject in standard textbooks [2-8]. 

The effect of more sophisticated receiver structures (such as multiuser detectors, MUD, or
joint detectors) on CDMA or hybrid systems capacity has been examined in [9-13]. The
results in Hamalainen et al. [9] show a roughly twofold increase in capacity with a MUD
efficiency of 65 % compared with conventional receivers. The effect of the fractional cell load
on the coverage of the system is presented in [10]. The coverage of the MUD-CDMA uplink was
less affected by the variation in cell loading than in conventional systems. References [11]
and [12] describe a CDMA system where joint data estimation is used with coherent receiver
antenna diversity. This system can be used as a hybrid multiple access scheme with TDMA
and FDMA components. In Manji and Mandayam [13] significant capacity gains are reported
when zero forcing multiuser detectors are used instead of conventional single-user receivers.

In most of the references it has been assumed that the service of interest is low rate 
speech. In next generation systems (4G), however, mixed services including high rate data 
have to be taken into account. This has been done in Huang and Bhargava [14], where 
the performance of integrated voice/data system is presented. It is also anticipated that 4G 
will be using adaptive antennas to further reduce the MAI. The effects of adaptive base 
station antenna arrays on CDMA capacity have been studied [6, 7]. The results show that 
significant capacity gains can be achieved with quite simple techniques. 

One conventional way to improve cellular system capacity, used in 3G systems, is 
cell splitting, i.e. sub-dividing the coverage area of one base station to be covered by 
several base stations (smaller cells) [15]. Another simple and widely applied technique to 
reduce interference spatially in 3G is to divide cells into sectors, e.g. three 120° sectors. 
These sectors are covered by one or several directional antenna elements. The effects of
sectorization on spectrum efficiency are studied in Chan [16]. The conclusion in Chan
[16] is that sectorization reduces co-channel interference and improves the signal-to-noise 
ratio of the desired link at the given cluster size. However, at the same time the trunking 
efficiency is decreased [17]. Owing to the improved link quality, a tighter frequency reuse 
satisfies the performance criterion in comparison to the omnicellular case. Therefore, the 
net effect of sectorization is positive at least for large cells and high traffic densities. 

By using M-element antenna arrays at the base station the spatial filtering effect can 
be further improved. The multiple beam adaptive array would not reduce the network 
trunking efficiency, unlike sectorization and cell splitting [18]. These adaptive or smart
antenna techniques can be divided into switched-beam, phased array and pure adaptive
antenna systems. Advanced adaptive systems are also called spatial division multiple access 
(SDMA) systems. Advanced SDMA systems maximize the gain towards the desired mobile 
user and minimize the gain towards interfering signals in real time. 

According to Winters [19], by applying a four-element adaptive array at the TDMA
uplink, frequencies can be reused in every cell (three-sector system) and a sevenfold capacity
increase is achieved. Correspondingly, a four-beam antenna leads to a reuse of three or four
and doubled capacity at small angular spread.

Some practical examples of the impact of the use of advanced antenna techniques on 
the existing cellular standards are described in References [20, 21]. In Petrus et al. [20] the 
reference system is AMPS and in Mogensen et al. [2 1 ] it is GSM. The analysis in Kudoh and 
Matsumoto [22] uses ideal and flat-top beamformers. The main lobe of the ideal beamformer 
is flat and there are no sidelobes, whereas the flat-top beamformer has a fixed sidelobe level. 
The ideal beamformer can be seen as a realization of the underloaded system, i.e. there are 
less interferers than there are elements in the array. The overloaded case is better modeled 
by the flat-top beamformer because all interferers cannot be nulled and the sidelobe level 
is increased. Performance results show that reuse factor of one is not feasible in AMPS, 
but reuses four and three can be achieved with uniform linear arrays (ULA) with five 
and eight elements, respectively. Paper [21] concentrates on the design and performance 
of the frequency hopping GSM network using conventional beamforming. Most of the 
results are based on the simulated and measured data of eight-element ULA. The simulated 
C/I improvement follows closely the theoretical gain at low azimuth spreads. In urban 
macrocells the C/I gain is reduced from the theoretical value 9 dB down to approximately 
5.5-7.5 dB. The designed direction of arrival (DoA) algorithm is shown to be very robust to 
co-channel interference. The potential capacity enhancement is reported to be threefold in 
a 1/3 reuse FH-GSM network for an array size of M = 4-6. A number of papers [23-30] 
present the analysis of capacity improvements using spatial filtering. 

Solutions in 4G will go even beyond the above options and assume that both the base 
station and the mobile unit use beamforming and self-steering to continuously track the 
transmitter-receiver direction (two-sided beam pointer tracking, 2SBPT). Owing to user 
mobility and tracking imperfections there will always be a tracking error that results in a 
lower received signal level, causing performance degradation. In this chapter we provide 
a general framework for performance analysis of a network using this technology. It is 
anticipated that this technology will be used in 4G systems. 

19.1.1 4G cellular network system model 

Although the general theory of MIMO system modeling is applicable for the system descrip- 
tion, performance analysis will require more details and a slightly different approach will 
be used. This model will explicitly present signal parameters sensitive to implementation 
imperfections. 

19.1.1.1 Transmitted signal 

The complex envelope of the signal transmitted by user $k \in \{1, 2, \ldots, K\}$ in the $n$th symbol 
interval $t \in [nT, (n+1)T]$ is 

$$s_k = A_k T_k(\psi, \varphi)\, e^{j\phi_{k0}}\, S_k^{(n)}(t - \tau_k) \tag{19.1}$$

where $A_k$ is the transmitted signal amplitude of user $k$, $T_k(\psi, \varphi)$ is the transmitting antenna 
gain pattern as a function of azimuth $\psi$ and elevation angle $\varphi$, $\tau_k$ is the signal delay, $\phi_{k0}$ is 
the transmitted signal carrier phase, and $S_k^{(n)}(t)$ can be represented as 

$$S_k^{(n)}(t) = S_k^{(n)} = S_k = S_{ik} + jS_{qk} = d_{ik}c_{ik} + jd_{qk}c_{qk} \tag{19.2}$$

In this equation $d_{ik}$ and $d_{qk}$ are two information bits in the I- and Q-channels, respectively, 
and $c_{ikm}^{(n)}$ and $c_{qkm}^{(n)}$ are the $m$th chips of the $k$th user PN codes in the I- and Q-channel, 
respectively. Equations (19.1) and (19.2) are general, and different combinations of the 
signal parameters cover most of the signal formats of practical interest. In practical systems 
the codes will be a combination of a number of component codes [2]. 



19.1.1.2 Channel model 

The channel impulse responses consist of discrete multipath components represented as 

$$h_k^{(n)}(\psi, \varphi, t) = \sum_{l=1}^{L} h_{kl}^{(n)}(\psi, \varphi)\,\delta\bigl(t - \tau_{kl}^{(n)}\bigr) = \sum_{l=1}^{L} H_{kl}^{(n)}(\psi, \varphi)\, e^{j\phi_{kl}}\,\delta\bigl(t - \tau_{kl}^{(n)}\bigr)$$

If antenna lobes are narrow we can use a discrete approximation of this function in the spatial 
domain too and implement a 3D RAKE receiver as follows: 

$$h_k^{(n)}(\psi, \varphi, t) = \sum_{l=1}^{L} h_{kl}^{(n)}\,\delta\bigl(\psi - \psi_{kl},\ \varphi - \varphi_{kl},\ t - \tau_{kl}^{(n)}\bigr) = \sum_{l=1}^{L} H_{kl}^{(n)} e^{j\phi_{kl}}\,\delta\bigl(\psi - \psi_{kl},\ \varphi - \varphi_{kl},\ t - \tau_{kl}^{(n)}\bigr) \tag{19.3a}$$

$$h_{kl}^{(n)} = H_{kl}^{(n)} e^{j\phi_{kl}} \tag{19.3b}$$

where $L$ is the overall number of spatial-delay multipath components of the channel. Each 
path is characterized by a specific angle of arrival $(\psi, \varphi)$ and delay $\tau$. Parameter $h_{kl}^{(n)}$ is 
the complex coefficient (gain) of the $l$th path of user $k$ at symbol interval with index $n$, 
$\tau_{kl}^{(n)} \in [0, T_m)$ is the delay of the $l$th path component of user $k$ in symbol interval $n$ and $\delta(t)$ 
is the Dirac delta function. We assume that $T_m$ is the delay spread of the channel. In what 
follows, indices $n$ will be dropped whenever this does not produce any ambiguity. It is also 
assumed that $T_m < T$. 



19.1.2 The received signal 



The base station receiver block diagram is shown in Figure 19.1 [31]. The overall received 
signal at the base station site during $N_b$ symbol intervals can be represented as 

$$\begin{aligned} r(t) &= \mathrm{Re}\Bigl\{ e^{j\omega_0 t} \sum_{n=0}^{N_b-1}\sum_{k=1}^{K} s_k^{(n)}(t) * h_k^{(n)}(t) \Bigr\} + \mathrm{Re}\bigl\{ z(t)\, e^{j\omega_0 t} \bigr\} \\ &= \mathrm{Re}\Bigl\{ e^{j\omega_0 t} \sum_{n=0}^{N_b-1}\sum_{k=1}^{K}\sum_{l=1}^{L} a_{kl}\, S_k^{(n)}(t - nT - \tau_k - \tau_{kl}) \Bigr\} + \mathrm{Re}\bigl\{ z(t)\, e^{j\omega_0 t} \bigr\} \end{aligned} \tag{19.4}$$

Figure 19.1 Receiver block diagram. 



where $a_{kl} = A_k T_k(\psi, \varphi) H_{kl}^{(n)} e^{j\Phi_{kl}} = A'_{kl} e^{j\Phi_{kl}}$, $\Phi_{kl} = \phi_0 + \phi_{k0} - \phi_{kl}$, 
$\phi_0$ is the frequency down-conversion phase error, $z(t)$ is a complex zero-mean 
additive white Gaussian noise process with two-sided power spectral density $\sigma^2$ and $\omega_0$ 
is the carrier frequency. In general, in the sequel we will refer to $A'_{kl}$ as the received signal 
amplitude. This amplitude will be further modified by the receiver antenna gain pattern. 
The complex matched filter of user $k$ with receiver antenna pattern $R_k(\psi, \varphi)$ will create two 
correlation functions for each path: 

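$$\begin{aligned} y_{ikl}^{(n)} &= \int_{nT+\tau_k+\tau_{kl}}^{(n+1)T+\tau_k+\tau_{kl}} r(t)\, R_k(\psi, \varphi)\, c_{ik}(t - nT - \tau_k - \tau_{kl}) \cos(\omega_0 t + \hat\Phi_{kl})\, dt \\ &= \sum_{k'}\sum_{l'} A_{k'l'}\bigl[ d_{ik'}\rho_{ik'l',ikl}\cos\varepsilon_{k'l',kl} + d_{qk'}\rho_{qk'l',ikl}\sin\varepsilon_{k'l',kl} \bigr] = \sum_{k'}\sum_{l'} y_{ikl}(k'l') \end{aligned} \tag{19.5}$$

where $A_{k'l'} = A'_{k'l'} R_k(\psi, \varphi)$, parameter $\hat\Phi_{kl}$ is the estimate of $\Phi_{kl}$ and 

$$y_{ikl}(k'l') = y_{iikl}(k'l') + y_{iqkl}(k'l') = A_{k'l'}\bigl[ d_{ik'}\rho_{ik'l',ikl}\cos\varepsilon_{k'l',kl} + d_{qk'}\rho_{qk'l',ikl}\sin\varepsilon_{k'l',kl} \bigr] \tag{19.6}$$

$$\begin{aligned} y_{qkl}^{(n)} &= \int_{nT+\tau_k+\tau_{kl}}^{(n+1)T+\tau_k+\tau_{kl}} r(t)\, R_k(\psi, \varphi)\, c_{qk}(t - nT - \tau_k - \tau_{kl}) \sin(\omega_0 t + \hat\Phi_{kl})\, dt \\ &= \sum_{k'}\sum_{l'} A_{k'l'}\bigl[ d_{qk'}\rho_{qk'l',qkl}\cos\varepsilon_{k'l',kl} - d_{ik'}\rho_{ik'l',qkl}\sin\varepsilon_{k'l',kl} \bigr] = \sum_{k'}\sum_{l'} y_{qkl}(k'l') \end{aligned} \tag{19.7a}$$

$$y_{qkl}(k'l') = y_{qqkl}(k'l') + y_{qikl}(k'l') = A_{k'l'}\bigl[ d_{qk'}\rho_{qk'l',qkl}\cos\varepsilon_{k'l',kl} - d_{ik'}\rho_{ik'l',qkl}\sin\varepsilon_{k'l',kl} \bigr] \tag{19.7b}$$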

where $\rho_{x,y}$ are cross-correlation functions between the corresponding code components $x$ 
and $y$. Each of these components is defined with three indices. Parameter $\varepsilon_{a,b} = \Phi_a - \hat\Phi_b$, 
where $a$ and $b$ are defined with two indices each. In order to receive the incoming signal 
without any losses, the receiving antenna should be directing (pointing) the maximum of 
its radiation diagram towards the angle of arrival of the incoming signal. In this segment we 
will use the following terminology. The direction of the signal arrival is characterized by the 
pointer $p_a = (\psi_a, \varphi_a)$. In order to receive the maximum signal available, the receiver antenna 
pointer should be $p_r = (\psi_r, \varphi_r) = (\psi_a + \pi, \varphi_a + \pi)$. Owing to mobility, the receiver will 
be tracking the incoming signal pointer and the pointer tracking error will be defined as 
$\Delta p = p_r - p_a = (\psi_r - \psi_a, \varphi_r - \varphi_a) = (\Delta\psi, \Delta\varphi)$. Owing to this error the amplitude of the 
received signal will be reduced by $\varepsilon_p$ with respect to the maximum value. These issues will 
be elaborated later. When necessary, we should make a distinction between the amplitude 
seen by the receiver (asr) for a given pointer tracking error $\Delta p$ and the maximum available 
amplitude (maa) obtained when $\Delta p = 0$. Let the vectors of MF output samples for the 
$n$th symbol interval be defined as 



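$$\mathbf{y}_{ik}^{(n)} = \mathcal{L}_L\bigl(y_{ikl}^{(n)}\bigr) = \bigl(y_{ik1}^{(n)}, y_{ik2}^{(n)}, \ldots, y_{ikL}^{(n)}\bigr) \in \mathbb{C}^{L} \tag{19.8a}$$

$$\mathbf{y}_{qk}^{(n)} = \mathcal{L}_L\bigl(y_{qkl}^{(n)}\bigr) = \bigl(y_{qk1}^{(n)}, y_{qk2}^{(n)}, \ldots, y_{qkL}^{(n)}\bigr) \in \mathbb{C}^{L} \tag{19.8b}$$

$$\mathbf{y}_{k}^{(n)} = \mathbf{y}_{ik}^{(n)} + j\mathbf{y}_{qk}^{(n)}; \qquad \mathbf{y}^{(n)} = \mathcal{L}_K\bigl(\mathbf{y}_{k}^{(n)}\bigr) \in \mathbb{C}^{KL}; \qquad \mathbf{y} = \mathcal{L}_{N_b}\bigl(\mathbf{y}^{(n)}\bigr) \in \mathbb{C}^{N_b KL} \tag{19.8c}$$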



19.1.2.1 CDMA system capacity 



The starting point in the evaluation of CDMA system capacity is the parameter $Y_m = E_{bm}/N_0$, 
the received signal energy per symbol per overall noise density in a given reference 
receiver with index $m$. For the purpose of this analysis we can represent this parameter in 
the general case as 

$$Y_m = \frac{E_{bm}}{N_0} = \frac{ST}{I_{oc} + I_{oic} + I_{oin} + \eta_{th}} \tag{19.9}$$

where $I_{oc}$, $I_{oic}$ and $I_{oin}$ are power densities of intracell-, intercell- and overlay-type inter- 
network interference, respectively, and $\eta_{th}$ is the thermal noise power density. $S$ is the overall 
received power of the useful signal and $T = 1/R_b$ is the information bit interval. Contri- 
butions of $I_{oic}$ and $I_{oin}$ to $N_0$ have been discussed in a number of papers [2]. In order to 
minimize repetition in our analysis, we will parameterize this contribution by introduc- 
ing $\eta_0 = I_{oic} + I_{oin} + \eta_{th}$ and concentrate on the analysis of the intracell interference in the 
CDMA network based on advanced receivers using imperfect rake and MAI cancellation. A 
general block diagram of the receiver is shown in Figure 19.1. An extension of the analysis 
to both intercell and internetwork interference is straightforward. 
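
As a rough numerical illustration of Equation (19.9), the following Python sketch evaluates $Y_m$ from the received power and the interference densities. The function name, the argument names and the numbers are hypothetical and chosen only for illustration; they are not taken from the text.

```python
def snr_per_bit(S, R_b, I_oc, I_oic, I_oin, eta_th):
    """Evaluate Y_m = S*T / (I_oc + I_oic + I_oin + eta_th), with T = 1/R_b."""
    T = 1.0 / R_b                     # information bit interval
    eta_0 = I_oic + I_oin + eta_th    # lumped intercell + internetwork + thermal term
    return S * T / (I_oc + eta_0)


# Illustrative values only (arbitrary consistent units).
y_m = snr_per_bit(S=1.0, R_b=1e4, I_oc=2e-5, I_oic=1e-5, I_oin=1e-5, eta_th=1e-5)
print(y_m)
```

Lumping the last three terms into a single `eta_0` mirrors the parameterization used in the text, so that only the intracell term needs to be modeled in detail.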

19.1.3 Multipath channel: near-far effect and power control 

We start with the rejection combiner, which will choose the first multipath signal component 
and reject (suppress) the others. In this case, Equation (19.9) for the I-channel becomes 

$$Y_{ibm} = \frac{\alpha_{iim1}(m1)\, S/R_b}{\{\alpha_{iqm1}(m1) + I(k'l')\}\, S/R_b + \eta_0} = \frac{\alpha_{iim1}(m1)}{\alpha_{iqm1}(m1) + I(k'l') + \eta_0 R_b/S} \tag{19.10}$$

where $\alpha_x(z)$ (for $x = iim1, iqm1, im1$ and $z = m1, k'l'$) is the power coefficient defined 
as $\alpha_x(z) = E_\varepsilon\{|y_x(z)|^2\}/S$, $S$ is the normalized power level of the received signal and parameters 
$y_x(z)$ are in general defined by Equation (19.6). 

$$I(k'l') = \sum_{k'=1}^{K}\ \sum_{\substack{l'=1\ (k' \ne m) \\ l'=2\ (k' = m)}}^{L} \alpha_{im1}(k'l')$$

is the equivalent MAI. $E_\varepsilon\{\ \}$ stands for averaging with respect to the corresponding phases $\varepsilon_{a,b}$ 
defined by Equation (19.7). Based on this, we have 

$$\alpha_{im1}(k'l') = E_\varepsilon\{y_{im1}^2(k'l')\} = A_{k'l'}^2\,\rho_{im1}^2(k'l')/2 \Rightarrow A_{k'l'}^2/2 \Rightarrow \{A_{k'} H_{k'l'} T_{k'}(\psi, \varphi) R_m(\psi, \varphi)\}^2 \tag{19.11}$$

where 

$$\rho_{im1}^2(k'l') = \rho_{ik'l',im1}^2 + \rho_{qk'l',im1}^2, \qquad \rho^2 = E_\rho\bigl[\rho_{ik'l',im1}^2 + \rho_{qk'l',im1}^2\bigr]$$

and normalization $A_{k'l'}^2\rho^2/2 \Rightarrow A_{k'l'}^2/2$. 

A similar equation can be obtained for the Q-channel too. It has been assumed that all 
'interference per path' components are independent. In what follows we will simplify the 
notation by dropping all indices $im1$ so that $\alpha_{im1}(k'l') \Rightarrow \alpha_{kl}$. With no power control (npc) 
$\alpha_{kl}$ will depend only on the channel characteristics. In partial power control (ppc) only the 
first multipath component of the signal is measured and used in a power control (open or 
closed) loop. Full power control (fpc) will normalize all components of the received signal 
and rake power control (rpc) will normalize only those components combined in the rake 
receiver. The rpc option seems to be more feasible because these components are already 
available. These concepts for ideal operation are defined by the following equations 

$$\text{npc} \Rightarrow \alpha_{kl} = \alpha_{kl},\ \forall k, l; \qquad \text{ppc} \Rightarrow \alpha_{k1} = 1,\ \forall k$$

$$\text{fpc} \Rightarrow \sum_{l=1}^{L} \alpha_{kl} = 1,\ \forall k; \qquad \text{rpc} \Rightarrow \sum_{l=1}^{L_0} \alpha_{kl} = 1,\ \forall k \tag{19.12}$$

where $L_0$ is the number of fingers in the rake receiver. The contemporary theory in this field 
does not recognize these options, which causes a lot of misunderstanding and misconceptions 
in the interpretation of the power control problem in the CDMA network. Although fpc is 
not practically feasible, the analysis including fpc should provide the reference results for 
the comparison with other, less efficient options. Another problem in the interpretation of 
the results in the analysis of the power control imperfections is caused by the assumption 
that all users in the network have the same problem with power control. Hence, the imperfect 
power control is characterized with the same variance of the power control error. This is 
a more than pessimistic assumption and yet it has been used very often in analyses published 
so far. The above discussion is based on the signal amplitude seen by the receiver (asr). 
System losses due to the difference between the asr and maa will be discussed in the next 
section. If we now introduce matrix $\boldsymbol{\alpha}_m$ with coefficients $\|\alpha_{kl}\|$, $\forall k, l$, except for $\alpha_{m1} = 0$, 
and use notation $\mathbf{1}$ for the vector of all ones, Equation (19.10) becomes 

Compared with Equation (19.10), the index $i$ is dropped in order to indicate that the same 
form of equation is valid for both the I- and Q-channel defined by Equation (19.8). 
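
The four power control options of Equation (19.12) can be sketched as normalizations of the per-user path power coefficients. The Python fragment below is only an illustration under one stated assumption: power control scales all path powers of a user by a common factor (a change in transmit power), which is our reading and is not spelled out in the text. The function name `normalize_power` and the sample values are hypothetical.

```python
def normalize_power(alpha, mode, L0=None):
    """Apply npc/ppc/fpc/rpc to alpha, a list of per-user lists of path powers alpha_kl.

    Assumption: power control rescales every path of a user by the same factor.
    """
    result = []
    for row in alpha:
        if mode == "npc":        # no power control: channel values unchanged
            scale = 1.0
        elif mode == "ppc":      # partial: first path normalized to 1
            scale = 1.0 / row[0]
        elif mode == "fpc":      # full: all L paths sum to 1
            scale = 1.0 / sum(row)
        elif mode == "rpc":      # rake: the L0 combined paths sum to 1
            scale = 1.0 / sum(row[:L0])
        else:
            raise ValueError(f"unknown power control mode: {mode}")
        result.append([a * scale for a in row])
    return result


alpha = [[0.5, 0.3, 0.2], [2.0, 1.0, 1.0]]   # illustrative values, K = 2, L = 3
print(normalize_power(alpha, "ppc"))
print(normalize_power(alpha, "rpc", L0=2))
```

Keeping the mode as a parameter makes it easy to run the same capacity evaluation under each option and compare the results against the fpc reference case.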



19.1.4 Multipath channel: pointer tracking error, rake receiver 
and interference canceling 

If an $L_0$-finger rake receiver ($L_0 \le L$) with combiner coefficients $w_{mr}$ ($r = 1, 2, \ldots, L_0$) and 
an interference canceller are used, the signal-to-noise ratio will become 

$$Y_b = \frac{r^{(L_0)}}{f(m, \alpha, c, r)K + g_0\,\eta R_b/S} \tag{19.14}$$

where 

$$g_0 = \sum_{r=1}^{L_0} w_{mr}^2 = \mathbf{w}_m \mathbf{w}_m^T; \qquad \mathbf{w}_m = (w_{m1}, w_{m2}, \ldots, w_{mL_0}) \tag{19.15}$$

is due to Gaussian noise processing in the rake receiver, and noise density $\eta_0$ becomes $\eta$ 
due to additional signal processing. Also we have 

$$f(m, \alpha, c, r) = \frac{1}{K}\sum_{\substack{k=1 \\ k \ne m}}^{K}\sum_{r=1}^{L_0}\sum_{l=1}^{L} w_{mr}^2\,\bar\alpha_{kl}(1 - C_{kl}) + \frac{1}{K}\sum_{r=1}^{L_0}\sum_{\substack{l=1 \\ l \ne r}}^{L} w_{mr}^2\,\bar\alpha_{ml}(1 - C_{ml}) = \frac{1}{K}\,\mathbf{w}_m\bigl(\mathbf{1}\cdot\boldsymbol{\alpha}_{cmr}\cdot\mathbf{1}^T\bigr)\mathbf{w}_m^T \tag{19.16}$$

with $\boldsymbol{\alpha}_{cmr}$ being a matrix of size $K \times L$ with coefficients $\|\bar\alpha_{kl}(1 - C_{kl})\|$ except for 
$\bar\alpha_{mr}(1 - C_{mr}) = 0$, and $C_{ml}$ is the efficiency of the canceller. The parameter $\bar\alpha_{kl} = E_{\psi,\varphi}\{\alpha_{kl}\}$ is the 
average value of the interfering signal power $\alpha_{kl}$ coming from direction (azimuth, elevation) 
$(\psi_{kl}, \varphi_{kl}) = (\psi_m + \Delta\psi_{kl}, \varphi_m + \Delta\varphi_{kl})$ with respect to the reference pointer $p(\psi_m, \varphi_m)$ of the 
useful signal. Formally this can be represented as 

$$\bar\alpha_{kl} = \iiiint \alpha_{kl}(\psi, \varphi)\,\mathrm{PDF}(\Delta\psi_{kl}, \Delta\varphi_{kl})\, d\psi\, d\varphi\, d(\Delta\psi_{kl})\, d(\Delta\varphi_{kl}) \tag{19.16a}$$

where $\mathrm{PDF}(\Delta\psi_{kl}, \Delta\varphi_{kl})$ is the probability density function of the arguments. For a first insight 
into the system performance a uniform distribution of $\Delta\psi_{kl}$ and $\Delta\varphi_{kl}$ can be assumed along 
with a rectangular shape of $T(\psi, \varphi)$ and $R(\psi, \varphi)$ in the range $\psi \in \psi_0 + \Psi$, $\varphi \in \varphi_0 + \Phi$, 
so that evaluation of Equation (19.16a) becomes trivial. For the amplitudes $A_{kl}$ seen by the 
receiver (asr), the parameter $r^{(L_0)}$ in Equation (19.14), called rake receiver efficiency, is given 
as 

$$r^{(L_0)} = \Bigl(\sum_{r=1}^{L_0} w_{mr}\cos\varepsilon_{\theta mmr}\sqrt{\alpha_{mr}}\Bigr)^2 = (\mathbf{w}_m \cdot \boldsymbol{\alpha}_{mm})^2 \tag{19.17}$$

with 

$$\boldsymbol{\alpha}_{mm} = \bigl(\cos\varepsilon_{\theta mm1}\sqrt{\alpha_{m1}},\ \cos\varepsilon_{\theta mm2}\sqrt{\alpha_{m2}},\ \ldots\bigr)^T$$

Parameter $\varepsilon_{\theta mmr} = \theta_{mmr} - \hat\theta_{mmr}$ is the carrier phase synchronization error in receiver $m$ 
for the signal of user $m$ in path $r$. We will drop index $mkl$ whenever this does not result in 
any ambiguity. In the sequel we will use the following notation: $\alpha_{kl} = A_{kl}^2/2$, $\hat A_{mkl}$ is the 
estimate of $A_{kl}$ by the receiver $m$, $\varepsilon_a = \Delta A_{mkl}/A_{kl} = (A_{kl} - \hat A_{mkl})/A_{kl}$ is the relative 
amplitude estimation error, $\varepsilon_m$ = BER = bit error rate, which can be represented as $m\hat m = 
1 - 2\,\mathrm{BER}$, and $\varepsilon_\theta$ is the carrier phase estimation error. 

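For the equal gain combiner (EGC) the combiner coefficients are given as $w_{mr} = 1$. 
Having in mind the notation used so far, in the sequel we will drop index $m$ for simplicity. 
For the maximal ratio combiner (MRC) the combiner coefficients are based on estimates as 

$$\hat w_r = \frac{\cos\varepsilon_{\theta r}}{\cos\varepsilon_{\theta 1}}\cdot\frac{\hat A_r}{\hat A_1} \cong \frac{(1 - \varepsilon_{\theta r}^2/2)}{(1 - \varepsilon_{\theta 1}^2/2)}\cdot\frac{A_r(1 - \varepsilon_{ar})}{A_1(1 - \varepsilon_{a1})} \tag{19.18}$$

$$\begin{aligned} E\{\hat w_r\} &= w_r(1 - \sigma_{\theta r}^2)(1 + \sigma_{\theta 1}^2)(1 - \varepsilon_{ar})(1 + \varepsilon_{a1}) \\ E\{\hat w_r^2\} &= w_r^2(1 - 2\sigma_{\theta r}^2 + 3\sigma_{\theta r}^4)(1 + 2\sigma_{\theta 1}^2 - 3\sigma_{\theta 1}^4)(1 - \varepsilon_{ar})^2(1 + \varepsilon_{a1})^2 \end{aligned} \tag{19.19}$$

Averaging Equation (19.17) gives for EGC 

$$\begin{aligned} E\{r^{(L_0)}\} &= E\Bigl\{\Bigl(\sum_{r=1}^{L_0}\cos\varepsilon_{\theta r}\sqrt{\alpha_r}\Bigr)^2\Bigr\} = E\Bigl\{\Bigl(\sum_{r=1}^{L_0}\sqrt{\alpha_r}\,(1 - \varepsilon_{\theta r}^2/2)\Bigr)^2\Bigr\} \\ &= \sum_{r}\sum_{l \ne r}\sqrt{\alpha_r\alpha_l}\,(1 - \sigma_{\theta r}^2)(1 - \sigma_{\theta l}^2) + \sum_{r}\alpha_r(1 - 2\sigma_{\theta r}^2 + 3\sigma_{\theta r}^4) \end{aligned} \tag{19.20}$$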
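
The averaged EGC rake efficiency of Equation (19.20) can be evaluated numerically as sketched below. The sketch assumes that the cross terms carry the factor $\sqrt{\alpha_r\alpha_l}$ and the diagonal terms the factor $\alpha_r$, which is what squaring the sum in Equation (19.17) with $w_r = 1$ gives, and that $\sigma_{\theta r}^2 = E[\varepsilon_{\theta r}^2/2]$ for zero-mean Gaussian phase errors. The function name and the sample numbers are hypothetical.

```python
import math


def egc_mean_efficiency(alpha, sigma2):
    """Mean EGC rake efficiency E{r} for path powers alpha[r] and
    phase-error parameters sigma2[r] = E[eps_theta_r**2 / 2]."""
    n = len(alpha)
    cross = 0.0
    for r in range(n):
        for l in range(n):
            if l != r:  # products of two different fingers
                cross += (math.sqrt(alpha[r] * alpha[l])
                          * (1.0 - sigma2[r]) * (1.0 - sigma2[l]))
    # squared contribution of each finger on its own
    own = sum(a * (1.0 - 2.0 * s + 3.0 * s * s) for a, s in zip(alpha, sigma2))
    return cross + own


alpha = [0.5, 0.3, 0.2]                       # illustrative path powers
ideal = egc_mean_efficiency(alpha, [0.0, 0.0, 0.0])
real = egc_mean_efficiency(alpha, [0.05, 0.05, 0.05])
print(ideal, real)
```

With all phase-error terms set to zero the expression reduces to $(\sum_r \sqrt{\alpha_r})^2$, which is a convenient sanity check on any implementation.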



For MRC the same relation becomes 

' L 0 



£{r (L ">} 



«r (l-4/2) 2 d-£ flr ) \ 2 
/«! (1 - 4/2) (1 - £«,) i 



\tt a, (1-4/2) 2 (l-e a i) 2 j 
+ EE°?( 1 - 24 + 34)d " 24 + 34) 



x (1 + 2a 2 - 3a e 4 ,) (1 - e ar )(\ - s al )(\ + s al f 

In order to evaluate the first term we use limits. For the upper limit we have 4 
using this we have 



(19.21) 

p 2 



4- fi y 



(1 - 4-/2) 4 /(i - 4/2) 2 => (i - 4/2) 2 



and the first term becomes 



E - (! - 2 4 + 3 4) (1 - e-) 2 /d " e.i) 2 



r=l 



For the lower limit we use 4 =^ 4 an d the first term becomes 



La 2 

a 



E — (1 - 24 + 34) (1 - e-) 2 /d " s a ,f 
For a signal with the I- and Q-component the parameter $\cos\varepsilon_{\theta r}$ should be replaced by 
$\cos\varepsilon_{\theta r} \Rightarrow \cos\varepsilon_{\theta r} + m\rho\sin\varepsilon_{\theta r}$, where $m$ is the information in the interfering channel (I or 
Q), and $\rho$ is the cross-correlation between the codes used in the I- and Q-channel. For small 
tracking errors this term can be replaced as $\cos\varepsilon_{\theta r} + m\rho\sin\varepsilon_{\theta r} \approx 1 + m\rho\varepsilon - \varepsilon^2/2$, where 
the notation is further simplified by dropping the subscript $(\ )_{\theta r}$. Similar expressions can be 
derived for the complex signal format. In the above discussion the signal amplitude seen by 
the receiver (asr) is used for $A_r$. In the presence of pointer estimation error this is related 
to the maximum available amplitude (maa) as $A_r \Rightarrow A_r - \varepsilon_p$. So, to account for the losses due to $\varepsilon_p$, parameter $\alpha_r$ in 
the above equation should be replaced by $\alpha_r \Rightarrow E(A_r - \varepsilon_p)^2 = \alpha_r - 2\bar\varepsilon_p\sqrt{\alpha_r} + \sigma_p^2$, where 
$\bar\varepsilon_p$ and $\sigma_p^2 = E(\varepsilon_p^2)$ are the mean and variance of the pointer tracking error. The power 
control will compensate for the pointer tracking losses by increasing the signal power by 
the amount equal to the losses. This means that the level of interference will be increased, 
which can be taken into account in our model by modifying the parameters $\alpha_{kl}$ as follows: 

$$\alpha_{kl} \Rightarrow \alpha_{kl} + 2\bar\varepsilon_p\sqrt{\alpha_{kl}} - \sigma_p^2$$

19.1.5 Interference canceler modeling: nonlinear multiuser detectors 

For the system performance evaluation a model for the canceller efficiency is needed. Linear 
multiuser structures might not be of much interest in the next generation of mobile 
communication systems, where the use of long codes will be attractive. An alternative 
approach is nonlinear (multistage) multiuser detection, which would include channel parameter 
estimation too. This would be based on interference estimation and cancellation schemes 
(OKI standard IS-665/ITU recommendation M.1073 or UMTS defined by ETSI). 

In general, if the estimates of Equation (19.8) are denoted $\hat y_i$ and $\hat y_q$, then the residual 
interference after cancellation can be expressed as 

$$\Delta y_i = y_i - \hat y_i, \quad \Delta y_q = y_q - \hat y_q, \quad \Delta y = \Delta y_i + j\Delta y_q = \mathrm{Vec}\{\Delta y_g\} \tag{19.22}$$

where index $g \Rightarrow k, l$ spans all combinations of $k$ and $l$. By using Equation (19.22), each 
component $\bar\alpha_{kl}(1 - C_{kl})$ in Equation (19.16) can be obtained as a corresponding entry of 
$\mathrm{Vec}\{|\Delta y_g|^2\}$. 

To further elaborate these components we will use a simplified notation and analysis. 
After frequency downconversion and despreading, the signal from user $k$, received through 
path $l$ at the receiver $m$, would have the form 

$$\hat S_{mkl} = \hat A_{mkl}\hat m_k\cos\hat\theta_{mkl} = (A_{mkl} + \Delta A_{mkl})\,\hat m_k\cos(\theta_{mkl} + \varepsilon_{\theta mkl}) \tag{19.23}$$

for a single signal component and 

$$\begin{aligned} \hat S^{i}_{mkl} &= \hat A_{mkl}\hat m_{ki}\cos\hat\theta_{mkl} + \hat A_{mkl}\hat m_{kq}\sin\hat\theta_{mkl} \\ \hat S^{q}_{mkl} &= -\hat A_{mkl}\hat m_{ki}\sin\hat\theta_{mkl} + \hat A_{mkl}\hat m_{kq}\cos\hat\theta_{mkl} \end{aligned} \tag{19.24}$$

for a complex (I&Q) signal structure. In a given receiver $m$, components $\hat S^{i}_{mkl}$ and $\hat S^{q}_{mkl}$ 
correspond to $\hat y_{ikl}$ and $\hat y_{qkl}$. Parameter $A_{mkl}$ includes both amplitude and correlation 
function. In Equation (19.23) $\Delta A_{mkl}$ and $\varepsilon_{\theta mkl}$ are amplitude and phase estimation errors. 
The canceller would create $S_{mkl} - \hat S_{mkl} = \Delta S_{mkl}$ and the power of this residual error (with 
index $m$ dropped for simplicity) would be 



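$$E_\theta\bigl[(\Delta S_{kl})^2\bigr] = E_\theta\bigl[A_{kl}m_k\cos\theta_{kl} - (A_{kl} + \Delta A_{kl})\,\hat m_k\cos(\theta_{kl} + \varepsilon_{\theta kl})\bigr]^2 \tag{19.25}$$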






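where $E_\theta[\ ]$ stands for averaging with respect to $\theta_{kl}$ and $m_k$. Parameter $(\Delta S_{kl})^2$ corresponds 
to $|\Delta y_g|^2$. This can be represented as 

$$E_\theta\bigl[(\Delta S_{kl})^2\bigr] = \alpha_{kl}\bigl[1 + (1 + \varepsilon_a)^2 - 2(1 + \varepsilon_a)(1 - 2\varepsilon_m)\cos\varepsilon_\theta\bigr] \tag{19.26}$$

From this equation we have $1 - C_{kl} = E_\theta[(\Delta S_{kl})^2]/\alpha_{kl}$ and $C_{kl} = 2(1 + \varepsilon_a)(1 - 2\varepsilon_m)
\cos\varepsilon_\theta - (1 + \varepsilon_a)^2$. Expanding $\cos\varepsilon_\theta$ as $1 - \varepsilon_\theta^2/2$ and averaging gives 

$$C_{kl} = 2(1 + \varepsilon_a)(1 - 2\varepsilon_m)(1 - \sigma_\theta^2) - (1 + \varepsilon_a)^2 \tag{19.27}$$

For zero mean $\varepsilon_\theta$, $\sigma_\theta^2 = E[\varepsilon_\theta^2/2]$ is the carrier phase tracking error variance. For the complex 
(I&Q) signal structure cancellation efficiencies in I- and Q-channels can be represented as 

$$\begin{aligned} C^{i}_{kl} &= 4(1 + \varepsilon_a)(1 - 2\varepsilon_m)(1 + \sigma_\theta^2) - 2(1 + \varepsilon_a)^2 - 1 \\ C^{q}_{kl} &= 4(1 + \varepsilon_a)(1 - 2\varepsilon_m)(1 + \sigma_\theta^2) - 2(1 + \varepsilon_a)^2 - 1 \end{aligned} \tag{19.28}$$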

So, in this case the canceller efficiency is expressed in terms of amplitude, phase and data 
estimation errors. These results should now be used for analysis of the impact of large-scale 
channel estimators on overall CDMA network sensitivity. The performance measure of any 
estimator is the parameter estimation error variance, which should be directly used in Equations 
(19.28) for cancellation efficiency and Equations (19.19)-(19.22) for the rake receiver. If 
joint parameter estimation is used, based on the ML criterion, then the Cramer-Rao bound could 
be used for these purposes. For the Kalman type estimator, the error covariance matrix is 
available for each iteration of estimation. If each parameter is estimated independently, 
then for carrier phase and code delay estimation error a simple relation $\sigma_\varepsilon^2 = 1/\mathrm{SNR}_L$ can 
be used, where $\mathrm{SNR}_L$ is the signal-to-noise ratio in the tracking loop. For the evaluation 
of this $\mathrm{SNR}_L$ the noise power is in general given as $N = B_L N_0$. For this case, the noise 
density $N_0$ is approximated as the ratio of the overall interference plus noise power to 
the signal bandwidth. The loop bandwidth will be proportional to $f_D$, where $f_D$ is the 
fading rate (Doppler). The higher the $f_D$, the higher the loop noise bandwidth and the higher the 
equivalent noise power ($N_0 f_D$). If interference cancellation is performed prior to parameter 
estimation, $N_0$ is obtained from $f(\ )$ defined by Equation (19.16). If parameter estimation 
is used without interference cancellation the same $f(\ )$ is used with $C_{kl} = 0$. In addition to 
this 

$$\varepsilon_a = \frac{A - \hat A(1 - \varepsilon_\tau)}{A} = \frac{\Delta A + \hat A\varepsilon_\tau}{A}; \qquad \varepsilon_a \Rightarrow \varepsilon_A + \varepsilon_\tau(1 - \varepsilon_A) \tag{19.29}$$

where $\varepsilon_\tau$ is the code delay estimation error and $\varepsilon_A = (A - \hat A)/A = 1 - \hat A/A$. For nonco- 
herent estimation we have 



where $I_0(\ )$ and $I_1(\ )$ are the zeroth- and first-order Bessel functions, respectively, and $\gamma$ is 
the signal-to-noise ratio. 

19.1.6 Approximations 

If we assume that the channel estimation is perfect ($\varepsilon_a = \varepsilon_\theta = 0$) the parameter $C_{mkl}$ 
becomes $C_{kl} = 2(1 - 2\varepsilon_m) - 1 = 1 - 4\varepsilon_m$. For DPSK modulation $\varepsilon_m = (1/2)\exp(-\gamma)$, 
where $\gamma$ is the signal-to-noise ratio, and for CPSK $\varepsilon_m = (1/2)\,\mathrm{erfc}(\sqrt{\gamma})$. So $C_{kl} = 1 - 2e^{-\gamma}$ for 
DPSK, and $C_{kl} = 1 - 2\,\mathrm{erfc}(\sqrt{\gamma})$ for CPSK. For large $\gamma$, $C_{kl} \Rightarrow 1$, and for small $\gamma$ in a DPSK 
system we have $e^{-\gamma} \approx 1 - \gamma$ and $C_{kl} = 2\gamma - 1$. This can be presented as $C_{kl} = 2Y_b - 1$, 
where $Y_b$ is given by Equation (19.14). Bearing in mind that $Y_b$ depends on $C_{kl}$, the whole 
equation can be solved through an iterative procedure starting with an initial value of 
$C_{kl} = 0$, $\forall m, k, l$. Similar approximations can be obtained for $\sigma_\theta$ and $\varepsilon_a$. From a practical 
point of view an attractive solution could be a scheme that would estimate and cancel only 
the strongest interference (e.g. successive interference cancellation schemes [2]). 
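
The canceller efficiency of Equation (19.27) and its perfect-estimation limit can be checked numerically with the short Python sketch below. It is a sketch under stated assumptions: the DPSK bit error rate is taken as $\varepsilon_m = (1/2)e^{-\gamma}$, the form consistent with $C_{kl} = 1 - 2e^{-\gamma}$, and all function names are hypothetical.

```python
import math


def canceller_efficiency(eps_a, eps_m, sigma2_theta):
    """C_kl = 2(1+eps_a)(1-2*eps_m)(1-sigma2_theta) - (1+eps_a)**2 (Eq. 19.27)."""
    return (2.0 * (1.0 + eps_a) * (1.0 - 2.0 * eps_m) * (1.0 - sigma2_theta)
            - (1.0 + eps_a) ** 2)


def dpsk_ber(gamma):
    """Assumed DPSK bit error rate, 0.5*exp(-gamma)."""
    return 0.5 * math.exp(-gamma)


def cpsk_ber(gamma):
    """CPSK bit error rate, 0.5*erfc(sqrt(gamma))."""
    return 0.5 * math.erfc(math.sqrt(gamma))


gamma = 2.0  # illustrative signal-to-noise ratio
print(canceller_efficiency(0.0, dpsk_ber(gamma), 0.0))   # perfect channel estimation
print(canceller_efficiency(0.05, cpsk_ber(gamma), 0.02)) # with estimation errors
```

In an iterative evaluation one would start from $C_{kl} = 0$, compute $Y_b$, update the bit error rate and the efficiency with functions like these, and repeat until the values settle.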



19.1.7 Outage probability 

The previous section already completely defines the simulation scenario for the system 
performance analysis. For the numerical analysis further assumptions and specifications 
are necessary. First of all we need the channel model. The exponential multipath intensity 
profile (MIP) is a widely used analytical model realized as a tapped delay line [2]. It is very 
flexible in modeling different propagation scenarios. The decay of the profile and the number 
of taps in the model can vary. Averaged power coefficients in the multipath intensity profile 
are 

$$\alpha_l = \alpha_0 e^{-\lambda l}, \quad \lambda \ge 0 \tag{19.31}$$

where $\lambda$ is the decay parameter of the profile. Power coefficients should be normalized as 
$\sum_{l=0}^{L-1} \alpha_0 e^{-\lambda l} = 1$. For $\lambda = 0$ the profile will be flat. The number of resolvable paths depends 
on the channel chip rate and this must be taken into account. We will start from Equation 
(19.14) and look for the average system performance for $\rho^2 = 1/G$, where $G = W/R_b$ is 
the system processing gain and $W$ is the system bandwidth (chip rate). The average signal- 
to-noise ratio will be expressed as 

$$\bar Y_b = \frac{r^{(L_0)} G}{f(\alpha)K + g_0\,\eta W/S} \tag{19.32}$$
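
The normalized exponential multipath intensity profile of Equation (19.31) can be generated as in the following Python sketch. It assumes the taps are indexed $l = 0, \ldots, L-1$; the function name `exponential_mip` is hypothetical.

```python
import math


def exponential_mip(L, lam):
    """Return L tap powers alpha_l = alpha_0*exp(-lam*l), l = 0..L-1, summing to 1."""
    weights = [math.exp(-lam * l) for l in range(L)]
    alpha_0 = 1.0 / sum(weights)          # normalization constant
    return [alpha_0 * w for w in weights]


print(exponential_mip(4, 0.0))   # flat profile: four equal taps
print(exponential_mip(4, 1.0))   # decaying profile
```

Treating the decay parameter and the number of taps as inputs reflects the flexibility of the model noted in the text: the same routine covers flat and strongly decaying profiles.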

Now, if we accept some quality of transmission, BER = $10^{-e}$, that can be achieved with the 
given SNR = $Y_0$, then with the equivalent average interference density $\eta_0 = I_{oic} + I_{oin} + \eta_{th}$ 
the SNR will be 

$$Y_0 = \frac{r^{(L_0)} G}{\eta_0} \tag{19.33}$$

To evaluate the outage probability $P_{out}$ we need to evaluate [2] 

$$P_{out} = \Pr(\mathrm{BER} > 10^{-e}) = \Pr\Bigl(\mathrm{MAI} + \frac{\eta W}{S} > \eta_0\Bigr) = \Pr\Bigl(\mathrm{MAI} > \eta_0 - \frac{\eta W}{S}\Bigr) = \Pr(\mathrm{MAI} > \delta) \tag{19.34}$$

where $\delta$ is given as 

$$\delta = \frac{r^{(L_0)} G}{Y_0} - \frac{\eta W}{S}$$

It can be shown that this outage probability can be represented by the Gaussian integral 

$$P_{out} = Q\Bigl(\frac{\delta - m_g}{\sigma_g}\Bigr) \tag{19.35}$$

where $m_g$ and $\sigma_g$ are the mean value and the standard deviation, respectively, of the overall 
interference. From Equation (19.32) we have for the system capacity $K$ with ideal system 
components 

$$K_{max} = \frac{r_0^{(L_0)} G}{Y_0 f_0(\alpha)} - \frac{g_0\,\eta W}{S f_0(\alpha)} \tag{19.36}$$

Owing to imperfections in the operation of the rake receiver and interference canceller, this 
capacity will be reduced to 

$$K' = \frac{r^{(L_0)} G}{Y_0 f(\alpha)} - \frac{g_0\,\eta W}{S f(\alpha)} \tag{19.37}$$

where $r_0^{(L_0)}$ and $f_0(\alpha)$ are now replaced by real parameters $r^{(L_0)}$ and $f(\alpha)$ that take into 
account those imperfections. The system sensitivity function is defined as 

$$\frac{K_{max} - K'}{K_{max}} = \frac{1}{K_{max}}\Bigl[\frac{G\,\Delta r^{(L_0)}}{Y_0 f(\alpha) f_0(\alpha)} - \frac{g_0\,\eta W\,\Delta f(\alpha)}{S f(\alpha) f_0(\alpha)}\Bigr] \tag{19.38}$$

where $\Delta r^{(L_0)} = r_0^{(L_0)} f(\alpha) - r^{(L_0)} f_0(\alpha)$ and $\Delta f(\alpha) = f(\alpha) - f_0(\alpha)$. 
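
The capacity expressions (19.36) and (19.37), the sensitivity function (19.38) and the Gaussian outage approximation (19.35) can be put together in a few lines of Python. This is only a sketch: the capacity is obtained by solving Equation (19.32) for $K$ at $\bar Y_b = Y_0$, the Gaussian integral is taken to be the standard $Q$-function, and every name and number below is hypothetical and illustrative.

```python
import math


def capacity(r, f, G, Y0, g0, eta, W, S):
    """Solve Y0 = r*G / (f*K + g0*eta*W/S) for the number of users K."""
    return r * G / (Y0 * f) - g0 * eta * W / (S * f)


def sensitivity(K_max, K_real):
    """Relative capacity loss (K_max - K') / K_max."""
    return (K_max - K_real) / K_max


def outage_probability(delta, m_g, sigma_g):
    """Gaussian approximation Q((delta - m_g)/sigma_g) of Pr(MAI > delta)."""
    return 0.5 * math.erfc((delta - m_g) / (sigma_g * math.sqrt(2.0)))


# Illustrative numbers: ideal (r0, f0) versus degraded (r, f) receiver parameters.
K_max = capacity(r=4.0, f=1.0, G=256, Y0=2.0, g0=4.0, eta=0.0, W=1.0, S=1.0)
K_real = capacity(r=2.0, f=1.25, G=256, Y0=2.0, g0=4.0, eta=0.0, W=1.0, S=1.0)
print(K_max, K_real, sensitivity(K_max, K_real))
print(outage_probability(delta=1.0, m_g=1.0, sigma_g=2.0))
```

Computing the two capacities separately and then forming the ratio avoids having to code the expanded right-hand side of Equation (19.38), and gives the same value.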



19.1.7.1 Performance example 

In this section we present some numerical results for illustration purposes. The results are 
obtained for a channel with double exponential (space and delay) profile with decay factors 
$\lambda_s$ and $\lambda_t$. Graphical results are presented with: solid line, 4 × 4 rake; dashed line, 4 × 1 rake. 
In the case of a 3 dB approximation of the real antenna, the beamforming is approximated 
by the rectangular shape of the antenna pattern in the range $\psi_{3\mathrm{dB}} = 30°$ (for $p_c = 1$), and 
uniform distribution of $\Delta\psi_{kl}$ and $\Delta\varphi_{kl}$ ($[0, \pi]$, $[0, 2\pi]$, respectively), $\lambda_s = \lambda_t = 0$ if not 
specified otherwise, $Y_0 = 2$, $L = 4$, interference margin SNR = (20*a0/alpha_mean)$^{-1}$. 

The users are uniformly distributed in the sphere $(\varphi, \psi)$ with $\varphi, \psi \in (0, 360°)$. The 
results can be easily scaled down for a more realistic scenario where $\varphi \in (0, 360°)$ and 
$\psi \in (\psi_{min}, \psi_{max})$. The canceling efficiency for maximum capacity is calculated by as- 
suming no estimation errors in Equation (19.27). When estimation errors are included, 
canceling efficiencies are given by Equation (19.27). Carrier phase tracking error variance 
is assumed to be $\sigma_\theta^2 = 1/\mathrm{SNR}_L$. For MRC the amplitude estimation error is approximated 
from Equation (19.30) to follow $\varepsilon_a = 1/(4\,\mathrm{SNR}_L)$. For real antennas the amplitude patterns 
are specified in Figure 19.2. The base station amplitude antenna pattern is given as 

M<p, f) = ex P I ~ n \p c sin ^ cos (f ~ 2jt ~Kf) I 








Figure 19.2 Peak-amplitude pattern $A(\varphi, \psi)$ for nonsinusoidal Gaussian pulses received 
by the circular array with $N = 4$ elements and (a) $p_c = 1$; (b) $p_c = 3$; (c) 
$p_c = 6$; (d) $p_c = 12$. 



and four examples cover a relatively wide range of shapes, from a wide lobe for $p_c = 1$ 
to a rather narrow lobe for $p_c = 12$. At mobile stations an omnidirectional antenna pattern 
$A(\varphi, \psi) = 1$ is assumed. Capacity curves, defined as the number of users with data rate 
R = chip rate/G = 4.096/G, for different antenna patterns, are shown in Figure 19.3. The 
highest capacity is obtained for $p_c = 12$ because with the narrowest lobe the spatial division 
multiple access effect is the most effective. With increased receiver velocity the capacity will 
be reduced due to the increased effect of imperfections. A comparison of the systems using 
ideal and real antennas is shown in Figure 19.4. Figure 19.4 presents the system capacity 
vs G. In general higher G means more users in the network and more MAI, resulting in more 
impact of imperfections. A 4 × 4 rake performs better for lower G but for higher G (more 
users) it deteriorates faster. The degradation is more severe for higher receiver velocities. 

Figure 19.3 Capacity vs receiver velocity for EGC for different antenna patterns 
$A(\varphi, \psi)$. $N = 4$, 4 × 4 rake, $G = 256$, $\lambda_t = \lambda_s = 0$, $Y_0 = 2$, $L = 4$, SNR = 
(20*a0/alpha_mean)$^{-1}$. 




Figure 19.4 Capacity vs processing gain for EGC. a, $v_{rec}$ = 200 km/h; b, $v_{rec}$ = 150 km/h; 
c, $v_{rec}$ = 100 km/h; d, $v_{rec}$ = 50 km/h. Solid line: 4 × 4 rake; dashed line: 
4 × 1 rake. A, real antenna pattern of circular array at base station; B, 3 dB 
approximation ($\psi_{3\mathrm{dB}} = 30°$) of the real antenna pattern; $p_c = 1$; $N = 4$, $\lambda_s = 
\lambda_t = 0$, $Y_0 = 2$, $L = 4$, SNR = (20*a0/alpha_mean)$^{-1}$. 

Figure 19.5 represents the same results as a function of the receiver velocity. The system 
sensitivity function defined by Equation (19.38) is shown in Figure 19.6. Sensitivity equal to 
1 means that all capacity has been lost due to imperfections. Figure 19.6 demonstrates that 
very high values for the system sensitivity, even in the range close to 0.9, can be expected 
if a large number of users (low data rate corresponding to high G) are in the network. 

In this section we have presented a systematic analytical framework for the capacity 
evaluation of an advanced CDMA network. This approach provides a relatively simple way 
to specify the required quality of a number of system components. This includes multiple ac- 
cess interference canceller and rake receiver, taking into account all their imperfections. The 




Figure 19.5 Capacity vs the receiver velocity for EGC; solid lines, 4 × 4 rake; dashed 
lines, 4 × 1 rake. A, real antenna; B, 3 dB approximation of the real antenna. 
$p_c = 1$; $N = 4$, $\lambda_t = \lambda_s = 0$, $Y_0 = 2$, $L = 4$, SNR = (20*a0/alpha_mean)$^{-1}$. 
(a) a, G = 256; b, G = 160; (b) a, G = 80; b, G = 48; c, G = 40. 



system performance measure is the network sensitivity function representing the relative 
losses in capacity due to all imperfections in the system implementation. Some numerical 
examples are presented for illustration purposes. These results are obtained for a channel 
with double exponential (space and delay) profile. It was shown that for receiver velocities of 
100-200 km/h as much as 70-90 % of the system capacity can be lost due to the imperfec- 
tions of the three-dimensional rake receiver and interference cancellation operation. A variety 
of results is presented for different channel decay factors, fading rates and numbers of rake 
fingers. In general, under ideal conditions, the system capacity is increased if the number 
of fingers is increased. At the same time one should be aware that the system sensitivity 

Figure 19.6 Sensitivity vs processing gain for EGC. a, $v_{rec}$ = 200 km/h; b, $v_{rec}$ = 150 
km/h; c, $v_{rec}$ = 100 km/h; d, $v_{rec}$ = 50 km/h. Solid line, 4 × 4 rake; dashed 
line, 4 × 1 rake. A, real antenna pattern of circular array at base station; B, 3 dB 
approximation ($\psi_{3\mathrm{dB}} = 30°$) of the real antenna pattern. $p_c = 1$; $N = 4$, $\lambda_s = 
\lambda_t = 0$, $Y_0 = 2$, $L = 4$, SNR = (20*a0/alpha_mean)$^{-1}$. 

is also increased if the fading rate and number of rake fingers are higher. The results and 
methodology presented in this section offer enough tools and data for the careful choice of 
the system parameters in realistic environments which are characterized by imperfections. 



19.2 CAPACITY OF AD HOC NETWORKS 

In this section we discuss the capacity of wireless networks. The discussion is based 
on concepts presented in References [32, 33]. In an ad hoc network, it is supposed that 
$n$ nodes are located in a region of area 1 m². Each node can transmit at $W$ b/s over a 
common wireless channel. The channel in general may be broken up into several subchan- 
nels of capacity $W_1, W_2, \ldots, W_M$ b/s. This will be immaterial for the final results as long 
as $\sum_{m=1}^{M} W_m = W$. Packets are sent from node to node in a multihop fashion until they 
reach their final destination. They can be buffered at intermediate nodes while awaiting 
transmission. Owing to spatial separation, several nodes can make wireless transmissions 
simultaneously, provided there is no excessive interference from others. In the sequel we will 
discuss the conditions under which a wireless transmission over a subchannel is received 
successfully by its intended recipient. 

Two types of networks are considered: arbitrary networks, where the node locations, 
destinations of sources and traffic demands are all arbitrary, and random networks, where 
the nodes and their destinations are randomly chosen. 

19.2.1 Arbitrary networks 

In the arbitrary setting we suppose that $n$ nodes are arbitrarily located in a disk of unit area 
in the plane. Each node chooses an arbitrary destination to which it wishes to send traffic at 
an arbitrary rate, so that the traffic pattern is arbitrary too. Each node can choose an arbitrary 
range or power level for each transmission. 

To define when a transmission is received successfully by its intended recipient we will 
allow for two possible models for successful reception of a transmission over one hop, 
called the protocol model and the physical model. Let $X_i$ denote the location of a node; we 
will also use $X_i$ to refer to the node itself. 

19.2.1.1 The protocol model 

Suppose node $X_i$ transmits over the $m$th subchannel to a node $X_j$. In this case the transmission 
is successfully received by node $X_j$ if $|X_k - X_j| \ge (1 + \Delta)|X_i - X_j|$ for every other node 
$X_k$ simultaneously transmitting over the same subchannel. Here a guard 
zone $\Delta > 0$ is specified by the protocol to prevent a neighboring node from transmitting 
on the same subchannel at the same time. 

19.2.1.2 The physical model 

Let $\{X_k; k \in \mathcal{T}\}$ be the subset of nodes simultaneously transmitting at some time instant 
over a certain subchannel. Let $P_k$ be the power level chosen by node $X_k$ for $k \in \mathcal{T}$. In 
this case the transmission from a node $X_i$, $i \in \mathcal{T}$, is successfully received by a node $X_j$ if 
$S/I \ge \beta$, where 

$$S = P_i / |X_i - X_j|^{\alpha}$$

and 

$$I = N + \sum_{k \in \mathcal{T},\, k \ne i} P_k / |X_k - X_j|^{\alpha}$$

This models a situation where a minimum signal-to-interference ratio (SIR) of $\beta$ is necessary 
for successful reception, the ambient noise power level is $N$, and signal power decays with 
distance $r$ as $1/r^{\alpha}$. For a model outside a small neighborhood of the transmitter, $\alpha > 2$. 
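
The two reception rules can be sketched in a few lines of Python. This is only an illustrative sketch: the function names, the tuple representation of node positions and the numerical values in the usage note are assumptions made for this example and do not come from References [32, 33].

```python
import math

def protocol_ok(tx, rx, other_txs, delta):
    """Protocol model: reception succeeds if every other simultaneous
    transmitter is at least (1 + delta) times farther from the receiver
    than the intended transmitter."""
    d = math.dist(tx, rx)
    return all(math.dist(k, rx) >= (1.0 + delta) * d for k in other_txs)

def physical_ok(tx, rx, p_tx, interferers, noise, alpha, beta):
    """Physical model: reception succeeds if S / I >= beta, where
    interferers is a list of (position, power) pairs for all other
    simultaneous transmitters on the same subchannel."""
    signal = p_tx / math.dist(tx, rx) ** alpha
    interference = noise + sum(p / math.dist(pos, rx) ** alpha
                               for pos, p in interferers)
    return signal / interference >= beta
```

For example, with a transmitter at (0, 0), its receiver at (1, 0) and one interferer at (3, 0), the protocol rule holds for a guard zone of 0.5 but fails for 1.5, since the interferer is only twice as far from the receiver as the intended transmitter.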

19.2.1.3 The transport capacity of arbitrary networks 

In this context we say that the network transports one bit-meter (b-m) when 1 b has been 
transported a distance of 1 m towards its destination. (We do not give multiple credit for 
the same bit carried from one source to several different destinations as in the multicast or 
broadcast cases.) This sum of products of bits and the distances over which they are carried 
is a valuable indicator of a network's transport capacity $C_T$. (It should be noted that, when 
the area of the domain is $A$ square meters rather than the normalized 1 m$^2$, all the 
transport capacity results presented below should be scaled by $\sqrt{A}$.) We use the notation 
$f(n) = \Theta[g(n)]$ when $f(n) = O[g(n)]$ as well as $g(n) = O[f(n)]$. We will show later in 
the section that the transport capacity of an arbitrary network under the protocol model 
is $C_T = \Theta(W\sqrt{n})$ b-m/s if the nodes are optimally placed, the traffic pattern is optimally 
chosen, and the range of each transmission is chosen optimally. An upper bound is 
$C_T = \sqrt{8/\pi}\,(W/\Delta)\sqrt{n}$ b-m/s for every arbitrary network for all spatial and temporal 
scheduling strategies, while $C_T = Wn/[(1 + 2\Delta)(\sqrt{n} + \sqrt{8\pi})]$ b-m/s (for 
$n$ a multiple of four) can be achieved when the nodes and traffic patterns are appropriately 
chosen, and the ranges and schedules of transmissions are appropriately chosen. 

If this transport capacity were to be equally divided between all the $n$ nodes, then each 
node would obtain $\Theta(W/\sqrt{n})$ b-m/s. If, further, each source has its destination about the 
same distance of 1 m away, then each node would obtain a throughput capacity of $\Theta(W/\sqrt{n})$ 
b/s. 

The upper bound on transport capacity does not depend on the transmissions being 
omnidirectional, as implied by Equation (19.1), but only on the presence of some dispersion 
in the neighborhood of the receiver. It will be shown later in the section that, for the 
physical model, $cW\sqrt{n}$ b-m/s is feasible, while $c'Wn^{(\alpha-1)/\alpha}$ b-m/s is not, for appropriate 
$c$, $c'$. Specifically, 

$$C_T = \frac{Wn}{(\sqrt{n} + \sqrt{8\pi})\left\{16\beta\left[2^{\alpha/2} + 6^{\alpha-2}/(\alpha - 2)\right]\right\}^{1/\alpha}} \ \text{b-m/s (for $n$ a multiple of 4)}$$

is feasible when the network is appropriately designed, while an upper bound is 

$$C_T = \left[(2\beta + 2)/\beta\right]^{1/\alpha} \frac{1}{\sqrt{\pi}}\, W n^{(\alpha-1)/\alpha} \ \text{b-m/s}$$

It is suspected that an upper bound of order $\Theta(W\sqrt{n})$ b-m/s may actually hold. In the 
special case where the ratio $P_{\max}/P_{\min}$ between the maximum and minimum powers that 
transmitters can employ is bounded above by $\beta$, an upper bound is in fact 

$$C_T = \frac{W\sqrt{8n/\pi}}{(\beta P_{\min}/P_{\max})^{1/\alpha} - 1} \ \text{b-m/s}$$



Both bounds suggest that transport capacity improves when $\alpha$ is larger, i.e. when the signal 
power decays more rapidly with distance. 
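
As a rough numerical illustration of the protocol-model results quoted above, the following Python sketch evaluates the upper bound and the achievable transport capacity for given $n$, $W$ and $\Delta$. The function names and the optional `area` argument (which applies the $\sqrt{A}$ scaling) are assumptions made for this example.

```python
import math

def protocol_upper_bound(n, W, delta, area=1.0):
    """Upper bound sqrt(8/pi) * (W/delta) * sqrt(n) b-m/s, scaled by sqrt(area)."""
    return math.sqrt(8.0 / math.pi) * (W / delta) * math.sqrt(n) * math.sqrt(area)

def protocol_achievable(n, W, delta, area=1.0):
    """Achievable W*n / [(1 + 2*delta)(sqrt(n) + sqrt(8*pi))] b-m/s
    (stated for n a multiple of four), scaled by sqrt(area)."""
    denom = (1.0 + 2.0 * delta) * (math.sqrt(n) + math.sqrt(8.0 * math.pi))
    return W * n / denom * math.sqrt(area)

def per_node_share(n, W, delta):
    """Equal share of the upper bound per node; decays like W / sqrt(n)."""
    return protocol_upper_bound(n, W, delta) / n
```

Quadrupling $n$ doubles the upper bound but halves the per-node share, which is the $\Theta(W/\sqrt{n})$ behavior described in the text.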



19.2.2 Random networks 

In this case, $n$ nodes are randomly located, i.e. independently and uniformly distributed, 
either on the surface $S^2$ of a three-dimensional sphere of area 1 m$^2$, or in a disk of area 1 m$^2$ 
in the plane. The purpose in studying $S^2$ is to separate edge effects from other phenomena. 
Each node has a randomly chosen destination to which it wishes to send $\lambda(n)$ b/s. The 
destination for each node is independently chosen as the node nearest to a randomly located 
point, i.e. uniformly and independently distributed. (Thus destinations are on the order 
of 1 m away on average.) All transmissions employ the same nominal range or power 
(homogeneous nodes). As for arbitrary networks, both a protocol model and a physical 
model are considered. 



19.2.2.1 The protocol model 

All nodes employ a common range $r$ for all their transmissions. When node $X_i$ transmits 
to a node $X_j$ over the $m$th subchannel, this transmission is successfully received by $X_j$ 
if $|X_i - X_j| \le r$ and, for every other node $X_k$ simultaneously transmitting over the same 
subchannel, $|X_k - X_j| \ge (1 + \Delta)r$. 

19.2.2.2 The physical model 

All nodes choose a common power level $P$ for all their transmissions. Let $\{X_k; k \in \mathcal{T}\}$ be the 
subset of nodes simultaneously transmitting at some time instant over a certain subchannel. 
A transmission from a node $X_i$, $i \in \mathcal{T}$, is successfully received by a node $X_j$ if $S/I \ge \beta$, 
where 

$$S = P / |X_i - X_j|^{\alpha}$$

and 

$$I = N + \sum_{k \in \mathcal{T},\, k \ne i} P / |X_k - X_j|^{\alpha}$$

19.2.2.3 The throughput capacity of random networks 

The throughput is defined in the usual manner as the time average of the number of bits per 
second that can be transmitted by every node to its destination. 

19.2.2.4 Feasible throughput 

A throughput of $\lambda(n)$ b/s for each node is feasible if there is a spatial and temporal scheme 
for scheduling transmissions, such that by operating the network in a multihop fashion and 
buffering at intermediate nodes when awaiting transmission, every node can send $\lambda(n)$ b/s 
on average to its chosen destination node. That is, there is a $T < \infty$ such that in every time 
interval $[(i - 1)T, iT]$ every node can send $T\lambda(n)$ b to its corresponding destination node. 

19.2.2.5 The throughput capacity of random wireless networks 

We say that the throughput capacity of the class of random networks is of order $\Theta[f(n)]$ 
b/s if there are deterministic constants $c > 0$ and $c' < +\infty$ such that 

$$\lim_{n \to \infty} \mathrm{Prob}\,[\lambda(n) = c f(n) \text{ is feasible}] = 1$$

$$\liminf_{n \to \infty} \mathrm{Prob}\,[\lambda(n) = c' f(n) \text{ is feasible}] < 1$$

It will be shown in the next section that, in the case of both the surface of the sphere and 
a planar disk, the order of the throughput capacity is $\lambda(n) = \Theta[W/\sqrt{n \log n}]$ b/s for the 
protocol model. For the upper bound, for some $c'$, 

$$\lim_{n \to \infty} \mathrm{Prob}\,[\lambda(n) = c' W/\sqrt{n \log n} \text{ is feasible}] = 0$$

Specifically, there are deterministic constants $c''$ and $c'''$ not depending on $n$, $\Delta$ or $W$, such 
that $\lambda(n) = c''W/[(1 + \Delta)^2 \sqrt{n \log n}]$ b/s is feasible, and $\lambda(n) = c'''W/[\Delta^2 \sqrt{n \log n}]$ 
b/s is infeasible, both with probability approaching 1 as $n \to \infty$. 

It will also be shown that, for the physical model, a throughput of $\lambda(n) = cW/\sqrt{n \log n}$ 
b/s is feasible, while $\lambda(n) = c'W/\sqrt{n}$ b/s is not, for appropriate $c$, $c'$, both with 
probability approaching 1 as $n \to \infty$. Specifically, there are deterministic constants $c''$ and 
$c'''$ not depending on $n$, $N$, $\alpha$, $\beta$ or $W$, such that 

$$\lambda(n) = \frac{c''W}{\sqrt{n \log n}\,\left(2\left\{c'''\beta\left[3 + 1/(\alpha - 1) + 2/(\alpha - 2)\right]\right\}^{1/\alpha} - 1\right)^2} \ \text{b/s}$$

is feasible with probability approaching 1 as $n \to \infty$. If $L$ is the mean distance between 
two points independently and uniformly distributed in the domain (either surface of sphere 
or planar disk of unit area), then there is a deterministic sequence $\varepsilon(n) \to 0$, not depending 
on $N$, $\alpha$, $\beta$ or $W$, such that 

$$\sqrt{\frac{8}{\pi n}}\; \frac{W\,[1 + \varepsilon(n)]}{L\,(\beta^{1/\alpha} - 1)} \ \text{b/s}$$

is infeasible with probability approaching 1 as $n \to \infty$. 



19.2.3 Arbitrary networks: an upper bound on transport capacity 

The following assumptions for the setting on a planar disk of unit area are used [32, 33]: 

(a1) There are $n$ nodes arbitrarily located in a disk of unit area on the plane. 

(a2) The network transports $\lambda nT$ b over $T$ s. 

(a3) The average distance between the source and destination of a bit is $L$. Together 
with (a2), this implies that a transport capacity of $\lambda nL$ b-m/s is achieved. 

(a4) Each node can transmit over any subset of $M$ subchannels with capacities $W_m$ b/s, 
$1 \le m \le M$, where $\sum_{m=1}^{M} W_m = W$. 

(a5) Transmissions are slotted into synchronized slots of length $\tau$ s. (This assumption 
can be eliminated, but makes the exposition easier.) 

(a6) Definitions of physical model and protocol model from the previous section are 
used. 

While retaining the restriction for the case of the physical model, we can either retain the 
restriction in the protocol model or consider an alternative restriction as follows: if a node 
$X_i$ transmits to another node $X_j$ located at a distance of $r$ units on a certain subchannel in 
a certain slot, then there can be no other receiver within a radius of $\Delta r$ around $X_j$ on the 
same subchannel in the same slot. This alternative restriction addresses situations where the 
transmissions are not omnidirectional, but there is still some dispersion in the neighborhood 
of the receiver. Under the above assumptions the following results for the transport capacity 
have been obtained [32, 33]: 

(r1) In the protocol model, the transport capacity $\lambda nL$ is bounded as: 
$\lambda nL \le \sqrt{8/\pi}\,(W/\Delta)\sqrt{n}$ b-m/s. 

(r2) In the physical model, $\lambda nL \le [(2\beta + 2)/\beta]^{1/\alpha}\,(1/\sqrt{\pi})\,W n^{(\alpha-1)/\alpha}$ b-m/s. 

(r3) If the ratio $P_{\max}/P_{\min}$ between the maximum and minimum powers that 
transmitters can employ is strictly bounded above by $\beta$, then 
$\lambda nL \le W\sqrt{8n/\pi}\,[(\beta P_{\min}/P_{\max})^{1/\alpha} - 1]^{-1}$ b-m/s. 

(r4) When the domain is of $A$ square meters rather than 1 m$^2$, then all the upper bounds 
above are scaled by $\sqrt{A}$. 

To prove the above results, consider bit $b$, where $1 \le b \le \lambda nT$. Let us suppose that it moves 
from its origin to its destination in a sequence of $h(b)$ hops, where the $h$th hop traverses a 
distance of $r_b^h$. Then from (a3) we have 

$$\sum_{b=1}^{\lambda nT} \sum_{h=1}^{h(b)} r_b^h \ge \lambda nTL \qquad (19.39)$$

Having in mind that in any slot at most $n/2$ nodes can transmit, for any subchannel $m$ and 
any slot $s$ we have 

$$\sum_{b=1}^{\lambda nT} \sum_{h=1}^{h(b)} 1(\text{the $h$th hop of bit $b$ is over subchannel $m$ in slot $s$}) \le \frac{W_m \tau n}{2}$$

Summing over the subchannels and the slots, and noting that there can be no more than $T/\tau$ 
slots in $T$ s, yields 

$$H := \sum_{b=1}^{\lambda nT} h(b) \le \frac{WTn}{2} \qquad (19.40)$$

From the triangle inequality and (a6), for the protocol model, where $X_j$ is receiving a 
transmission from $X_i$ over the $m$th subchannel and at the same time $X_\ell$ is receiving a 
transmission from $X_k$ over the same subchannel, we have 

$$|X_j - X_\ell| \ge |X_j - X_k| - |X_\ell - X_k| \ge (1 + \Delta)|X_i - X_j| - |X_\ell - X_k|$$

Similarly, 

$$|X_\ell - X_j| \ge (1 + \Delta)|X_k - X_\ell| - |X_j - X_i|$$

Adding the two inequalities, we obtain 

$$|X_\ell - X_j| \ge \frac{\Delta}{2}\left(|X_k - X_\ell| + |X_i - X_j|\right)$$

This means that disks of radius $\Delta/2$ times the lengths of hops, centered at the receivers over 
the same subchannel in the same slot, are essentially disjoint. This conclusion also directly 
follows from the alternative restriction in (a6). Allowing for edge effects where a node is 
near the periphery of the domain, and noting that a range greater than the diameter of the 
domain is unnecessary, we see that at least a quarter of such a disk is within the domain. 
Having in mind that at most $W_m\tau$ bits can be carried in slot $s$ from a transmitter to a receiver 
over the $m$th subchannel, we have 

$$\sum_{b=1}^{\lambda nT} \sum_{h=1}^{h(b)} 1(\text{the $h$th hop of bit $b$ is over subchannel $m$ in slot $s$}) \times \frac{1}{4}\,\pi \left(\frac{\Delta}{2}\right)^2 (r_b^h)^2 \le W_m \tau \qquad (19.41)$$

Summing over the subchannels and the slots gives 

$$\sum_{b=1}^{\lambda nT} \sum_{h=1}^{h(b)} \frac{\pi \Delta^2}{16}\,(r_b^h)^2 \le WT$$

or equivalently 

$$\sum_{b=1}^{\lambda nT} \sum_{h=1}^{h(b)} \frac{1}{H}\,(r_b^h)^2 \le \frac{16WT}{\pi \Delta^2 H} \qquad (19.42)$$

Since the quadratic function is convex we have 

$$\left(\sum_{b=1}^{\lambda nT} \sum_{h=1}^{h(b)} \frac{1}{H}\, r_b^h\right)^2 \le \sum_{b=1}^{\lambda nT} \sum_{h=1}^{h(b)} \frac{1}{H}\,(r_b^h)^2 \qquad (19.43)$$

Finally, combining Equations (19.42) and (19.43) yields 

$$\sum_{b=1}^{\lambda nT} \sum_{h=1}^{h(b)} r_b^h \le \sqrt{\frac{16WTH}{\pi \Delta^2}} \qquad (19.44)$$

and substituting Equation (19.39) in Equation (19.44) gives 

$$\lambda nTL \le \sqrt{\frac{16WTH}{\pi \Delta^2}} \qquad (19.45)$$

Substituting Equation (19.40) in Equation (19.45) yields the result (r1). 

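For the physical model suppose $X_i$ is transmitting to $X_{j(i)}$ over the $m$th subchannel at 
power level $P_i$ at some time, and let $\mathcal{T}$ denote the set of all simultaneous transmitters over 
the $m$th subchannel at that time. The initial constraint introduced by (a6) can be represented 
as 

$$\frac{P_i / |X_i - X_{j(i)}|^{\alpha}}{N + \sum_{k \in \mathcal{T},\, k \ne i} P_k / |X_k - X_{j(i)}|^{\alpha}} \ge \beta \qquad (19.46)$$

By also including the signal power of $X_i$ in the denominator, the signal-to-interference 
requirement for $X_{j(i)}$ can be written as 

$$\frac{P_i / |X_i - X_{j(i)}|^{\alpha}}{N + \sum_{k \in \mathcal{T}} P_k / |X_k - X_{j(i)}|^{\alpha}} \ge \frac{\beta}{\beta + 1}$$

which results in 

$$|X_i - X_{j(i)}|^{\alpha} \le \frac{\beta + 1}{\beta}\; \frac{P_i}{N + \sum_{k \in \mathcal{T}} P_k / |X_k - X_{j(i)}|^{\alpha}} \le \frac{\beta + 1}{\beta}\; \frac{P_i}{N + (\pi/4)^{\alpha/2} \sum_{k \in \mathcal{T}} P_k}$$

(since $|X_k - X_{j(i)}| \le 2/\sqrt{\pi}$). 
Summing over all transmitter-receiver pairs, 

$$\sum_{i \in \mathcal{T}} |X_i - X_{j(i)}|^{\alpha} \le \frac{\beta + 1}{\beta}\; \frac{\sum_{i \in \mathcal{T}} P_i}{N + (\pi/4)^{\alpha/2} \sum_{k \in \mathcal{T}} P_k} \le 2^{\alpha} \pi^{-\alpha/2}\, \frac{\beta + 1}{\beta}$$

Summing over all slots and subchannels gives 

$$\sum_{b=1}^{\lambda nT} \sum_{h=1}^{h(b)} (r_b^h)^{\alpha} \le 2^{\alpha} \pi^{-\alpha/2}\, \frac{\beta + 1}{\beta}\, WT$$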

The rest of the proof proceeds along lines similar to the protocol model, invoking the 
convexity of $r^{\alpha}$ instead of $r^2$. For the special case where $P_{\max}/P_{\min} < \beta$, 
we start with Equation (19.46). From it, it follows that if $X_i$ is transmitting to $X_j$ at the 
same time that $X_k$ is transmitting to $X_\ell$, both over the same subchannel, then 

$$\frac{P_i / |X_i - X_j|^{\alpha}}{P_k / |X_k - X_j|^{\alpha}} \ge \beta$$

Thus 

$$|X_k - X_j| \ge (\beta P_{\min}/P_{\max})^{1/\alpha}\, |X_i - X_j| = (1 + \Delta)|X_i - X_j|$$

where $\Delta := (\beta P_{\min}/P_{\max})^{1/\alpha} - 1$. Thus the same upper bound as for the protocol model 
carries over with $\Delta$ defined as above. 



19.2.4 Arbitrary networks: lower bound on transport capacity 

There is a placement of nodes and an assignment of traffic patterns such that the network 
can achieve 

$$\frac{Wn}{(1 + 2\Delta)(\sqrt{n} + \sqrt{8\pi})} \ \text{b-m/s}$$

under the protocol model, and 

$$\frac{Wn}{(\sqrt{n} + \sqrt{8\pi})\left\{16\beta\left[2^{\alpha/2} + 6^{\alpha-2}/(\alpha - 2)\right]\right\}^{1/\alpha}} \ \text{b-m/s}$$

under the physical model, both whenever $n$ is a multiple of 4. To prove it, consider the 
protocol model and define 

$$r := \frac{1}{(1 + 2\Delta)\left(\sqrt{n/4} + \sqrt{2\pi}\right)}$$

Recall that the domain is a disk of unit area, i.e. of radius $1/\sqrt{\pi}$ in the plane. With the center 
of the disk located at the origin, place transmitters at locations $[j(1 + 2\Delta)r \pm \Delta r,\ k(1 + 2\Delta)r]$ 
and $[j(1 + 2\Delta)r,\ k(1 + 2\Delta)r \pm \Delta r]$ where $|j + k|$ is even. Also place receivers at 
$[j(1 + 2\Delta)r \pm \Delta r,\ k(1 + 2\Delta)r]$ and $[j(1 + 2\Delta)r,\ k(1 + 2\Delta)r \pm \Delta r]$ where $|j + k|$ is odd. 
Each transmitter can transmit to its nearest receiver, which is at a distance $r$ away, without 
interference from any other transmitter-receiver pair. It can be verified that there are at 
least $n/2$ transmitter-receiver pairs, all located within the domain. This is based on the 
fact that for a tessellation of the plane by squares of side $s$, all squares intersecting a disk 
of radius $R - \sqrt{2}s$ are entirely contained within a larger concentric disk of radius $R$. The 
number of such squares is greater than $\pi(R - \sqrt{2}s)^2/s^2$. This proves the above statement 
for $s = (1 + 2\Delta)r$ and $R = 1/\sqrt{\pi}$. Restricting attention to just these pairs, there are a total 
of $n/2$ simultaneous transmissions, each of range $r$, and each at $W$ b/s. This achieves the 
transport capacity indicated. 
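
The construction can be checked numerically. The Python sketch below builds the transmitter and receiver lattice on a finite square patch (ignoring the disk boundary), verifies that each interior transmitter has its nearest receiver at distance $r$ with the guard zone of the protocol model respected, and evaluates the achieved transport capacity. The function names, the patch size and the tolerance are assumptions made for this illustration.

```python
import math

def build_nodes(delta, r, half_width):
    """Place 4 transmitters around each even lattice point and 4 receivers
    around each odd lattice point, on a square patch of the lattice."""
    s = (1 + 2 * delta) * r      # lattice spacing
    off = delta * r              # offset of each node from its lattice point
    tx, rx = [], []
    for j in range(-half_width, half_width + 1):
        for k in range(-half_width, half_width + 1):
            pts = [(j * s + off, k * s), (j * s - off, k * s),
                   (j * s, k * s + off), (j * s, k * s - off)]
            (tx if (j + k) % 2 == 0 else rx).extend(pts)
    return tx, rx

def count_valid_pairs(tx, rx, delta, r, tol=1e-9):
    """Count transmitters whose nearest receiver is at distance r and is
    protected by the guard zone; raise if a guard zone is violated."""
    pairs = 0
    for t in tx:
        nearest = min(rx, key=lambda p: math.dist(t, p))
        if abs(math.dist(t, nearest) - r) > tol:
            continue             # edge of the finite patch: partner not built
        for other in tx:
            if other is not t and math.dist(other, nearest) < (1 + delta) * r - tol:
                raise ValueError("guard zone violated")
        pairs += 1
    return pairs

def hop_length(n, delta):
    """r := 1 / [(1 + 2*delta)(sqrt(n/4) + sqrt(2*pi))]."""
    return 1.0 / ((1 + 2 * delta) * (math.sqrt(n / 4) + math.sqrt(2 * math.pi)))

def squares_lower_bound(n, delta):
    """pi (R - sqrt(2) s)^2 / s^2 with s = (1 + 2*delta) r and R = 1/sqrt(pi)."""
    s = (1 + 2 * delta) * hop_length(n, delta)
    R = 1.0 / math.sqrt(math.pi)
    return math.pi * (R - math.sqrt(2) * s) ** 2 / s ** 2

def achieved_capacity(n, W, delta):
    """n/2 simultaneous transmissions, each of range r, each at W b/s."""
    return (n / 2) * hop_length(n, delta) * W
```

With these definitions, `squares_lower_bound(n, delta)` evaluates to $n/4$, so four nodes per square account for all $n$ nodes, and `achieved_capacity` reproduces $Wn/[(1 + 2\Delta)(\sqrt{n} + \sqrt{8\pi})]$.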

For the physical model, a calculation of the SIR shows that it is lower-bounded at all 
receivers by 

$$\frac{(1 + 2\Delta)^{\alpha}}{16\left[2^{\alpha/2} + 6^{\alpha-2}/(\alpha - 2)\right]}$$

Choosing $\Delta$ to make this lower bound equal to $\beta$ yields the result. 

In the protocol model, there is a placement of nodes and an assignment of traffic patterns 
such that the network can achieve 

$2W/\sqrt{\pi}$ b-m/s, for $n \ge 2$, 
$4W/[\sqrt{\pi}(1 + \Delta)]$ b-m/s, for $n \ge 8$, 
$Wn/[(1 + 2\Delta)(\sqrt{n} + \sqrt{8\pi})]$ b-m/s, for $n = 2, 3, 4, \ldots, 19, 20, 21$, and 
$4\lfloor n/4 \rfloor W/\{(1 + 2\Delta)(\sqrt{4\lfloor n/4 \rfloor} + \sqrt{8\pi})\}$ b-m/s, for all $n$. 



With at least two nodes, clearly $2W/\sqrt{\pi}$ b-m/s can be achieved by placing two nodes at 
diametrically opposite locations. This verifies the formula for the bound for $n < 8$. With at least 
eight nodes, four transmitters can be placed at the opposite ends of perpendicular diameters, 
and each can transmit toward its receiver located at a distance $1/[\sqrt{\pi}(1 + \Delta)]$ towards 
the center of the domain. This yields $4W/[\sqrt{\pi}(1 + \Delta)]$ b-m/s, verifying the formula up to 
$n = 21$. 



19.2.5 Random networks: lower bound on throughput capacity 

In this section we show that one can spatially and temporally schedule transmissions in a 
random graph so that, when each randomly located node has a randomly chosen destination, 
each source-destination pair can indeed be guaranteed a 'virtual channel' of capacity 
$cW/[(1 + \Delta)^2 \sqrt{n \log n}]$ b/s with probability approaching 1 as $n \to \infty$, for an appropriate 
constant $c > 0$. We will show how to route traffic efficiently through the random graph so 
that no node is overloaded. 



19.2.5.1 Spatial tessellation 

In the following a Voronoi tessellation of the surface $S^2$ of the sphere is used. For a set of $p$ 
points $\{a_1, a_2, \ldots, a_p\}$ on $S^2$ the Voronoi cell $V(a_i)$ is the set of all points which are closer 
to $a_i$ than to any of the other $a_j$s, i.e. [34], 

$$V(a_i) := \{x \in S^2 : |x - a_i| = \min_{1 \le j \le p} |x - a_j|\}$$

Figure 19.7 A tessellation of the surface $S^2$ of the sphere. 

Above and throughout, distances are measured on the surface $S^2$ of the sphere by segments 
of great circles connecting two points. The point $a_i$ is called the generator of the Voronoi cell 
$V(a_i)$. The surface of the sphere does not allow any regular tessellation where all cells look 
the same. For our application Voronoi tessellations will also need to be not too eccentrically 
shaped. Therefore, the properties of Voronoi tessellations needed in this section can be 
summarized as: 

(v1) For every $\varepsilon > 0$, there is a Voronoi tessellation of $S^2$ with the property that every 
Voronoi cell contains a disk of radius $\varepsilon$ and is contained in a disk of radius $2\varepsilon$ (see 
illustration in Figure 19.7). 

To prove this we denote by $D(x, \varepsilon)$ a disk of radius $\varepsilon$ centered at $x$. Choose $a_1$ as any point 
in $S^2$. Suppose that $a_1, \ldots, a_p$ have already been chosen such that the distance between 
any two $a_j$s is at least $2\varepsilon$. There are two cases to consider. Suppose there is a point $x$ such 
that $D(x, \varepsilon)$ does not intersect any $D(a_i, \varepsilon)$. Then $x$ can be added to the collection: define 
$a_{p+1} := x$. Otherwise, we stop. This procedure has to terminate in a finite number of steps 
since the addition of each $a_i$ removes the area of a disk of radius $\varepsilon > 0$ from $S^2$. When we 
stop we will have a set of generators such that they are at least $2\varepsilon$ units apart, and such that 
all other points on $S^2$ are within a distance of $2\varepsilon$ from one of the generators. The Voronoi 
tessellation obtained in this way has the desired properties. 
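
The greedy selection of generators used in this proof can be sketched in Python as follows. The sketch is a simplification under stated assumptions: it works on a unit-radius sphere (surface area $4\pi$ rather than 1 m$^2$) and scans a finite list of candidate points instead of all of $S^2$, so the covering property is only guaranteed for the candidates themselves. All function names are hypothetical.

```python
import math
import random

def random_point_on_sphere(rng):
    """Uniform random point on the unit sphere, as a Cartesian triple."""
    z = rng.uniform(-1.0, 1.0)
    phi = rng.uniform(0.0, 2.0 * math.pi)
    s = math.sqrt(1.0 - z * z)
    return (s * math.cos(phi), s * math.sin(phi), z)

def great_circle(a, b):
    """Great-circle distance between two points on the unit sphere."""
    dot = sum(x * y for x, y in zip(a, b))
    return math.acos(max(-1.0, min(1.0, dot)))

def greedy_generators(candidates, eps):
    """Add a candidate x as a new generator whenever D(x, eps) intersects
    no D(a_i, eps), i.e. whenever x is at least 2*eps from every generator."""
    generators = []
    for x in candidates:
        if all(great_circle(x, a) >= 2.0 * eps for a in generators):
            generators.append(x)
    return generators
```

After the scan, the generators are pairwise at least $2\varepsilon$ apart, and every rejected candidate lies within $2\varepsilon$ of some generator, mirroring the two properties stated at the end of the proof.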
In the sequel we will use a Voronoi tessellation $V_n$ for which 

(v2) Every Voronoi cell contains a disk of area $100 \log n / n$. 
Let $\rho(n)$ := radius of a disk of area $(100 \log n)/n$ on $S^2$. 

(v3) Every Voronoi cell is contained in a disk of radius $2\rho(n)$. 
We will refer to each Voronoi cell $V \in V_n$ as simply a 'cell'. 

19.2.5.2 Adjacency and interference 

By definition, two cells are adjacent if they share a common point. If we choose the range 
$r(n)$ of each transmission so that $r(n) = 8\rho(n)$, this range will allow direct communication 
within a cell and between adjacent cells. 

(v4) Every node in a cell is within a distance $r(n)$ from every node in its own cell or 
adjacent cell. 

To prove it we notice that the diameter of cells is bounded by $4\rho(n)$, see (v3). The range 
of a transmission is $8\rho(n)$. Thus the area covered by the transmission of a node includes 
adjacent cells. 

19.2.5.3 Interfering neighbors 

By definition we say that two cells are interfering neighbors if there is a point in one cell 
which is within a distance $(2 + \Delta)r(n)$ of some point in the other cell. In other words, if two 
cells are not interfering neighbors, then in the protocol model a transmission from one cell 
cannot collide with a transmission from the other cell. 

19.2.5.4 Bound on the number of interfering neighbors of a cell 

An important property of the constructed Voronoi tessellation $V_n$ is that the number of 
interfering neighbors of a cell is uniformly bounded. This will be exploited in the next 
section in constructing a spatial transmission schedule which allows for a high degree of 
spatial concurrency and thus frequency reuse. 

(v5) Every cell in $V_n$ has no more than $c_1$ interfering neighbors, where $c_1$ depends only 
on $\Delta$ and grows no faster than linearly in $(1 + \Delta)^2$. 

To prove it, let $V$ be a Voronoi cell. If $V'$ is an interfering neighboring Voronoi cell, there 
must be two points, one in $V$ and the other in $V'$, which are no more than $(2 + \Delta)r(n)$ 
units apart. From (v3), the diameter of a cell is bounded by $4\rho(n)$. Hence $V'$, and similarly 
every other interfering neighbor in the protocol model, must be contained within a common 
large disk $D$ of radius $6\rho(n) + (2 + \Delta)r(n)$. Such a disk $D$ cannot contain more than 
$c_2\{[6\rho(n) + (2 + \Delta)r(n)]^2/\rho^2(n)\}$ disks of radius $\rho(n)$. From (v2), there can therefore be 
no more than this number of cells within $D$. This then is an upper bound on the number of 
interfering neighbors of the cell $V$. The result follows from the chosen magnitudes of $\rho(n)$ 
and $r(n)$. 

19.2.5.5 A bound on the length of an all-cell inclusive transmission schedule 

The bounded number of interfering neighbors for each cell allows the construction of a 
schedule of bounded length which allows one opportunity for each cell in the tessellation 
$V_n$ to transmit. 

(v6) In the protocol model there is a schedule for transmitting packets such that in every 
$(1 + c_1)$ slots, each cell in the tessellation $V_n$ gets one slot in which to transmit, 
and such that all transmissions are successfully received within a distance $r(n)$ 
from their transmitters. 

(v7) There is a deterministic constant $c$ not depending on $n$, $N$, $\alpha$, $\beta$ or $W$ such that if 
$\Delta$ is chosen to satisfy 

$$(1 + \Delta)^2 > \left(2\left\{c\beta\left[3 + (\alpha - 1)^{-1} + 2(\alpha - 2)^{-1}\right]\right\}^{1/\alpha} - 1\right)^2$$

then for a large enough common power level $P$, the above result (v6) holds even 
for the physical model. 

To prove it we first show the result for the protocol model. This follows from a well-known 
fact about vertex coloring of graphs of bounded degree. A graph of degree no more than $c_1$ 
can have its vertices colored by using no more than $(1 + c_1)$ colors, with no two neighboring 
vertices having the same color [35]. One can therefore color the cells with no more than 
$(1 + c_1)$ colors such that no two interfering neighbors have the same color. This gives a 
schedule of length at most $(1 + c_1)$, where one can transmit one packet from each cell of 
the same color in a slot. 
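
The coloring argument can be illustrated with a standard greedy vertex coloring, sketched below in Python. This is a generic greedy algorithm chosen for illustration, not necessarily the procedure used in Reference [35]; the dictionary representation of the interference graph and the function names are assumptions.

```python
def greedy_coloring(neighbors):
    """neighbors maps each cell to the set of its interfering neighbors.
    Each cell receives the smallest color not used by an already-colored
    neighbor, so at most (1 + max degree) colors are needed."""
    color = {}
    for cell in neighbors:
        used = {color[v] for v in neighbors[cell] if v in color}
        c = 0
        while c in used:
            c += 1
        color[cell] = c
    return color

def schedule_from_coloring(color):
    """Slot k of the schedule contains all cells of color k."""
    slots = {}
    for cell, c in color.items():
        slots.setdefault(c, []).append(cell)
    return [slots[c] for c in sorted(slots)]
```

For a ring of five cells, where each cell interferes with its two neighbors, the greedy rule uses three colors, which matches the bound of $1 + c_1$ with $c_1 = 2$.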

For the physical model one can show that, under the same schedule as above, the required 
SIR of $\beta$ is obtained if each transmitter chooses an identical power level $P$ that is high 
enough, and $\Delta$ is large enough. From the previous discussion we know that any two nodes 
transmitting simultaneously are separated by a distance of at least $(2 + \Delta)r(n)$ and disks 
of radius $(1 + \Delta/2)r(n)$ around each transmitter are disjoint. The area of each such disk is 
at least $c_3\pi(1 + \Delta/2)^2 r^2(n)$. (In the case of disks on the plane $c_3 = 1$, but it is smaller for 
disks on the surface of the sphere.) 

Consider a node $X_i$ transmitting to a node $X_j$ at a distance less than $r(n)$. The signal 
power received at $X_j$ is at least $Pr^{-\alpha}(n)$. Now we look at the interference power due to 
all the other simultaneous transmissions. Consider the annulus of all points lying within a 
distance between $a$ and $b$ from $X_j$. A transmitter within this annulus has the disk centered 
at itself and of radius $(1 + \Delta/2)r(n)$ entirely contained within a larger annulus of all points 
lying between a distance $a - (1 + \Delta/2)r(n)$ and $b + (1 + \Delta/2)r(n)$. The area of this larger 
annulus is no more than 

$$c_4\pi\left\{[b + (1 + \Delta/2)r(n)]^2 - [a - (1 + \Delta/2)r(n)]^2\right\}$$

Each transmitter above 'consumes' an area of at least $c_3\pi(1 + \Delta/2)^2 r^2(n)$, as noted 
earlier. Hence the annulus of points at a distance between $a$ and $b$ from the receiver $X_j$ 
cannot contain more than 

$$\frac{c_4\pi\left\{[b + (1 + \Delta/2)r(n)]^2 - [a - (1 + \Delta/2)r(n)]^2\right\}}{c_3\pi(1 + \Delta/2)^2 r^2(n)}$$

transmitters. Also, the received power at $X_j$ from each such transmission is at most $P/a^{\alpha}$. 
Noting that there can be no other simultaneous transmitter within a distance $(1 + \Delta)r(n)$ 
of $X_j$, and taking $a = k(1 + \Delta/2)r(n)$ and $b = (k + 1)(1 + \Delta/2)r(n)$ for $k = 1, 2, 3, \ldots$, 
we see that the SIR at $X_j$ is lower-bounded by 

$$\frac{Pr^{-\alpha}(n)}{N + \sum_{k=1}^{\infty} \frac{c_4\left[(k + 2)^2 - (k - 1)^2\right]}{c_3} \cdot \frac{P}{[k(1 + \Delta/2)r(n)]^{\alpha}}} = \frac{Pr^{-\alpha}(n)}{N + \frac{c_4 P}{c_3 (1 + \Delta/2)^{\alpha} r^{\alpha}(n)} \sum_{k=1}^{\infty} \frac{6k + 3}{k^{\alpha}}}$$

Since $\alpha > 2$, the sum in the denominator converges, and is smaller than $9 + 3(\alpha - 1)^{-1} + 
6(\alpha - 2)^{-1}$. When $\Delta$ is as specified and $P \to \infty$, the lower bound on the SIR converges 
to a value greater than $\beta$. Using similar arguments one can show that [32] 

(v8) Each cell contains at least one node. 

(v9) The mean number of routes served by each cell $\le c_{10}\sqrt{n \log n}$. 

(v10) The actual traffic served by each cell $\le c_5\lambda(n)\sqrt{n \log n}$ whp. 
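
The convergence claim for the interference sum in the SIR bound above can be checked numerically. The short Python sketch below compares a long partial sum of $\sum_k (6k + 3)/k^{\alpha}$ with the stated bound $9 + 3(\alpha - 1)^{-1} + 6(\alpha - 2)^{-1}$; the function names and the truncation length are choices made for this illustration.

```python
def interference_sum(alpha, terms=100000):
    """Partial sum of (6k + 3) / k**alpha for k = 1..terms (requires alpha > 2)."""
    return sum((6 * k + 3) / k ** alpha for k in range(1, terms + 1))

def closed_form_bound(alpha):
    """The bound 9 + 3/(alpha - 1) + 6/(alpha - 2) quoted in the text."""
    return 9.0 + 3.0 / (alpha - 1.0) + 6.0 / (alpha - 2.0)
```

The bound follows from comparing each tail of the series with an integral, $\sum_{k \ge 1} k^{-p} \le 1 + 1/(p - 1)$ for $p > 1$, applied with $p = \alpha - 1$ and $p = \alpha$.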

19.2.5.6 Lower bound on throughput capacity of random networks 

From (v6) we know that there exists a schedule for transmitting packets such that in every 
$(1 + c_1)$ slots, each cell in the tessellation $V_n$ gets one slot to transmit, and such that each 
transmission is received within a range $r(n)$ of the transmitter. Thus the rate at which each 
cell gets to transmit is $W/(1 + c_1)$ b/s. On the other hand, the rate at which each cell needs 
to transmit is with high probability (whp) less than $c_5\lambda(n)\sqrt{n \log n}$ [see (v10)]. With high 
probability, this rate can be accommodated by all cells if it is less than the rate available, 
i.e. if $c_5\lambda(n)\sqrt{n \log n} \le W/(1 + c_1)$. Moreover, within a cell, the traffic to be handled 
by the entire cell can be handled by any one node in the cell, since each node can transmit 
at rate $W$ b/s whenever necessary. In fact, one can even designate one node in each cell as 
a 'relay' node. This node can handle all the traffic needing to be relayed. The other nodes 
can simply serve as sources or sinks. 

We have proved the following theorem, noting the linear growth of $c_1$ in $(1 + \Delta)^2$ in 
(v5), and the choice of $\Delta$ in (v7) for the physical model. 

(v11) For random networks on $S^2$ in the protocol model, there is a deterministic constant 
$c > 0$ not depending on $n$, $\Delta$ or $W$, such that $\lambda(n) = cW/[(1 + \Delta)^2\sqrt{n \log n}]$ 
b/s is feasible whp. 

(v12) For random networks on $S^2$ in the physical model, there are deterministic constants 
$c'$ and $c''$ not depending on $n$, $N$, $\alpha$, $\beta$ or $W$, such that 

$$\lambda(n) = \frac{c'W}{\left(2\left\{c''\beta\left[3 + (\alpha - 1)^{-1} + 2(\alpha - 2)^{-1}\right]\right\}^{1/\alpha} - 1\right)^2 \sqrt{n \log n}} \ \text{b/s}$$

is feasible whp. These throughput levels have been attained without subdividing 
the wireless channel into subchannels of smaller capacity. 
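
To get a feeling for the scaling in (v11), the following Python sketch evaluates the per-node throughput expression $cW/[(1 + \Delta)^2\sqrt{n \log n}]$. The constant $c$ is not specified in the text, so the default value `c=1.0` is purely a placeholder assumption; only ratios between different values of $n$ are meaningful.

```python
import math

def per_node_throughput(n, W, delta, c=1.0):
    """Feasible per-node throughput scaling from (v11); c is an unknown
    constant, set to 1.0 here only as a placeholder."""
    return c * W / ((1.0 + delta) ** 2 * math.sqrt(n * math.log(n)))

def aggregate_throughput(n, W, delta, c=1.0):
    """Sum over all n nodes; grows like sqrt(n / log n)."""
    return n * per_node_throughput(n, W, delta, c)
```

Going from $n = 100$ to $n = 10\,000$ nodes reduces the per-node throughput by a factor of $\sqrt{200} \approx 14.1$, while the aggregate throughput still grows.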



19.3 INFORMATION THEORY AND NETWORK ARCHITECTURES 
19.3.1 Network architecture 

For the optimization of ad hoc and sensor networks we will discuss some performance 
measures for the architectures shown in Figures 19.8-19.11. We will mainly focus on the 
transport capacity $C_T := \sup \sum_{\ell=1}^{m} R_\ell \rho_\ell$, where the supremum is taken over $m$ and vectors 
$(R_1, R_2, \ldots, R_m)$ of feasible rates for $m$ source-destination pairs, and $\rho_\ell$ is the distance 
between the $\ell$th source and its destination. 

Figure 19.8 A planar network: $n$ nodes located on a two-dimensional plane, with minimum 
separation distance $\rho_{\min}$. 

Figure 19.9 A regular planar network: $n$ nodes located on a plane at $(i, j)$ with $1 \le i, j \le 
\sqrt{n}$. (Reproduced by permission of IEEE [33].) 

For the planar network from Figure 19.8 we assume [33]: 

(1) There is a finite set $N$ of $n$ nodes located on a plane. 

(2) There is a minimum positive separation distance $\rho_{\min}$ between nodes, i.e. $\rho_{\min} := 
\min_{i \ne j} \rho_{ij} > 0$, where $\rho_{ij}$ is the distance between nodes $i, j \in N$. 

(3) Every node has a receiver and a transmitter. At time instants $t = 1, 2, \ldots$, node 
$i \in N$ sends $X_i(t)$, and receives $Y_i(t)$ with 

$$Y_i(t) = \sum_{j \ne i} \frac{e^{-\gamma \rho_{ij}}}{\rho_{ij}^{\delta}}\, X_j(t) + Z_i(t)$$

where $Z_i(t)$, $i \in N$, $t = 1, 2, \ldots$ are Gaussian independent and identically distributed 
(i.i.d.) random variables with mean zero and variance $\sigma^2$. The constant 
$\delta > 0$ is referred to as the path loss exponent, while $\gamma \ge 0$ will be called the 
absorption constant. A positive $\gamma$ generally prevails except for transmission in a 
vacuum, and corresponds to a loss of $20\gamma \log_{10} e$ decibels per meter. 

Figure 19.10 A linear network: $n$ nodes located on a line, with minimum separation distance 
$\rho_{\min}$. (Reproduced by permission of IEEE [33].) 

Figure 19.11 A regular linear network: $n$ nodes located on a line at $1, 2, \ldots, n$. (Reproduced 
by permission of IEEE [33].) 

(4) Denote by $P_i \ge 0$ the power used by node $i$. Two separate constraints on 
$(P_1, P_2, \ldots, P_n)$ are studied: the total power constraint $P_{\mathrm{total}}$: $\sum_{i=1}^{n} P_i \le P_{\mathrm{total}}$, or the 
individual power constraint $P_{\mathrm{ind}}$: $P_i \le P_{\mathrm{ind}}$, for $i = 1, 2, \ldots, n$. 

(5) The network can have several source-destination pairs $(s_\ell, d_\ell)$, $\ell = 1, \ldots, m$, 
where $s_\ell$, $d_\ell$ are nodes in $N$ with $s_\ell \ne d_\ell$, and $(s_\ell, d_\ell) \ne (s_j, d_j)$ for $\ell \ne j$. If 
$m = 1$, then there is only a single source-destination pair, which we will simply 
denote $(s, d)$. 
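
The channel model in assumption (3) can be made concrete with a short Python sketch. It evaluates the amplitude gain $e^{-\gamma\rho}/\rho^{\delta}$, the absorption loss of $20\gamma \log_{10} e$ dB per meter, and the noiseless part of the received signal at one node. The function names and the list-based representation of node positions and transmitted symbols are assumptions made for this example.

```python
import math

def amplitude_gain(rho, gamma, delta):
    """Amplitude gain exp(-gamma * rho) / rho**delta over a distance rho."""
    return math.exp(-gamma * rho) / rho ** delta

def absorption_loss_db_per_meter(gamma):
    """Loss caused by absorption alone: 20 * gamma * log10(e) dB per meter."""
    return 20.0 * gamma * math.log10(math.e)

def received_signal(i, x, positions, gamma, delta, z=0.0):
    """Y_i = sum over j != i of gain(rho_ij) * X_j + Z_i, for one time instant.
    x[j] is the symbol sent by node j, z is the noise sample at node i."""
    total = z
    for j, xj in enumerate(x):
        if j != i:
            rho = math.dist(positions[i], positions[j])
            total += amplitude_gain(rho, gamma, delta) * xj
    return total
```

For $\gamma = 1$ the absorption term alone costs about 8.69 dB per meter, in addition to the polynomial path loss governed by $\delta$.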

A special case is a regular planar network where the $n$ nodes are located at the points $(i, j)$ 
for $1 \le i, j \le \sqrt{n}$; see Figure 19.9. This setting will be used mainly to exhibit achievability 
of some capacities, i.e. inner bounds. Another special case is a linear network where the 
$n$ nodes are located on a straight line, again with minimum separation distance $\rho_{\min}$; see 
Figure 19.10. The main reason for considering linear networks is that the proofs are easier 
to state and comprehend than in the planar case, and can be generalized to the planar case. 
Also, the linear case may have some utility for, say, networks of cars on a highway, since 
its scaling laws are different. 

A special case of a linear network is a regular linear network where the $n$ nodes are 
located at the positions $1, 2, \ldots, n$; see Figure 19.11. This setting will also be used mainly 
to exhibit achievability results. 



19.3.2 Definition of feasible rate vectors 



Definition 1 

Consider a wireless network with multiple source-destination pairs $(s_\ell, d_\ell)$, $\ell = 1, \ldots, m$, 
with $s_\ell \ne d_\ell$, and $(s_\ell, d_\ell) \ne (s_j, d_j)$ for $\ell \ne j$. Let $S := \{s_\ell, \ell = 1, \ldots, m\}$ denote the set 
of source nodes. The number of nodes in $S$ may be less than $m$, since we allow a node 
to have originating traffic for several destinations. Then a $[(2^{TR_1}, \ldots, 2^{TR_m}), T, P_e^{(T)}]$ code 
with total power constraint $P_{\mathrm{total}}$ consists of the following: 

(1) $m$ independent random variables $W_\ell$ (transmitted words $TR_\ell$ bits long) with $P(W_\ell = 
k_\ell) = 1/2^{TR_\ell}$, for any $k_\ell \in \{1, 2, \ldots, 2^{TR_\ell}\}$, $\ell = 1, \ldots, m$. For any $i \in S$, let $\bar{W}_i := 
\{W_\ell : s_\ell = i\}$ and $\bar{R}_i := \sum_{\{\ell : s_\ell = i\}} R_\ell$. 

(2) Functions $f_{i,t} : \mathbb{R}^{t-1} \times \{1, 2, \ldots, 2^{T\bar{R}_i}\} \to \mathbb{R}$, $t = 1, 2, \ldots, T$, for the source 
nodes $i \in S$ and $f_{j,t} : \mathbb{R}^{t-1} \to \mathbb{R}$, $t = 2, \ldots, T$, for all the other nodes $j \notin S$, such 
that 

$$X_i(t) = f_{i,t}(Y_i(1), \ldots, Y_i(t - 1), \bar{W}_i), \quad t = 1, 2, \ldots, T$$

$$X_j(1) = 0, \quad X_j(t) = f_{j,t}(Y_j(1), \ldots, Y_j(t - 1)), \quad t = 2, 3, \ldots, T$$

such that the following total power constraint holds: 

$$\frac{1}{T} \sum_{t=1}^{T} \sum_{i \in N} X_i^2(t) \le P_{\mathrm{total}} \qquad (19.47)$$

(3) $m$ decoding functions 

$$g_\ell : \mathbb{R}^{T} \times \{1, 2, \ldots, |\bar{W}_{d_\ell}|\} \to \{1, 2, \ldots, 2^{TR_\ell}\}$$

for the destination nodes of the $m$ source-destination pairs $\{(s_\ell, d_\ell), \ell = 
1, \ldots, m\}$, where $|\bar{W}_{d_\ell}|$ is the number of different values $\bar{W}_{d_\ell}$ can take. Note that 
$\bar{W}_{d_\ell}$ may be empty. 

(4) The average probability of error: 

$$P_e^{(T)} := \mathrm{Prob}\,[(\hat{W}_1, \hat{W}_2, \ldots, \hat{W}_m) \ne (W_1, W_2, \ldots, W_m)] \qquad (19.48)$$

where $\hat{W}_\ell := g_\ell(Y_{d_\ell}^{T}, \bar{W}_{d_\ell})$, with $Y_{d_\ell}^{T} := [Y_{d_\ell}(1), Y_{d_\ell}(2), \ldots, Y_{d_\ell}(T)]$. 

Definition 2 

A rate vector $(R_1, \dots, R_m)$ is said to be feasible for the $m$ source-destination pairs
$(s_\ell, d_\ell)$, $\ell = 1, \dots, m$, with total power constraint $P_{\mathrm{total}}$, if there exists a sequence of
$[(2^{TR_1}, \dots, 2^{TR_m}), T, P_e^{(T)}]$ codes satisfying the total power constraint $P_{\mathrm{total}}$, such that
$P_e^{(T)} \to 0$ as $T \to \infty$.

The preceding definitions are presented with total power constraint $P_{\mathrm{total}}$. If an individual
power constraint $P_{\mathrm{ind}}$ is placed on each node, then Equation (19.47) should be modified:

$$\frac{1}{T} \sum_{t=1}^{T} X_i^2(t) \le P_{\mathrm{ind}}, \quad \text{for } i \in \mathcal{N} \qquad (19.49)$$

and the rest of the definitions should be modified correspondingly to define the set of feasible rate
vectors under an individual power constraint.

19.3.3 The transport capacity

The capacity region is the closure of the set of all such feasible rate vectors. As in Section
19.2, we will focus mainly on the distance-weighted sum of rates.



Definition 3 

As in Section 19.2, the network's transport capacity $C_T$ is

$$C_T := \sup_{(R_1, \dots, R_m)\ \text{feasible}} \ \sum_{\ell=1}^{m} R_\ell \cdot \rho_\ell$$

where $\rho_\ell := \rho_{s_\ell d_\ell}$ is the distance between $s_\ell$ and $d_\ell$, and $R_\ell := R_{s_\ell d_\ell}$. In the following, due
to limited space, a number of results from information theory will be presented without
formal proof. For more details the reader is referred to Xie and Kumar [33].
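As a small numerical illustration of Definition 3, the sketch below evaluates the distance-weighted sum of rates for one given rate vector. It does not compute the supremum over feasible vectors, which is the hard part of the definition. The node coordinates, pairs and rates are made-up inputs, and `distance_weighted_rate_sum` is a hypothetical helper name.

```python
import math


def distance_weighted_rate_sum(positions, pairs, rates):
    """Return sum over pairs of R_l * rho_l (bit-meters per second).

    positions: dict node -> (x, y) coordinates in meters (illustrative input)
    pairs:     list of (source, destination) node ids
    rates:     list of rates R_l in bits per second, one per pair
    """
    total = 0.0
    for (s, d), rate in zip(pairs, rates):
        rho = math.dist(positions[s], positions[d])  # distance between s_l and d_l
        total += rate * rho
    return total


# Illustrative values only: three nodes, two source-destination pairs.
example_positions = {0: (0.0, 0.0), 1: (3.0, 4.0), 2: (0.0, 2.0)}
example_pairs = [(0, 1), (2, 0)]
example_rates = [2.0, 1.0]
example_value = distance_weighted_rate_sum(example_positions, example_pairs, example_rates)
```

The transport capacity is the largest such value over all rate vectors that are feasible in the sense of Definition 2.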



19.3.4 Upper bounds under high attenuation 



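(r1) The transport capacity is bounded by the network's total transmission power in
media with $\gamma > 0$ or $\delta > 3$.

For a single link (s, d), the rate R is bounded by the received power at d. In wireless
networks, owing to mutual interference, the transport capacity is upper-bounded by the
total transmitted power $P_{\mathrm{total}}$ used by the entire network.

In any planar network, with either positive absorption, i.e. $\gamma > 0$, or with path loss
exponent $\delta > 3$,

$$C_T \le \frac{c_1(\gamma, \delta, \rho_{\min})}{\sigma^2} \cdot P_{\mathrm{total}} \qquad (19.50)$$

where

$$c_1(\gamma, \delta, \rho_{\min}) := \begin{cases} \dfrac{2^{2\delta+1} \log e \; e^{-\gamma \rho_{\min}/2} \left(2 - e^{-\gamma \rho_{\min}/2}\right)}{\gamma^2 \rho_{\min}^{2\delta+1} \left(1 - e^{-\gamma \rho_{\min}/2}\right)} & \text{if } \gamma > 0 \\[2ex] \dfrac{2^{2\delta+5} (3\delta - 8) \log e}{(\delta - 2)^2 (\delta - 3) \rho_{\min}^{2\delta-1}} & \text{if } \gamma = 0 \text{ and } \delta > 3 \end{cases} \qquad (19.51)$$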



(r2) The transport capacity follows an $O(n)$ scaling law under the individual power
constraint, in media with $\gamma > 0$ or $\delta > 3$.

Consider any planar network under the individual power constraint $P_{\mathrm{ind}}$. Suppose that either
there is some absorption in the medium, i.e. $\gamma > 0$, or there is no absorption at all but the
path loss exponent $\delta > 3$. Then its transport capacity is upper-bounded as follows:

$$C_T \le \frac{c_1(\gamma, \delta, \rho_{\min}) P_{\mathrm{ind}}}{\sigma^2} \cdot n \qquad (19.52)$$

where $c_1(\gamma, \delta, \rho_{\min})$ is given by Equation (19.51).
As in the previous section we use the notation: $f = O(g)$ if $\limsup_{n \to +\infty} [f(n)/g(n)] < +\infty$;
$f = \Omega(g)$ if $g = O(f)$; $f = \Theta(g)$ if $f = O(g)$ as well as $g = O(f)$. Thus, all $O(\cdot)$ results
are upper bounds, all $\Omega(\cdot)$ results are lower bounds, and all $\Theta(\cdot)$ results are sharp order
estimates for the transport capacity. For n nodes located in an area of A square meters, it is
shown in Section 19.2 that the transport capacity is of order $O(\sqrt{An})$ under a noninformation
theoretic protocol model. If A itself grows like n, i.e. $A = \Theta(n)$, then the scaling
law is $O(\sqrt{An}) = O(n)$, which coincides with the information-theoretic scaling law here.
In fact, A must grow at least at this rate since nodes are separated by a minimum distance
$\rho_{\min} > 0$, i.e. $A = \Omega(n)$, and so the $O(n)$ result here is slightly stronger than the $O(\sqrt{An})$
result in the previous section.

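(r3) If either $\gamma > 0$ or $\delta > 2$ in any linear network, then

$$C_T \le \frac{c_2(\gamma, \delta, \rho_{\min})}{\sigma^2} \cdot P_{\mathrm{total}} \qquad (19.53)$$

where

$$c_2(\gamma, \delta, \rho_{\min}) := \begin{cases} \dfrac{2 e^{-\gamma \rho_{\min}} \log e}{\left(1 - e^{-\gamma \rho_{\min}}\right)^2 \left(1 - e^{-2\gamma \rho_{\min}}\right) \rho_{\min}^{2\delta-1}} & \text{if } \gamma > 0 \\[2ex] \dfrac{2\delta (\delta^2 - \delta - 1) \log e}{(\delta - 1)^2 (\delta - 2) \rho_{\min}^{2\delta-1}} & \text{if } \gamma = 0 \text{ and } \delta > 2 \end{cases} \qquad (19.54)$$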



(r4) For any linear network, if either $\gamma > 0$ or $\delta > 2$, then the transport capacity is
upper-bounded as follows:

$$C_T \le \frac{c_2(\gamma, \delta, \rho_{\min}) P_{\mathrm{ind}}}{\sigma^2} \cdot n$$

where $c_2(\gamma, \delta, \rho_{\min})$ is as in Equation (19.54).
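The linear-network bounds (r3) and (r4) are simple enough to evaluate numerically. The following is a minimal sketch, assuming that "log e" denotes the base-2 logarithm of e (so that rates are in bits) and that all quantities are in consistent units; the function names are hypothetical.

```python
import math

# Assumption: "log e" in Equation (19.54) is log base 2 of e.
LOG_E = math.log2(math.e)


def c2(gamma, delta, rho_min):
    """Constant c_2(gamma, delta, rho_min) of Equation (19.54)."""
    if gamma > 0:
        a = math.exp(-gamma * rho_min)
        return (2 * a * LOG_E) / (
            (1 - a) ** 2 * (1 - a * a) * rho_min ** (2 * delta - 1)
        )
    if gamma == 0 and delta > 2:
        return (2 * delta * (delta ** 2 - delta - 1) * LOG_E) / (
            (delta - 1) ** 2 * (delta - 2) * rho_min ** (2 * delta - 1)
        )
    raise ValueError("bound needs gamma > 0, or gamma == 0 and delta > 2")


def linear_bound_total_power(gamma, delta, rho_min, p_total, sigma2):
    """Upper bound of (r3): c_2 * P_total / sigma^2."""
    return c2(gamma, delta, rho_min) * p_total / sigma2


def linear_bound_individual_power(gamma, delta, rho_min, p_ind, sigma2, n):
    """Upper bound of (r4): c_2 * P_ind * n / sigma^2, linear in n."""
    return c2(gamma, delta, rho_min) * p_ind * n / sigma2
```

For fixed medium parameters the (r4) bound grows linearly with the number of nodes n, which is the $O(n)$ scaling law stated above.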



19.3.5 Multihop and feasible lower bounds under high attenuation 

The $O(n)$ upper bound on transport capacity is tight for regular planar networks in media
with $\gamma > 0$ or $\delta > 3$, and it is achieved by multihop. The 'multihop strategy' is defined as
follows. Let $\Pi_\ell$ denote the set of all paths from source $s_\ell$ to destination $d_\ell$, where by
such a path $\pi$ we mean a sequence $(s_\ell = j_0, j_1, \dots, j_z = d_\ell)$ with $j_q \ne j_r$ for $q \ne r$.

The total traffic rate $R_\ell$ to be provided to the source-destination pair $(s_\ell, d_\ell)$ is split over
the paths in $\Pi_\ell$ in such a way that if traffic rate $\lambda_\pi \ge 0$ is to be carried over path $\pi$, then
$\sum_{\pi \in \Pi_\ell} \lambda_\pi = R_\ell$. On each path $\pi$, packets are relayed from node to next node. On each
such hop, each packet is fully decoded, treating all interference as noise. Thus, only point-
to-point coding is used, and no network coding or multiuser estimation is employed. Such
a strategy is of great interest and it is currently the object of much protocol development
activity.
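The bookkeeping behind the multihop strategy can be sketched in a few lines: check that the rates assigned to the paths of one source-destination pair add up to the pair's rate, and accumulate the traffic that passes through each node. This is only an illustration with hypothetical helper names and made-up paths; it says nothing about whether the resulting loads are actually supportable.

```python
from collections import defaultdict


def is_valid_split(path_rates, source, dest, rate, tol=1e-9):
    """True if every path is a loop-free source-to-destination path with a
    nonnegative rate, and the path rates sum to the pair's rate."""
    for path, lam in path_rates.items():
        if path[0] != source or path[-1] != dest:
            return False
        if len(set(path)) != len(path) or lam < 0:  # j_q != j_r for q != r
            return False
    return abs(sum(path_rates.values()) - rate) <= tol


def node_traffic(path_rates_per_pair):
    """Sum of path rates over all paths (of all pairs) that contain each node."""
    load = defaultdict(float)
    for path_rates in path_rates_per_pair:
        for path, lam in path_rates.items():
            for node in path:
                load[node] += lam
    return dict(load)


# Illustrative pair (0, 3) with rate 1.0 split over two paths.
example_split = {(0, 1, 3): 0.6, (0, 2, 3): 0.4}
example_load = node_traffic([example_split])
```

Paths are represented as tuples of node identifiers so that they can be used as dictionary keys.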

The following result implies that, when $\gamma > 0$ or $\delta > 3$, the sharp order of the transport
capacity for a regular planar network is $\Theta(n)$, and that it can be attained by multihop.

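(r5) In a regular planar network with either $\gamma > 0$ or $\delta > 1$, and individual power
constraint $P_{\mathrm{ind}}$,

$$C_T \ge S\left( \frac{e^{-2\gamma} P_{\mathrm{ind}}}{c_3(\gamma, \delta) P_{\mathrm{ind}} + \sigma^2} \right) \cdot n$$

where

$$c_3(\gamma, \delta) := \begin{cases} \dfrac{4(1 + 4\gamma) e^{-2\gamma} - 4 e^{-4\gamma}}{2\gamma \left(1 - e^{-2\gamma}\right)} & \text{if } \gamma > 0 \\[2ex] \dfrac{16\delta^2 + (2\pi - 16)\delta - \pi}{(\delta - 1)(2\delta - 1)} & \text{if } \gamma = 0 \text{ and } \delta > 1 \end{cases}$$

and $S(x)$ denotes the Shannon function $S(x) := [\log(1 + x)]/2$.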
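A minimal sketch of the multihop lower bound (r5) follows. It assumes a base-2 logarithm in the Shannon function and a signal-to-interference-plus-noise ratio of $e^{-2\gamma} P_{\mathrm{ind}} / (c_3 P_{\mathrm{ind}} + \sigma^2)$, i.e. a nearest-neighbour hop of unit length in the regular planar network; the function names are hypothetical.

```python
import math


def shannon(x):
    """Shannon function S(x) = log(1 + x) / 2, here with a base-2 logarithm."""
    return 0.5 * math.log2(1.0 + x)


def c3(gamma, delta):
    """Interference constant c_3(gamma, delta) of result (r5)."""
    if gamma > 0:
        num = 4 * (1 + 4 * gamma) * math.exp(-2 * gamma) - 4 * math.exp(-4 * gamma)
        return num / (2 * gamma * (1 - math.exp(-2 * gamma)))
    if gamma == 0 and delta > 1:
        num = 16 * delta ** 2 + (2 * math.pi - 16) * delta - math.pi
        return num / ((delta - 1) * (2 * delta - 1))
    raise ValueError("bound needs gamma > 0, or gamma == 0 and delta > 1")


def multihop_lower_bound(gamma, delta, p_ind, sigma2, n):
    """Lower bound on the transport capacity: S(SINR) * n."""
    sinr = math.exp(-2 * gamma) * p_ind / (c3(gamma, delta) * p_ind + sigma2)
    return shannon(sinr) * n
```

Because the per-hop ratio does not depend on n, the bound is linear in n, which is what makes the $O(n)$ upper bound tight in order.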

This order of distance weighted sum of rates is achievable by multihop. Multihop is
order-optimal in a random scenario over a regular planar network in media with $\gamma > 0$
or $\delta > 3$, providing some theoretical justification for its use in situations where traffic is
diffused over the network.

Consider a regular planar network with either $\gamma > 0$ or $\delta > 1$, and individual power
constraint $P_{\mathrm{ind}}$. The n source-destination pairs are randomly chosen as follows: every source
$s_\ell$ is chosen as the node nearest to a randomly (uniformly i.i.d.) chosen point in the domain,
and similarly for every destination $d_\ell$. Then

$$\lim_{n \to \infty} \mathrm{Prob}\left[ R_\ell = c/\sqrt{n \log n} \text{ is feasible for every } \ell \in \{1, 2, \dots, n\} \right] = 1$$

for some $c > 0$. Consequently, a distance weighted sum of
rates $C_T = \Omega(n/\sqrt{\log n})$ is supported with probability approaching one as $n \to \infty$. This
is within a factor $1/\sqrt{\log n}$ of the transport capacity $\Theta(n)$ possible when $\delta > 3$.

(r6) A vector of rates $(R_1, R_2, \dots, R_m)$ can be supported by multihop in a planar
network in media with $\gamma > 0$ or $\delta > 1$, if the traffic can be load balanced such that
no node is overloaded and no hop is too long.

This is a fairly straightforward result saying nothing about order optimality, and is 
provided only in support of the above theme that multihop is an appropriate architecture 
for balanceable scenarios. 

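(r7) A set of rates $(R_1, R_2, \dots, R_m)$ for a planar network can be supported by multihop
if no hop is longer than a distance $\bar{\rho}$, and for every $1 \le i \le n$, the traffic to be
relayed by node $i$ satisfies

$$\sum_{\ell=1}^{m} \ \sum_{\{\pi \in \Pi_\ell :\ \text{node } i \text{ belongs to } \pi\}} \lambda_\pi < S\left( \frac{e^{-2\gamma \bar{\rho}} P_{\mathrm{ind}}}{\bar{\rho}^{2\delta} \left[ c_4(\gamma, \delta, \rho_{\min}) P_{\mathrm{ind}} + \sigma^2 \right]} \right)$$

where

$$c_4(\gamma, \delta, \rho_{\min}) := \begin{cases} \dfrac{2^{3+2\delta} e^{-\gamma \rho_{\min}}}{\gamma \rho_{\min}^{1+2\delta}} & \text{if } \gamma > 0 \\[2ex] \dfrac{2^{2+2\delta}}{\rho_{\min}^{2\delta} (\delta - 1)} & \text{if } \gamma = 0 \text{ and } \delta > 1 \end{cases}$$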



19.3.6 The low-attenuation regime 

In this scenario no absorption, i.e. $\gamma = 0$, and a small path loss exponent are assumed. In this
case coherent relaying with interference subtraction (CRIS) is considered an interesting
strategy for information transmission in the following scenarios. For a source-destination
pair (s, d), the nodes are divided into groups, with the first group containing only the source,
and the last group containing only the destination d. Call the higher numbered groups
'downstream' groups, although they need not actually be closer to the destination. Nodes
in group $i$, for $1 \le i \le k - 1$, dedicate a portion $P_{ik}$ of their power to coherently transmit
for the benefit of node $k$ and its downstream nodes. Each node $k$ employs interference
subtraction during decoding to subtract out the known portion of its received signal being
transmitted by its downstream nodes.

(r8i) If there is no absorption, i.e. $\gamma = 0$, and the path loss exponent $\delta < 3/2$, then
even with a fixed total power $P_{\mathrm{total}}$, any arbitrarily large transport capacity can be
supported by CRIS in a regular planar network with a large enough number of
nodes n.

(r8ii) If $\gamma = 0$ and $\delta < 1$, then even with a fixed total power $P_{\mathrm{total}}$, CRIS can support
a fixed rate $R_{\min} > 0$ for any single source-destination pair in any regular planar
network, irrespective of the distance between them.

A similar result exists for the regular linear networks.

(r9i) If $\gamma = 0$ and $\delta < 1$, then even with a fixed total power $P_{\mathrm{total}}$, any arbitrarily large
transport capacity can be supported by CRIS in a regular linear network with a
large enough number of nodes n.

(r9ii) If $\gamma = 0$ and $\delta < 1/2$, then even with a fixed total power $P_{\mathrm{total}}$, CRIS can support
a fixed rate $R_{\min} > 0$ for any single source-destination pair in any regular linear
network, irrespective of the distance between them.

A superlinear $\Theta(n^\theta)$ scaling law with $1 < \theta < 2$ is feasible for some linear networks when
$\gamma = 0$ and $\delta < 1$.

(r10) For $\gamma = 0$ and individual power constraint $P_{\mathrm{ind}}$, for every $0.5 < \delta < 1$ and
$1 < \theta < 1/\delta$, there is a family of linear networks for which the transport capacity is
$C_T = \Theta(n^\theta)$. This order optimal transport capacity is attained in these networks
by CRIS.

19.3.7 The Gaussian multiple-relay channel 

The results for the low-attenuation regime rely on the following results for the Gaussian
multiple-relay channel. An example of a four-node network with two parallel relays is
shown in Figure 19.12. Consider a network of n nodes with $\alpha_{ij}$ the attenuation from node $i$
to node $j$ and i.i.d. additive $N(0, \sigma^2)$ noise at each receiver. Each node has an upper bound
on the power available to it, which may differ from node to node. Suppose there is a single 
source-destination pair (s, d). We call this the Gaussian multiple relay channel. 

The first result addresses the case where each relaying group consists of only one node.
The strategy used is CRIS. Consider the Gaussian multiple-relay channel with coherent
multistage relaying and interference subtraction. Consider M + 1 nodes, sequentially denoted
by 0, 1, ..., M, with 0 as the source, M as the destination, and the other M - 1 nodes
serving as M - 1 stages of relaying.

Figure 19.12 A four-node network with two parallel relays [33].

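(r11) Any rate R satisfying the following inequality is achievable from 0 to M:

$$R < \min_{1 \le j \le M} S\left( \frac{1}{\sigma^2} \sum_{k=1}^{j} \left( \sum_{i=0}^{k-1} \alpha_{ij} \sqrt{P_{ik}} \right)^2 \right)$$

where $P_{ik} \ge 0$ satisfies $\sum_{k=i+1}^{M} P_{ik} \le P_i$.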
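The right-hand side of (r11) can be evaluated directly for a given attenuation matrix and power split. The sketch below assumes a base-2 logarithm in $S(x)$; `alpha[i][j]` and `power[i][k]` are illustrative inputs (attenuation from node i to node j, and the power node i devotes to node k), and the helper names are hypothetical.

```python
import math


def shannon(x):
    """Shannon function S(x) = log(1 + x) / 2, with a base-2 logarithm."""
    return 0.5 * math.log2(1.0 + x)


def power_split_ok(power, limits, tol=1e-12):
    """Check P_ik >= 0 and sum_{k > i} P_ik <= P_i for every transmitting node i."""
    m = len(power) - 1
    for i in range(m):
        used = power[i][i + 1:]
        if any(p < 0 for p in used) or sum(used) > limits[i] + tol:
            return False
    return True


def cris_rate_bound(alpha, power, sigma2):
    """min over j = 1..M of S((1/sigma^2) * sum_{k=1}^{j} (sum_{i<k} alpha_ij sqrt(P_ik))^2)."""
    m = len(alpha) - 1
    best = math.inf
    for j in range(1, m + 1):
        total = 0.0
        for k in range(1, j + 1):
            # Coherent combining: amplitudes add before squaring.
            amplitude = sum(alpha[i][j] * math.sqrt(power[i][k]) for i in range(k))
            total += amplitude ** 2
        best = min(best, shannon(total / sigma2))
    return best
```

Any rate strictly below the returned value is achievable according to (r11); maximizing over the power split and the ordering of the relays is left out of this sketch.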

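For the network setting in (r11), Theorem 3.1 in Gupta and Kumar [36] shows that a rate
$R_0$ is achievable if there exist some $\{R_1, R_2, \dots, R_{M-1}\}$ such that

$$R_{M-1} < S\left( \frac{P^R_{M,M-1}}{\sigma^2 + \sum_{\ell=0}^{M-2} P^R_{M,\ell}} \right)$$

and

$$R_m < \min\left\{ S\left( \frac{P^R_{m+1,m}}{\sigma^2 + \sum_{\ell=0}^{m-1} P^R_{m+1,\ell}} \right),\ R_{m+1} + \min_{m+2 \le k \le M} S\left( \frac{P^R_{k,m}}{\sigma^2 + \sum_{\ell=0}^{m-1} P^R_{k,\ell}} \right) \right\}$$

for each $m = 0, 1, \dots, M - 2$, where

$$P^R_{k,\ell} := \left( \sum_{i=0}^{\ell} \alpha_{ik} \sqrt{P_{i,\ell+1}} \right)^2, \quad 0 \le \ell < k \le M.$$

From the above, recursively for $m = M - 2, M - 3, \dots, 0$, it is easy to prove that

$$R_m < \min_{m+1 \le j \le M} S\left( \frac{\sum_{k=m}^{j-1} P^R_{j,k}}{\sigma^2 + \sum_{\ell=0}^{m-1} P^R_{j,\ell}} \right)$$

For $m = 0$, this inequality is exactly (r11), showing a higher achievable rate. The right-hand
side (RHS) in (r11) can be maximized over the choice of order of the M - 1 intermediate
nodes. The relaying can also be done by groups, and the next result addresses this. As above,
maximization can be done over the assignment of nodes to the groups.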

Consider again the Gaussian multiple-relay channel using coherent multistage relaying
with interference subtraction. Consider any M + 1 groups of nodes sequentially denoted
by $N_0, N_1, \dots, N_M$ with $N_0 = \{s\}$ as the source, $N_M = \{d\}$ as the destination, and the other
M - 1 groups as M - 1 stages of relay. Let $n_i$ be the number of nodes in group $N_i$,
$i \in \{0, 1, \dots, M\}$. Let the power constraint for each node in group $N_i$ be $P_i/n_i \ge 0$.

(r12) Any rate R satisfying the following inequality is achievable from s to d:

$$R < \min_{1 \le j \le M} S\left( \frac{1}{\sigma^2} \sum_{k=1}^{j} \left( \sum_{i=0}^{k-1} \alpha_{N_i N_j} \sqrt{P_{ik}/n_i} \cdot n_i \right)^2 \right)$$

where $P_{ik} \ge 0$ satisfies $\sum_{k=i+1}^{M} P_{ik} \le P_i$, and $\alpha_{N_i N_j} := \min\{\alpha_{k\ell} : k \in N_i, \ell \in N_j\}$,
$i, j \in \{0, 1, \dots, M\}$.



As pointed out earlier, more on the results (r1)-(r12) can be found in Xie and Kumar [33].

19.4 COOPERATIVE TRANSMISSION IN WIRELESS
MULTIHOP AD HOC NETWORKS

The technique discussed in this section allows us to transmit reliably to far destinations 
that the individual nodes are not able to reach without rapidly consuming their own battery 
resources, even when using multihop links discussed so far. The results are of interest 
in both ad hoc and sensor networks. The key idea is to have the nodes simply echo the 
source's (leader) transmission operating as active scatterers while using adaptive receivers 
that acquire the equivalent network signatures corresponding to the echoed symbols. The 
active nodes in the network operate either as regenerative or nonregenerative relays. The 
intuition is that each of the waveforms will be enhanced by the accumulation of power 
due to the aggregate transmission of all the nodes while, if kept properly under control, 
the random errors or the receiver noise that propagate together with the useful signals will 
cause limited deterioration in the performance. The avalanche of signals triggered by the 
network leaders forms the so-called opportunistic large array (OLA). 

In contrast to Sections 19.2 and 19.3, we are interested in this section in a method 
that utilizes the network as a distributed modem, where one or few sources are effectively 
transmitting data and all the other users are operating as repeaters. A fresh look into the 
concept of repeaters as a form of cooperative transmission came recently from References 
[37-40]. 

We will assume that, in a network of N nodes transmitting over a shared medium, each
node is part of a multiple stage relay of a single source transmitting toward a remote receiver
whose position is unknown to all the nodes. If no node in the network is powerful enough to
communicate reliably with the remote receiver, the problem is referred to as the reach-back
problem. Coordination among nodes in a large network is an extremely difficult task. In
a cooperative transmission mechanism for which cooperation is obtained in a distributed
fashion, the source (leader) transmits a pulse with complex envelope $p_m(t)$ out of an M-ary
set of waveforms. The resulting signal at the $i$th receiver is $r_i(t) = s_{i,m}(t) + n_i(t)$, where
$s_{i,m}(t)$ is the network-generated signature of the $m$th symbol. If N nodes echo exactly the
same symbol,

$$s_{i,m}(t) = \sum_{n=1}^{N} A_{i,n}(t)\, p_m[t - \tau_{i,n}(t)], \quad m = 0, \dots, M - 1$$

where $n_i(t)$ is the $i$th receiver AWGN with variance $N_0$, $\tau_{i,n}(t)$ is the delay of the link between
the $i$th and the $n$th node, including the asynchronism of the beginning of transmission for
each node $n$, and $A_{i,n}(t)$ is the product of a complex fading coefficient $\omega_{i,n}(t)$, the transmit
power $P_t$ and the channel average gain, e.g. $\propto (1 + d_{i,n})^{-\alpha_{i,n}}$ (log-normal fading), where
$d_{i,n}$ is the distance, and $\alpha_{i,n}$ the decay constant between the $i$th and $n$th nodes.
The following assumptions are used:

(a1) $A_{i,n}(t)$ and $\tau_{i,n}(t)$ are constant over multiple symbol durations $T_s$; the nodes are
quasi-stationary for a time much greater than $T_s$.

(a2) The delays are $\tau_{i,1} < \tau_{i,2} < \cdots < \tau_{i,N}$, where the minimum delay $\tau_{i,1}$ corresponds
to the leader. To avoid ISI, the upper bound for the effective symbol rate is
$R_s = 1/T_s \le 1/\Delta\tau$, where $\Delta\tau$ is the maximum delay spread of $s_{i,m}(t)$ for all $i$. The
delay spread for node $i$ is defined as

$$\sigma_{\tau_i} = \sqrt{ \frac{\int_{-\infty}^{\infty} (t - \bar{\tau}_i)^2\, |s_{i,m}(t)|^2\, dt}{\int_{-\infty}^{\infty} |s_{i,m}(t)|^2\, dt} }$$

where the average delay

$$\bar{\tau}_i = \frac{\int_{-\infty}^{\infty} t\, |s_{i,m}(t)|^2\, dt}{\int_{-\infty}^{\infty} |s_{i,m}(t)|^2\, dt}$$

and thus, $\Delta\tau = \max_i \sigma_{\tau_i}$. Echoes that come from farther away are strongly attenuated
(by $\approx d^{-\alpha}$); therefore, the echoes received at node $i$ are nonnegligible only for
those coming from nodes within a certain distance $\Delta d$, which essentially depends
on the transmit power and path loss. Hence, $R_s$ can be increased by lowering the
transmit power, capitalizing on spatial bandwidth reuse. In the reach-back problem,
however, the delay spread is $\Delta\tau \approx \sup_i [\tau_{i,N} - \tau_{i,1}]$ because the receiver is
roughly at the same distance from all nodes.

(a3) $T_s$ is fixed for all nodes to $c_1 \Delta\tau$, where $c_1$ is a constant taken to satisfy the ISI
constraint. With (a3), we guarantee that no ambiguity will occur at the nodes in
timing their responses. The transmission activity of the node is solely dependent
on the signal that the node receives. Based on the evolution of $s_{i,m}(t)$, we can
distinguish two phases: (1) the earlier receive phase, when the upstream waves
of signals approach the node, and (2) the period after the firing instant, which
we call the rest phase, where the node hears the echoes of the downstream wave
of signals fading away (for the regenerative case, the firing instant occurs shortly
after the time when the node has accumulated enough energy to detect the signal).
The switching between the two modes can be viewed as a very elementary form
of time-division duplex (TDD).

(a4) The leader (and also the nodes in the regenerative case) transmits pulses with
complex envelope $p_m(t)$ having limited double-sided bandwidth $W$ and, approximately,
duration $T_p$. By sampling at the Nyquist rate, $N_p = T_p W$ is the approximate
length of the sequence $\{p_m(k/W)\}$ of samples. Multipath propagation can be
simply included in the model by increasing the number of terms in the summation
in $s_{i,m}(t)$; therefore, it does not require special attention. In fact, when we neglect
the propagation of errors and noise that occurs in the case of regenerative and
nonregenerative repeaters, respectively, the OLA itself is equivalent to a multipath
channel, created by a set of active scatterers. In the regenerative case, the ideal
OLA response is

$$s_{i,m}(t) = \sum_{n=1}^{N} A_{i,n}\, p_m(t - \tau_{i,n}) \qquad (19.55)$$

The nonregenerative OLA scattering model is more complex due to the feedback effect,
which implies that not one but several signal contributions are scattered by each source.
The received OLA response is

$$s_{i,m}(t) = \sum_{n=1}^{N'} A_{i,n}\, p_m(t - \tau_{i,n}) \qquad (19.55a)$$
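As a numerical illustration of the delay spread in assumption (a2), the sketch below treats the signature as a set of discrete echoes, each with a complex amplitude and a delay. It assumes the echoed pulses are short enough not to overlap, so that the energy integrals reduce to sums over echoes; that simplification and the helper names are not from the source.

```python
import math


def delay_spread(amplitudes, delays):
    """Energy-weighted mean delay and RMS delay spread of a set of echoes.

    amplitudes: complex (or real) echo amplitudes A_{i,n}
    delays:     echo delays tau_{i,n} in seconds
    """
    energies = [abs(a) ** 2 for a in amplitudes]
    total = sum(energies)
    mean = sum(e * t for e, t in zip(energies, delays)) / total
    variance = sum(e * (t - mean) ** 2 for e, t in zip(energies, delays)) / total
    return mean, math.sqrt(variance)


def max_symbol_rate(spreads):
    """Upper bound 1 / (max_i sigma_tau_i) on the symbol rate that avoids ISI."""
    return 1.0 / max(spreads)


# Illustrative echoes: two equal-energy arrivals 2 microseconds apart.
example_mean, example_spread = delay_spread([1.0, 1j], [0.0, 2e-6])
```

Weaker, later echoes contribute little to the spread, which is why lowering the transmit power allows a higher symbol rate outside the reach-back scenario.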



For every possible path in the network, there is a contribution to the summation in Equation
(19.55a) that has an amplitude equal to the product of all the path link gains traveled
so far and a delay equal to the sum of all the path delays. Theoretically, the number of
reflections $N' \to \infty$ because the signals and their amplified versions keep cycling in the
network and adding up. If properly controlled, the contributions will keep adding up and
then opportunistically serve the purpose of enhancing the signal. Hence, the key for the
nonregenerative design is to control the noise that accompanies the useful signal. In both
regenerative and nonregenerative cases, the received signal can be rewritten as the following
convolution:

$$r_i(t) = g_i(t) * p_m(t) + n_i(t) \qquad (19.56)$$

where $g_i(t)$ is the network impulse response, which is analogous to that of a multipath
channel. Based on Equation (19.56), the idea is to let the nodes operate as regenerative
and nonregenerative repeaters and avoid any complex coordination procedure to forward
their signals at the network layer and share the bandwidth at the MAC layer. In addition, no
channel state information is used. The information flow is carried forward using receivers
that are capable of tracking the unknown network response $g_i(t)$ or, directly, the signature
waveforms $s_{i,m}(t) = g_i(t) * p_m(t)$. We should expect that the OLA behaves as a frequency-
selective channel. Nodes' mobility causes changes of the response $g_i(t)$ over time. If most
of the network is stationary and N is large, the inertia of the system will be such that mobile
nodes will cause small changes in $g_i(t)$.

Since the transmission channel is bandlimited with passband bandwidth $W$, the signature
waveform $p_m(t)$ will have to be bandlimited and, therefore, uniquely expressible through its
samples taken at the Nyquist rate $1/T_c$, where $T_c = 1/W$. In general, $p_m(t)$ corresponds to a
finite number of samples $N_p$ and is approximately time limited with duration $T_p \approx N_p/W$.
Introducing the vectors $\mathbf{p}_m$, $\mathbf{g}_i$ and $\mathbf{r}_i$ such that

$$\{\mathbf{p}_m\}_k = p_m(kT_c), \quad k = 0, \dots, N_p - 1$$
$$\{\mathbf{g}_i\}_k = \int \mathrm{sinc}(\pi W \tau)\, g_i(kT_c + l_i T_c - \tau)\, d\tau, \quad k = 0, \dots, N_i - 1$$
$$\{\mathbf{r}_i\}_k = r_i(kT_c + l_i T_c), \quad k = 0, \dots, N_i + N_p - 2$$
$$\{\mathbf{n}_i\}_k = n_i(kT_c + l_i T_c), \quad k = 0, \dots, N_i + N_p - 2$$

where $N_i$ is the number of samples needed to represent $g_i(t)$, we have

$$\{\mathbf{r}_i\}_k = \sum_{n=0}^{N_p - 1} \{\mathbf{g}_i\}_{k-n} \{\mathbf{p}_m\}_n + \{\mathbf{n}_i\}_k$$

By using the following Toeplitz convolution matrix:

$$\{\mathbf{G}_i\}_{k,n} = \{\mathbf{g}_i\}_{k-n}, \quad n = 0, \dots, N_p - 1; \ k = 0, \dots, N_i + N_p - 2$$

we obtain

$$\mathbf{r}_i = \mathbf{G}_i \mathbf{p}_m + \mathbf{n}_i \qquad (19.57)$$
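The matrix form in Equation (19.57) is the usual way of writing a discrete convolution as a matrix-vector product. A minimal sketch in plain Python, with noise omitted and made-up sample values (the helper names are hypothetical), shows that the Toeplitz product and the direct convolution agree:

```python
def convolution_matrix(g, n_p):
    """Toeplitz matrix G with G[k][n] = g[k - n] (zero outside the support of g).

    g:   samples of the network response, length N_i
    n_p: number of pulse samples N_p
    The result has N_i + N_p - 1 rows and N_p columns.
    """
    n_i = len(g)
    rows = n_i + n_p - 1
    return [
        [g[k - n] if 0 <= k - n < n_i else 0.0 for n in range(n_p)]
        for k in range(rows)
    ]


def mat_vec(matrix, vector):
    """Plain matrix-vector product."""
    return [sum(a * b for a, b in zip(row, vector)) for row in matrix]


def direct_convolution(g, p):
    """Reference implementation of the discrete convolution g * p."""
    out = [0.0] * (len(g) + len(p) - 1)
    for a, g_a in enumerate(g):
        for b, p_b in enumerate(p):
            out[a + b] += g_a * p_b
    return out


# Illustrative samples: a three-tap network response and a two-sample pulse.
example_g = [1.0, 0.5, 0.25]
example_p = [1.0, -1.0]
example_r = mat_vec(convolution_matrix(example_g, len(example_p)), example_p)
```

A receiver that estimates the product of the matrix and each candidate pulse vector has everything it needs to distinguish the M symbols, without knowing the individual link gains.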

19.4.1 Transmission strategy and error propagation

The transmission of the OLA is led by a predetermined source node in the network. All the
other nodes form multiple stages of relays to either flood the network with the information
from the source, or just to pass the information to a remote receiver. The intermediate nodes
in the OLA have a choice of whether to relay or not, depending on the performance at that node.
In the regenerative scheme, each OLA node has the choice of retransmitting its detected
symbol or staying silent. Only nodes that are connected actively reply, where connectivity
is defined as follows.

Definition 1 

The $i$th regenerative node is connected if, based on its estimates of all possible signatures
$\mathbf{G}_i \mathbf{p}_m$ and receiver noise variance, the pairwise symbol error probability of the $i$th receiver
(not considering error propagation) is below a fixed upper bound $\varepsilon$, i.e.
$\max_m \Pr\{m \to \mu\} \le \varepsilon$, $\forall \mu \ne m$, $m = 0, \dots, M - 1$.

In the $N_s$ samples contained in each symbol period, the time instant selected for the
detection and subsequent echo is the first sample $N_i \le N_s$ at which the node is connected.
If there is no such sample, the node will never echo the signal (but it may obviously detect
the information at its own risk).

In the nonregenerative scheme, every node that achieves the SNR constraint amplifies
the signal coming from the other nodes as well as their receiver noise. Hence, the noise $\mathbf{n}_i$
has a rather complex structure since it includes the noise that comes from every node that
has transmitted previously and all its subsequent amplifications along with the signal. Since
the geographical area in an ad hoc or sensor network is limited, the delay spread of each
node response will also be limited, as far as the signal-to-noise contribution is concerned.
Considerations on the SNR can be deduced by considering the inherent recursive structure
of the signal composition. Details can be found in Scaglione and Hong [40].

Definition 2 

The $i$th nonregenerative node is said to be connected if the signal to noise ratio at the node
is above a fixed threshold $\xi$.

19.4.2 OLA flooding algorithm

In this section, we compare numerically OLA with the more traditional ways of flooding 
the network described in Chapter 13. The flooding algorithm in Royer and Toh [41] is 
commonly indicated as one of the simplest ways of distributing information in the network 
or of searching for a path to the desired destination to initialize table-driven protocols. 
Some interesting alternatives are the probabilistic scheme [42] and the scalable broadcast 
algorithm [43] . Most of these methods require the MAC and physical layer to provide virtual 
transmission pipes (obtained typically with contention) that connect each pair of nodes, 
which imitates wired networks. The approach is legitimate but obviously inefficient. In fact, 
in solving the network broadcast problem, it is natural to utilize and integrate in the design 
the fact that wireless devices are physically broadcasting. This is precisely what happens 
in the OLA, where, contrary to the networking approaches, the transmission protocol and
cooperation strategy operate at the physical layer. In the OLA framework, the receiver has to
solve an equivalent point-to-point communication problem without requiring higher layers'
interventions (MAC or network layer). The benefits from eliminating two layers are higher
connectivity and faster flooding. However, the OLA flooding algorithm cannot be used for 
some broadcasting applications such as route discovery due to the fact that the higher layer 
information is eliminated. 

Each node in the OLA is assumed to have identical transmission resources; therefore, 
any node has the ability of assuming the role of a leader. The leader can be chosen to be the 
leader of a troop, the clusterheads in clustering algorithms, or simply some node that has 
information to send. 



19.4.3 Simulation environment 

Various network broadcasting methods were analyzed using the ns2 network simulator. 
The simulation parameters are specified in Table 19.1 [40, 44]. Physical layer resources of 
the IEEE 802.11 DSSS PHY specifications [45] for the 2.4 GHz carrier were used. Each 
node-to-node transmission is assumed to experience independent small-scale fading with 
Rayleigh coefficients of variance 1. The large-scale fading is deterministic, and the path 
loss model is based on the model used in ns2 [46], where the free space model is used for 
distance d < d c (the cross-over distance) and the two-ray ground reflection model is used 
for $d > d_c$, where $d_c = 4\pi/\lambda$. The position of the 'leader' is randomly selected, and the
OLA is regenerative.

There are three parameters that define the simulation setting. The first is the point-to-point
average SNR (averaged over the small scale fading), which is defined as
$\mathrm{SNR}_{p2p}(d) = P_t/(N_0 d^\alpha)$, where $d^\alpha$ is the path loss. The second is the transmission
radius $d_{p2p} = [P_t/(N_0 \xi)]^{1/\alpha}$, which is equal to the distance at which $\mathrm{SNR}_{p2p}(d) = \xi$
using the specified path loss model. The exact SNR at each node is different due to the
accumulation of signals. Therefore, we define a third parameter, which is the node SNR at the
detection level, $\mathrm{SNR}_{det} = \|\mathbf{g}_i\|^2 / E\{\|\mathbf{n}_i\|^2\}$. The value of $\mathrm{SNR}_{det}$ can be mapped one-to-one
into the node error rate if error propagation is neglected and provides a criterion equivalent
to the one in Definition 1 to establish whether a node is connected or not. In all cases, the
threshold on $\mathrm{SNR}_{det}$ is the same as the required point-to-point $\mathrm{SNR}_{p2p}(d)$ used to define
network links in conventional networks; this value is denoted by $\xi$.
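The relation between the point-to-point SNR and the transmission radius can be checked with a few lines of code. This is a sketch under a single-exponent path loss $d^\alpha$, not the two-regime ns2 model described above; the numerical values and function names are illustrative assumptions.

```python
def db_to_linear(value_db):
    """Convert a ratio in dB to linear scale."""
    return 10.0 ** (value_db / 10.0)


def snr_p2p(p_t, n0, distance, alpha):
    """Point-to-point average SNR: P_t / (N_0 * d^alpha)."""
    return p_t / (n0 * distance ** alpha)


def transmission_radius(p_t, n0, xi, alpha):
    """Distance at which the point-to-point SNR equals the threshold xi."""
    return (p_t / (n0 * xi)) ** (1.0 / alpha)


# Illustrative numbers: 10 dB threshold, path loss exponent 2.
example_xi = db_to_linear(10.0)
example_radius = transmission_radius(1e-4, 1e-9, example_xi, 2.0)
```

Lowering the transmit power or raising the threshold shrinks the radius, which is the effect explored in the connectivity results for radii of 100, 80 and 60 m.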

To simplify the network simulations, it is assumed that the transmission propagates
through the network approximately in a multiple ring structure shown in Figure 19.13. In
each ring, we prune away the nodes that do not have strong enough $\mathrm{SNR}_{det}$, but we do not
detect and retransmit at the exact time when the $\mathrm{SNR}_{det}$ reaches the threshold. Because we
just partition the network geographically, we can expect, in general, nonuniform and lower
error rates than the ones prescribed by the threshold on $\mathrm{SNR}_{det}$. In the experiments, it was
assumed that the signal space is perfectly estimated at each relaying node. This assumption
is practical, because the network is static, and when the number of training symbols is
sufficiently large, the noise variance caused by the contribution of the estimation error can
be neglected. Figure 19.14 shows to what degree the network is connected according to
Definition 1 when the threshold for $(\mathrm{SNR}_{det})_{dB}$ is 10 dB. Specifically, the connectivity ratio
(CR) is shown, which is defined as the number of nodes that are 'connected', over the
total number of nodes in the network. The nodes' transmit power and thermal noise are
constant and are fixed so that $(\mathrm{SNR}_{p2p})_{dB} = 10$ at the distances $d_{p2p} = 100$, 80 and 60 m,
representing the transmission radii.

Table 19.1 Simulation parameters

Simulation parameters         Values
Network area                  350 x 350 m
Radius of TX                  100 m
Payload                       64 bytes/packet
Number of trials              100
Modulation                    BPSK
Bandwidth (IEEE 802.11)       83.5 MHz

Figure 19.13 Nodes in the ring Δr transmit approximately simultaneously.

Figure 19.14 Connectivity ratio vs number of nodes in the network.

For $d_{p2p} = 100$ m, the CR is 100 % even at very low node density. As we shorten the
radius of transmission, the connectivity of the network will decrease. Figure 19.15 plots
the delivery ratio (DR), which is defined as the ratio between the average number of nodes
that receive packets using a specific flooding algorithm and the number of nodes that are
connected in multiple hops, i.e. nodes for which there exists a path from the leader that is
formed with point-to-point links having $\mathrm{SNR}_{p2p}$ above a fixed threshold. The only cause of
packet loss considered in Williams and Camp [44] is the fact that the packet is not delivered
because it is dropped by the intermediate relays' queues to reduce the nodes' congestion.
Routing, MAC and physical layer errors and their possible propagation are ignored in the
definition of DR. DR essentially shows how routing and MAC problems can reduce the
probability of successfully reaching the nodes. Hence, according to Williams and Camp
[44], the simple flooding algorithm achieves 100 % DR, even if it might create longer delays
and instability due to the increased level of traffic. In the OLA, the accumulation of signal
energy may still allow extra nodes (besides the ones that have a multihop route) to receive
the broadcast packets reliably. Therefore, if we calculate the ratio between the number
of nodes connected in the OLA according to Definition 1 and the number of nodes that are
connected through multihop point-to-point links, the DR can exceed 100 %. Using the
parameters in Table 19.1, Figure 19.15 plots the DR vs the number
of nodes. It is shown that there can be remarkable gains in connectivity over any scheme
operating solely on point-to-point links.

Figure 19.15 Delivery ratio vs number of nodes in the network.
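The two ratios used in these comparisons can be computed from node sets. The sketch below finds the nodes reachable from the leader over point-to-point links no longer than the transmission radius (a breadth-first search), and then forms the CR and DR. The node positions and the set of OLA-connected nodes are made-up inputs, and the helper names are hypothetical; deciding which nodes are connected in the OLA requires the SNR accumulation model and is not attempted here.

```python
import math
from collections import deque


def multihop_connected(positions, leader, radius):
    """Nodes reachable from the leader over links no longer than `radius`."""
    reached = {leader}
    queue = deque([leader])
    while queue:
        u = queue.popleft()
        for v, pos in positions.items():
            if v not in reached and math.dist(positions[u], pos) <= radius:
                reached.add(v)
                queue.append(v)
    return reached


def connectivity_ratio(connected, positions):
    """Connected nodes over all nodes in the network."""
    return len(connected) / len(positions)


def delivery_ratio(received, multihop_set):
    """Nodes that receive the packet over nodes connected in multiple hops."""
    return len(received) / len(multihop_set)


# Illustrative layout: node 3 is out of reach of any single point-to-point link.
example_positions = {0: (0.0, 0.0), 1: (50.0, 0.0), 2: (120.0, 0.0), 3: (300.0, 0.0)}
example_multihop = multihop_connected(example_positions, leader=0, radius=100.0)
example_ola = {0, 1, 2, 3}  # assumed outcome of signal accumulation, not computed
```

With these inputs the DR is 4/3, which shows how a ratio above 100 % arises when energy accumulation reaches nodes that have no multihop route.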

The end-to-end delay is the time required to broadcast the packet to the entire network. 
In the OLA flooding algorithm, there is no channel contention, and therefore, the overhead 
necessary for carrier sensing and collision avoidance used in IEEE 802.11 is eliminated. 
With the reduction of overheads and the time saved by avoiding channel contention, it 
is clear that the speed of flooding will be much higher than the traditional broadcasting 
methods. Figure 19.16 shows the end-to-end delay vs the number of nodes in the network. 
The end-to-end delay is only in the order of milliseconds for a packet payload of 64 bytes,
which coincides with the symbol period $T_s$ times the number of bits in a packet.



19.5 NETWORK CODING 

In this section we consider a communication network in which certain source nodes multicast 
information to other nodes on the network in multihops where every node can pass on any 
of its received data to others. The question is how the data should be sent and how fast each 
node can receive the complete information. Allowing a node to encode its received data 
before passing it on, the question involves optimization of the multicast mechanisms at the 
nodes. Among the simplest coding schemes is linear coding, which regards a block of data 
as a vector over a certain base field and allows a node to apply a linear transformation to a 
vector before passing it on. 

Figure 19.16 End-to-end delay vs number of nodes in the network.

Figure 19.17 Illustration of network coding, panels (a) and (b).

As in Chapter 7, we define a communication network as a pair (G, S), where G is a finite 
directed multigraph and S (source) is the unique node in G without any incoming edges. A 
directed edge in G is called a channel in the communication network (G, S). A channel in 
graph G represents a noiseless communication link on which one unit of information (e.g. 
a bit) can be transmitted per unit time. The multiplicity of the channels from a node X to 
another node Y represents the capacity of direct transmission from X to Y. We assume that
every single channel has unit capacity.

At the source S, information is generated and multicast to other nodes on the network 
in the multihop fashion where every node can pass on any of its received data to other 
nodes. At each nonsource node which serves as a sink, the complete information generated 
at S is recovered. Now the question is how fast each sink node can receive the complete 
information. As an example, consider the multicast of two data bits, $b_1$ and $b_2$, from the
source S in the communication network depicted in Figure 19.17(a) to both nodes Y and
Z. One solution is to let the channels ST, TY, TW and WZ carry the bit $b_1$ and channels
SU, UZ, UW and WY carry the bit $b_2$. Note that, in this scheme, an intermediate node sends
out a data bit only if it receives the same bit from another node. For example, the node T
receives the bit $b_1$ and sends a copy on each of the two channels TY and TW. Similarly, the
node U receives the bit $b_2$ and sends a copy into each of the two channels UW and UZ. We
assume that there is no processing delay at the intermediate nodes.

Unlike a conserved physical commodity, information can be replicated or coded. The
notion of network coding refers to coding at the intermediate nodes when information is
multicast in a network. Let us now illustrate network coding by considering the
communication network depicted in Figure 19.17(b). Again, we want to multicast two bits
$b_1$ and $b_2$ from the source S to both the nodes Y and Z. A solution is to let the channels
ST, TW and TY carry the bit $b_1$, channels SU, UW and UZ carry the bit $b_2$, and channels
WX, XY and XZ carry the exclusive-OR $b_1 \oplus b_2$. Then, the node Y receives $b_1$ and
$b_1 \oplus b_2$, from which the bit $b_2 = b_1 \oplus (b_1 \oplus b_2)$ can be decoded. Similarly,
the node Z can decode the bit $b_1$ from $b_2$ and $b_1 \oplus b_2$ as
$b_1 = b_2 \oplus (b_1 \oplus b_2)$. The coding/decoding scheme is assumed to have been
agreed upon beforehand.

In order to discuss this issue in more detail, in this section we first introduce the notion
of a linear-code multicast (LCM). Then we show that, with a 'generic' LCM, every node can
simultaneously receive information from the source at a rate equal to its max-flow bound.
After that, we describe the physical implementation of an LCM, first when the network is
acyclic and then when the network is cyclic, followed by a presentation of a greedy
algorithm for constructing a generic LCM for an acyclic network. The same algorithm can
be applied to a cyclic network by expanding the network into an acyclic network. This
results in a 'time-varying' LCM, which, however, requires high complexity in
implementation. After that, we introduce the time-invariant LCM (TILCM).
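
The exclusive-OR scheme described for Figure 19.17(b) can be sketched in a few lines. This is only an illustration: the function names and the dictionary layout are assumptions made here, and the channel labels simply concatenate the tail and head node names used in the text.

```python
from itertools import product

def butterfly_multicast(b1: int, b2: int) -> dict:
    """Bit carried on each channel of Figure 19.17(b) (illustrative layout)."""
    coded = b1 ^ b2                       # node W combines its two inputs
    return {
        "ST": b1, "TW": b1, "TY": b1,     # branch through T forwards b1
        "SU": b2, "UW": b2, "UZ": b2,     # branch through U forwards b2
        "WX": coded, "XY": coded, "XZ": coded,
    }

def decode_at_Y(ch: dict) -> tuple:
    """Y knows b1 (channel TY) and b1 xor b2 (channel XY)."""
    return ch["TY"], ch["TY"] ^ ch["XY"]

def decode_at_Z(ch: dict) -> tuple:
    """Z knows b2 (channel UZ) and b1 xor b2 (channel XZ)."""
    return ch["UZ"] ^ ch["XZ"], ch["UZ"]

# Both sinks recover both bits for every input pair.
for b1, b2 in product((0, 1), repeat=2):
    ch = butterfly_multicast(b1, b2)
    assert decode_at_Y(ch) == (b1, b2)
    assert decode_at_Z(ch) == (b1, b2)
```

Each channel carries exactly one bit, so both sinks obtain two bits per channel use, which plain forwarding over the same nine channels cannot do because the single channel WX would have to carry both bits.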

Definition 1 

Over a communication network a flow from the source to a nonsource node T is a collection 
of channels, to be called the busy channels in the flow, such that: (1) the subnetwork defined 
by the busy channels is acyclic, i.e. the busy channels do not form directed cycles; (2) for 
any node other than S and T, the number of incoming busy channels equals the number 
of outgoing busy channels; (3) the number of outgoing busy channels from S equals the 
number of incoming busy channels to T. The number of outgoing busy channels from S will 
be called the volume of the flow. The node T is called the sink of the flow. All the channels 
on the communication network that are not busy channels of the flow are called the idle 
channels with respect to the flow. 

Definition 2 

For every nonsource node T on a network (G, S), the maximum volume of a flow from the
source to T is denoted $\mathrm{maxflow}_G(T)$, or simply mf(T) when there is no ambiguity.

Definition 3 

A cut on a communication network (G, S) between the source and a nonsource node T
means a collection C of nodes which includes S but not T. A channel XY is said to be in the
cut C if $X \in C$ and $Y \notin C$. The number of channels in a cut is called the value of the cut.

19.5.1 Max-flow min-cut theorem (mfmcT) 

For every nonsource node T, the minimum value of a cut between the source and the node
T is equal to mf(T). Let d be the maximum of mf(T) over all T. In the sequel, the symbol
$\Omega$ will denote a fixed d-dimensional vector space over a sufficiently large base field. The
information unit is taken as a symbol in the base field. In other words, one symbol in the
base field can be transmitted on a channel every unit time.
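
As a concrete check of the max-flow values used later, the following sketch computes mf(T) for the network of Figure 19.17(b) with a standard augmenting-path search (breadth-first, one unit per path). The channel list is read off the text; the function name `max_flow` and the data layout are assumptions of this sketch, not part of the original presentation.

```python
from collections import defaultdict, deque

def max_flow(channels, source, sink):
    """Max-flow value on a directed multigraph with unit-capacity channels."""
    cap = defaultdict(int)
    adj = defaultdict(set)
    for x, y in channels:
        cap[(x, y)] += 1          # parallel channels add up
        adj[x].add(y)
        adj[y].add(x)             # reverse arc of the residual graph
    flow = 0
    while True:
        parent = {source: None}
        queue = deque([source])
        while queue and sink not in parent:      # breadth-first search
            x = queue.popleft()
            for y in adj[x]:
                if y not in parent and cap[(x, y)] > 0:
                    parent[y] = x
                    queue.append(y)
        if sink not in parent:
            return flow                          # no augmenting path left
        y = sink
        while parent[y] is not None:             # push one unit along the path
            x = parent[y]
            cap[(x, y)] -= 1
            cap[(y, x)] += 1
            y = x
        flow += 1

# Channels of Figure 19.17(b) as (tail, head) pairs.
butterfly = [("S", "T"), ("S", "U"), ("T", "W"), ("T", "Y"), ("U", "W"),
             ("U", "Z"), ("W", "X"), ("X", "Y"), ("X", "Z")]
mf = {t: max_flow(butterfly, "S", t) for t in "TUWXYZ"}
d = max(mf.values())
```

For this network the sinks Y and Z both have max-flow 2, which is why a two-dimensional space $\Omega$ is used in the example that follows.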

Definition 4 

An LCM v on a communication network (G, S) is an assignment of a vector space v(X)
to every node X and a vector v(XY) to every channel XY such that (1) $v(S) = \Omega$; (2)
$v(XY) \in v(X)$ for every channel XY; and (3) for any collection $\wp$ of nonsource nodes in
the network, $\langle \{v(T) : T \in \wp\} \rangle = \langle \{v(XY) : X \notin \wp, Y \in \wp\} \rangle$.
The notation $\langle \cdot \rangle$ is for linear span. Condition (3) says that the vector spaces
v(T) on all nodes T inside $\wp$ together have the same linear span as the vectors v(XY) on
all channels XY to nodes in $\wp$ from outside $\wp$.



LCM v data transmission: The information to be transmitted from S is encoded as a 
d-dimensional row vector, referred to as an information vector. Under the transmission 
mechanism prescribed by the LCM v, the data flowing on a channel XY is the matrix 
product of the information (row) vector with the (column) vector v(XY). In this way, the 
vector v(XY) acts as the kernel in the linear encoder for the channel XY. As a direct 
consequence of the definition of an LCM, the vector assigned to an outgoing channel 
from a node X is a linear combination of the vectors assigned to the incoming channels 
to X. Consequently, the data sent on an outgoing channel from a node X is a linear 
combination of the data sent on the incoming channels to X. Under this mechanism, the 
amount of information reaching a node T is given by the dimension of the vector space 
v(T) when the LCM v is used. 



Coding in Figure 19.17(b) is achieved with the LCM v specified by

$$v(ST) = v(TW) = v(TY) = (1\ 0)^T$$

$$v(SU) = v(UW) = v(UZ) = (0\ 1)^T \qquad (19.58)$$

and

$$v(WX) = v(XY) = v(XZ) = (1\ 1)^T$$

where $(\ )^T$ stands for transposed vector. The data sent on a channel is the matrix product of
the row vector $(b_1\ b_2)$ with the column vector assigned to that channel by v. For instance,
the data sent on the channel WX is

$$(b_1\ b_2)\, v(WX) = (b_1\ b_2)(1\ 1)^T = b_1 + b_2$$

Note that, in the special case when the base field of $\Omega$ is GF(2), the sum $b_1 + b_2$ reduces
to the exclusive-OR $b_1 \oplus b_2$ of the earlier example.

Proposition P1

For every LCM v on a network, $\dim[v(T)] \le \mathrm{mf}(T)$ for all nodes T. To prove it, fix a
nonsource node T and any cut C between the source and T. Then
$v(T) \subset \langle v(Z) : Z \notin C \rangle = \langle v(YZ) : Y \in C \text{ and } Z \notin C \rangle$.
Hence, $\dim[v(T)] \le \dim(\langle v(YZ) : Y \in C \text{ and } Z \notin C \rangle)$,
which is at most equal to the value of the cut. In particular, $\dim[v(T)]$ is upper-bounded
by the minimum value of a cut between S and T, which by the max-flow min-cut theorem is
equal to mf(T).

This means that mf(T) is an upper bound on the amount of information received by T when 
an LCM v is used. 

19.5.2 Achieving the max-flow bound through a generic LCM 

In this section, we derive a sufficient condition for an LCM v to achieve the max-flow bound
on $\dim[v(T)]$ in Proposition P1.

Definition

An LCM v on a communication network is said to be generic if the following
condition holds for any collection of channels $X_1Y_1, X_2Y_2, \ldots, X_mY_m$ for $1 \le m \le d$:
(*) $v(X_k) \not\subset \langle \{v(X_jY_j) : j \ne k\} \rangle$ for $1 \le k \le m$ if and only if the vectors
$v(X_1Y_1), v(X_2Y_2), \ldots, v(X_mY_m)$ are linearly independent. If
$v(X_1Y_1), v(X_2Y_2), \ldots, v(X_mY_m)$ are linearly independent, then
$v(X_k) \not\subset \langle \{v(X_jY_j) : j \ne k\} \rangle$ since $v(X_kY_k) \in v(X_k)$. A generic LCM
requires that the converse is also true. In this sense, a generic LCM assigns vectors which are
as linearly independent as possible to the channels.

With respect to the communication network in Figure 19.17(b), the LCM v defined by
Equation (19.58) is a generic LCM. However, the LCM u defined by

$$u(ST) = u(TW) = u(TY) = (1\ 0)^T$$

$$u(SU) = u(UW) = u(UZ) = (0\ 1)^T \qquad (19.59)$$

and

$$u(WX) = u(XY) = u(XZ) = (1\ 0)^T$$

is not generic. This is seen by considering the set of channels {ST, WX}, where

$$u(S) = u(W) = \langle (1\ 0)^T, (0\ 1)^T \rangle$$

Then $u(S) \not\subset \langle u(WX) \rangle$ and $u(W) \not\subset \langle u(ST) \rangle$, but u(ST) and u(WX)
are not linearly independent. Therefore, u is not generic. It also follows from the definition
that, in a generic LCM v, any collection of channels $XY_1, XY_2, \ldots, XY_m$ from a node X
with $m \le \dim[v(X)]$ must be assigned linearly independent vectors by v.
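
Condition (*) can be checked mechanically on a small network. The sketch below does so over GF(2) for the two assignments of Equations (19.58) and (19.59). The choice of GF(2), the helper names (`rank`, `node_space`, `satisfies_star`, `is_generic`) and the brute-force enumeration of channel collections are assumptions of this illustration; the enumeration is only practical for very small networks.

```python
from itertools import combinations

P = 2  # assumption: base field GF(2), as in the exclusive-OR example

def rank(vectors, p=P):
    """Rank over GF(p), p prime, by Gaussian elimination."""
    rows = [[x % p for x in v] for v in vectors]
    r = 0
    for c in range(len(rows[0]) if rows else 0):
        pivot = next((i for i in range(r, len(rows)) if rows[i][c]), None)
        if pivot is None:
            continue
        rows[r], rows[pivot] = rows[pivot], rows[r]
        inv = pow(rows[r][c], -1, p)
        rows[r] = [(x * inv) % p for x in rows[r]]
        for i in range(len(rows)):
            if i != r and rows[i][c]:
                f = rows[i][c]
                rows[i] = [(a - f * b) % p for a, b in zip(rows[i], rows[r])]
        r += 1
    return r

def node_space(lcm, node, d=2):
    """Spanning set of v(node): all of the space at S, else the incoming vectors."""
    if node == "S":
        return [tuple(int(i == j) for j in range(d)) for i in range(d)]
    return [vec for ch, vec in lcm.items() if ch[1] == node]

def contained(space, span):
    """True if the span of `space` lies inside the span of `span`."""
    return rank(list(span) + list(space)) == rank(list(span))

def satisfies_star(lcm, channels):
    """Condition (*) for one collection of channels (names are tail + head)."""
    vecs = [lcm[ch] for ch in channels]
    tails_free = all(
        not contained(node_space(lcm, ch[0]), vecs[:k] + vecs[k + 1:])
        for k, ch in enumerate(channels))
    independent = rank(vecs) == len(vecs)
    return tails_free == independent          # "if and only if"

def is_generic(lcm, d=2):
    """Check (*) for every collection of 1..d channels."""
    return all(satisfies_star(lcm, list(c))
               for m in range(1, d + 1) for c in combinations(list(lcm), m))

v = {"ST": (1, 0), "TW": (1, 0), "TY": (1, 0), "SU": (0, 1), "UW": (0, 1),
     "UZ": (0, 1), "WX": (1, 1), "XY": (1, 1), "XZ": (1, 1)}
u = dict(v, WX=(1, 0), XY=(1, 0), XZ=(1, 0))
```

With these definitions `satisfies_star(u, ["ST", "WX"])` fails, which is the violation pointed out in the text, while every collection of at most two channels passes for v.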

Theorem T1

If v is a generic LCM on a communication network, then for all nodes T, $\dim[v(T)] =
\mathrm{mf}(T)$. To prove it, consider a node T not equal to S. Let $f$ be the common value of
mf(T) and the minimum value of a cut between S and T. The inequality $\dim[v(T)] \le f$
follows from Proposition P1. So, we only have to show that $\dim[v(T)] \ge f$. To do so, let
$\dim(C) = \dim(\langle v(X, Y) : X \in C \text{ and } Y \notin C \rangle)$ for any cut C between S and T.
We will show that $\dim[v(T)] \ge f$ by contradiction. Assume $\dim[v(T)] < f$ and let A be
the collection of cuts U between S and T such that $\dim(U) < f$. Since $\dim[v(T)] < f$
implies $V \setminus \{T\} \in A$, where V is the set of all the nodes in G, A is nonempty.

By the assumption that v is a generic LCM, the number of edges out of S is
at least d, and $\dim(\{S\}) = d \ge f$. Therefore, $\{S\} \notin A$. Then there must exist a
minimal member $U \in A$ in the sense that for any $Z \in U \setminus \{S\}$ (which is nonempty),
$U \setminus \{Z\} \notin A$. Clearly, $U \ne \{S\}$ because $\{S\} \notin A$. Let K be the set of channels
in cut U and B be the set of boundary nodes of U, i.e. $Z \in B$ if and only if $Z \in U$ and
there is a channel (Z, Y) such that $Y \notin U$. Then for all $W \in B$,
$v(W) \not\subset \langle v(X, Y) : (X, Y) \in K \rangle$, which can be seen as follows. The set of
channels in cut $U \setminus \{W\}$ but not in K is given by $\{(X, W) : X \in U \setminus \{W\}\}$. Since v
is an LCM, $\langle v(X, W) : X \in U \setminus \{W\} \rangle \subset v(W)$. If
$v(W) \subset \langle v(X, Y) : (X, Y) \in K \rangle$, then
$\langle v(X', Y') : X' \in U \setminus \{W\}, Y' \notin U \setminus \{W\} \rangle$, the subspace spanned by
the channels in cut $U \setminus \{W\}$, is contained in $\langle v(X, Y) : (X, Y) \in K \rangle$. This
implies that $\dim(U \setminus \{W\}) \le \dim(U) < f$, which is a contradiction. Therefore, for all
$W \in B$, $v(W) \not\subset \langle v(X, Y) : (X, Y) \in K \rangle$. For all $(W, Y) \in K$, since
$\langle v(X, Z) : (X, Z) \in K \setminus \{(W, Y)\} \rangle \subset \langle v(X, Y) : (X, Y) \in K \rangle$,
$v(W) \not\subset \langle v(X, Y) : (X, Y) \in K \rangle$ implies that
$v(W) \not\subset \langle v(X, Z) : (X, Z) \in K \setminus \{(W, Y)\} \rangle$.

Then, by the definition of a generic LCM, $\{v(X, Y) : (X, Y) \in K\}$ is a collection of vectors
such that $\dim(U) = \min(|K|, d)$. Finally, by the max-flow min-cut theorem, $|K| \ge f$, and
since $d \ge f$, $\dim(U) \ge f$. This is a contradiction to the assumption that $U \in A$. The
theorem is proved.

An LCM for which $\dim[v(T)] = \mathrm{mf}(T)$ for all T provides a way for broadcasting a
message generated at the source S for which every nonsource node T receives the message
at a rate equal to mf(T). This is illustrated by the next example, which is based upon the
assumption that the base field of $\Omega$ is an infinite field or a sufficiently large finite field. In
this example, we employ a technique which is justified by the following arguments.

Lemma 1 

Let X, Y and Z be nodes such that mf(X) = i, mf(Y) = j, and mf(Z) = k, where $i \le j$ and
$i > k$. By removing any edge UX in the graph, mf(X) and mf(Y) are reduced by at most 1,
and mf(Z) remains unchanged.

To prove it we note that, by removing an edge UX, the value of a cut C between the
source S and node X (respectively, node Y) is reduced by 1 if edge UX is in C; otherwise,
the value of C is unchanged. By the mfmcT, we see that mf(X) and mf(Y) are reduced by
at most 1 when edge UX is removed from the graph. Now consider the value of a cut C
between the source S and node Z. If C contains node X, then edge UX is not in C, and,
therefore, the value of C remains unchanged upon the removal of edge UX. If C does not
contain node X, then C is a cut between the source S and node X. By the mfmcT, the value
of C is at least i. Then, upon the removal of edge UX, the value of C is lower-bounded by
$i - 1 \ge k$. Hence, by the mfmcT, mf(Z) remains equal to k upon the removal of edge UX.

Example E1

Consider a communication network for which mf(T) = 4, 3 or 1 for nodes T in the network.
The source S is to broadcast 12 symbols $a_1, \ldots, a_{12}$ taken from a sufficiently large base
field F. (Note that 12 is the least common multiple of 4, 3 and 1.) Define the set
$T_i = \{T : \mathrm{mf}(T) = i\}$, for i = 4, 3, 1. For simplicity, we use the second as the time unit.
We now describe how $a_1, \ldots, a_{12}$ can be broadcast to the nodes in $T_4$, $T_3$, $T_1$ in 3, 4
and 12 s, respectively, assuming the existence of an LCM on the network for d = 4, 3, 1.

(1) Let $v_1$ be an LCM on the network with d = 4. Let $\alpha_1 = (a_1\ a_2\ a_3\ a_4)$,
$\alpha_2 = (a_5\ a_6\ a_7\ a_8)$ and $\alpha_3 = (a_9\ a_{10}\ a_{11}\ a_{12})$. In the first second,
transmit $\alpha_1$ as the information vector using $v_1$, in the second second, transmit
$\alpha_2$, and in the third second, transmit $\alpha_3$. After 3 s, neglecting delay in
transmissions and computations, all the nodes in $T_4$ can recover $\alpha_1$, $\alpha_2$ and $\alpha_3$.

(2) Let r be a vector in $F^4$ such that $\langle \{r\} \rangle$ intersects trivially with $v_1(T)$ for all T
in $T_3$, i.e. $\langle \{r, v_1(T)\} \rangle = F^4$ for all T in $T_3$. Such a vector r can be found when F is
sufficiently large because there are a finite number of nodes in $T_3$. Define
$b_i = \alpha_i r$ for i = 1, 2, 3. Now remove incoming edges of nodes in $T_4$, if necessary,
so that mf(T) becomes 3 if T is in $T_4$; otherwise, mf(T) remains unchanged. This is
based on Lemma 1. Let $v_2$ be an LCM on the resulting network with d = 3. Let
$\beta = (b_1\ b_2\ b_3)$ and transmit $\beta$ as the information vector using $v_2$ in the fourth
second. Then all the nodes in $T_3$ can recover $\beta$ and hence $\alpha_1$, $\alpha_2$ and $\alpha_3$.

(3) Let $s_1$ and $s_2$ be two vectors in $F^3$ such that $\langle \{s_1, s_2\} \rangle$ intersects with $v_2(T)$
trivially for all T in $T_1$, i.e. $\langle \{s_1, s_2, v_2(T)\} \rangle = F^3$ for all T in $T_1$. Define
$\gamma_i = \beta s_i$ for i = 1, 2. Now remove incoming edges of nodes in $T_4$ and $T_3$, if
necessary, so that mf(T) becomes 1 if T is in $T_4$ or $T_3$; otherwise, mf(T) remains
unchanged. Again, this is based on Lemma 1. Now let $v_3$ be an LCM on the resulting
network with d = 1. In the fifth and the sixth seconds, transmit $\gamma_1$ and $\gamma_2$ as the
information vectors using $v_3$. Then all the nodes in $T_1$ can recover $\beta$.

(4) Let $t_1$ and $t_2$ be two vectors in $F^4$ such that $\langle \{t_1, t_2\} \rangle$ intersects with
$\langle \{r, v_1(T)\} \rangle$ trivially for all T in $T_1$, i.e. $\langle \{t_1, t_2, r, v_1(T)\} \rangle = F^4$ for all T
in $T_1$. Define $\delta_1 = \alpha_1 t_1$ and $\delta_2 = \alpha_1 t_2$. In the seventh and eighth seconds,
transmit $\delta_1$ and $\delta_2$ as the information vectors using $v_3$. Since all the nodes in $T_1$
already know $b_1$, upon receiving $\delta_1$ and $\delta_2$, $\alpha_1$ can then be recovered.

(5) Define $\delta_3 = \alpha_2 t_1$ and $\delta_4 = \alpha_2 t_2$. In the ninth and tenth seconds, transmit
$\delta_3$ and $\delta_4$ as the information vectors using $v_3$. Then $\alpha_2$ can be recovered by all the
nodes in $T_1$.

(6) Define $\delta_5 = \alpha_3 t_1$ and $\delta_6 = \alpha_3 t_2$. In the eleventh and twelfth seconds, transmit
$\delta_5$ and $\delta_6$ as the information vectors using $v_3$. Then $\alpha_3$ can be recovered by all the
nodes in $T_1$.

So, in the ith second for i = 1, 2, 3, via the generic LCM $v_1$, each node in $T_4$ receives
all four dimensions of $\alpha_i$, each node in $T_3$ receives three dimensions of $\alpha_i$, and each node
in $T_1$ receives one dimension of $\alpha_i$. In the fourth second, via the generic LCM $v_2$, each
node in $T_3$ receives the vector $\beta$, which provides the three dimensions of $\alpha_1$, $\alpha_2$
and $\alpha_3$ (one dimension for each) that were missing during the first 3 s of multicast by $v_1$.
At the same time, each node in $T_1$ receives one dimension of $\beta$. Now, in order to recover
$\beta$, each node in $T_1$ needs to receive the two dimensions of $\beta$ that were missing during the
fourth second. This is achieved by the generic LCM $v_3$ in the fifth and sixth seconds. So
far, each node in $T_1$ has received one dimension of $\alpha_i$ for i = 1, 2, 3 via $v_1$ during the
first 3 s, and one dimension of $\alpha_i$ for i = 1, 2, 3 from $\beta$ via $v_2$ and $v_3$ during the fourth
to sixth seconds. Thus, it remains to provide the six missing dimensions of $\alpha_1$, $\alpha_2$ and
$\alpha_3$ (two dimensions for each) to each node in $T_1$, and this is achieved in the seventh to
the twelfth seconds via the generic LCM $v_3$. The previous scheme can be generalized to
arbitrary sets of max-flow values.
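
The bookkeeping of Example E1 can be checked with a few lines of arithmetic. The sketch below only counts dimensions per second; it does not construct the LCMs $v_1$, $v_2$, $v_3$. The phase table and the variable names are assumptions of this illustration, and `math.lcm` with several arguments requires Python 3.9 or later.

```python
from math import lcm

max_flows = (4, 3, 1)
n_symbols = lcm(*max_flows)                  # least common multiple: 12 symbols
broadcast_time = {mf: n_symbols // mf for mf in max_flows}

# (duration in seconds, dimension d of the LCM in use, node classes still listening)
phases = [
    (3, 4, (4, 3, 1)),   # seconds 1-3, LCM v1
    (1, 3, (3, 1)),      # second 4, LCM v2
    (8, 1, (1,)),        # seconds 5-12, LCM v3
]

# A node with max-flow mf gains min(mf, d) new dimensions per second.
received = dict.fromkeys(max_flows, 0)
for seconds, d, classes in phases:
    for mf in classes:
        received[mf] += seconds * min(mf, d)
```

Every class ends up with exactly 12 dimensions, i.e. all 12 symbols, after 3, 4 and 12 s respectively, which matches the rates mf(T) = 4, 3 and 1 symbols per second.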
19.5.3 The transmission scheme associated with an LCM

Let v be an LCM on a communication network (G, S), where the vectors v(SX) assigned
to outgoing channels SX linearly span a d-dimensional space. As before, the vector v(XY)
assigned to a channel XY is identified with a d-dimensional column vector over the base
field of $\Omega$ by means of the choice of a basis. On the other hand, the total information to be
transmitted from the source to the rest of the network is represented by a d-dimensional
row vector, called the information vector. Under the transmission scheme prescribed by
the LCM v, the data flowing over a channel XY is the matrix product of the information
vector with the column vector v(XY). We now consider the physical realization of this
transmission scheme associated with an LCM.

A communication network (G, S) is said to be acyclic if the directed multigraph G 
does not contain a directed cycle. The nodes on an acyclic communication network can 
be sequentially indexed such that every channel is from a smaller indexed node to a larger 
indexed node. On an acyclic network, a straightforward realization of the above transmission 
scheme is as follows. Take one node at a time according to the sequential indexing. For 
each node, 'wait' until data is received from every incoming channel before performing the 
linear encoding. Then send the appropriate data on each outgoing channel. This physical 
realization of an LCM over an acyclic network, however, does not apply to a network that 
contains a directed cycle. This is illustrated by the following example. 
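
The node-by-node realization on an acyclic network can be sketched as follows for the network of Figure 19.17(b). The local encoders in `LOCAL` are an assumed representation derived from Equation (19.58), the base field is taken to be GF(2), and `graphlib` (Python 3.9 or later) is used only to obtain a sequential indexing of the nodes.

```python
from graphlib import TopologicalSorter

P = 2  # assumption: base field GF(2)

# Global LCM vectors of Equation (19.58); channel names are tail + head.
V = {"ST": (1, 0), "TW": (1, 0), "TY": (1, 0),
     "SU": (0, 1), "UW": (0, 1), "UZ": (0, 1),
     "WX": (1, 1), "XY": (1, 1), "XZ": (1, 1)}

# Local encoders (hypothetical layout): each outgoing channel is a linear
# combination of the channels entering its tail node.
LOCAL = {"TW": {"ST": 1}, "TY": {"ST": 1},
         "UW": {"SU": 1}, "UZ": {"SU": 1},
         "WX": {"TW": 1, "UW": 1},
         "XY": {"WX": 1}, "XZ": {"WX": 1}}

def transmit(info):
    """Realize the LCM one node at a time, following an acyclic ordering."""
    deps = {}
    for ch in V:
        deps.setdefault(ch[1], set()).add(ch[0])    # head waits for tail
    data = {}
    for node in TopologicalSorter(deps).static_order():
        for ch in (c for c in V if c[0] == node):
            if node == "S":      # the source encodes the information vector
                data[ch] = sum(b * x for b, x in zip(info, V[ch])) % P
            else:                # all incoming symbols have already arrived
                data[ch] = sum(k * data[inc] for inc, k in LOCAL[ch].items()) % P
    return data

# The local realization agrees with the global product info * v(XY).
for info in [(0, 0), (0, 1), (1, 0), (1, 1)]:
    data = transmit(info)
    assert all(data[ch] == sum(b * x for b, x in zip(info, V[ch])) % P for ch in V)
```

On a network with a directed cycle `static_order()` raises `graphlib.CycleError`, which mirrors the observation that no node of the cycle can be the first to have received all of its inputs.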

Example 2 

Let p, q, and r be vectors in $\Omega$, where p and q are linearly independent. Define v(SX) =
p, v(SY) = q and v(WX) = v(XY) = v(YW) = r.

This specifies an LCM v on the network illustrated in Figure 19.18 if the vector r is a
linear combination of p and q. Otherwise, the function v gives an example in which the
law of information flow is observed for every single node but not observed for every set of
nodes. Specifically, the law of information flow is observed for each of the nodes X, Y and
W, but not for the set of nodes {X, Y, W}. Now, assume that $p = (1\ 0)^T$, $q = (0\ 1)^T$ and
$r = (1\ 1)^T$.

Then, v is an LCM. Write the information vector as $(b_1\ b_2)$, where $b_1$ and $b_2$ belong to
the base field of $\Omega$. According to the transmission scheme associated with the LCM, all
three channels on the directed cycle transmit the same data $b_1 + b_2$. This leads to the logical
problem of how any of these cyclic channels acquires the data $b_1 + b_2$ in the first place.

In order to discuss the transmission scheme associated with an LCM over a network
containing a directed cycle, we need to introduce the parameter of time into the scheme.
Instead of transmitting a single data symbol (i.e. an element of the base field of $\Omega$) through
each channel, we shall transmit a time-parameterized stream of symbols. In other words,
the channel will be time-slotted. As a consequence, the operation of coding at a node will
be time-slotted as well.

Figure 19.18 An LCM on a cyclic network.



19.5.4 Memoryless communication network 

Given a communication network (G, S) and a positive integer $\tau$, the associated memoryless
communication network, denoted as $(G^{(\tau)}, S)$, is defined as follows. The set of nodes in
$G^{(\tau)}$ includes the node S and all the pairs of the type [X, t], where X is a nonsource node
in G and t ranges through the integers 1 to $\tau$. The channels in the network $(G^{(\tau)}, S)$
belong to one of the three types listed below. For any nonsource nodes X and Y in (G, S):

(1) for $t \le \tau$, the multiplicity of the channel from S to [X, t] is the same as that of the
channel SX in the network (G, S);

(2) for $t < \tau$, the multiplicity of the channel from [X, t] to [Y, t + 1] is the same as
that of the channel XY in the network (G, S);

(3) for $t < \tau$, the multiplicity of the channel from [X, t] to [X, $\tau$] is equal to
$\mathrm{maxflow}_G(X) = \mathrm{mf}_G(X)$.



Lemma 2

The memoryless communication network $(G^{(\tau)}, S)$ is acyclic.

Lemma 3

There exists a fixed number $\varepsilon$, independent of $\tau$, such that for all nonsource nodes X in
(G, S), the maximum volume of a flow from S to the node [X, $\tau$] in $(G^{(\tau)}, S)$ is at least
$\tau - \varepsilon$ times $\mathrm{mf}_G(X)$. For proof see Li et al. [47].

Transmission of data symbols over the network $(G^{(\tau)}, S)$ may be interpreted as
'memoryless' transmission of data streams over the network (G, S) as follows:

(1) A symbol sent from S to [X, t] in $(G^{(\tau)}, S)$ corresponds to the symbol sent on the
channel SX in (G, S) during the time slot t.

(2) A symbol sent from [X, t] to [Y, t + 1] in $(G^{(\tau)}, S)$ corresponds to the symbol sent
on the channel XY in (G, S) during the time slot t + 1. This symbol is a linear
combination of symbols received by X during the time slot t and is unrelated to
symbols received earlier by X.

(3) The channels from [X, t] to [X, $\tau$] for $t < \tau$ signify the accumulation of received
information by the node X in (G, S) over time.

Since this is an acyclic network, the LCM on the network $(G^{(\tau)}, S)$ can be physically
realized in the way mentioned above. The physical realization can then be interpreted as a
memoryless transmission of data streams over the original network (G, S).
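
The construction of $(G^{(\tau)}, S)$ can be illustrated on the cyclic network of Figure 19.18, whose channels SX, SY, XY, YW and WX are taken from Example 2. In this sketch the function names and the representation of the pair [X, t] as a Python tuple are assumptions, and the max-flow values in `mf18` were worked out by hand for this network rather than quoted from the text.

```python
from graphlib import TopologicalSorter, CycleError

def expand_memoryless(channels, mf, tau, source="S"):
    """Channel list of the time-expanded network; node [X, t] is the tuple (X, t)."""
    expanded = []
    for x, y in channels:
        for t in range(1, tau + 1):
            if x == source:
                expanded.append((source, (y, t)))          # type (1), t <= tau
            elif t < tau:
                expanded.append(((x, t), (y, t + 1)))      # type (2), t < tau
    for x, m in mf.items():
        for t in range(1, tau):
            expanded += [((x, t), (x, tau))] * m           # type (3), multiplicity mf(X)
    return expanded

def is_acyclic(channels):
    """True if the directed multigraph has no directed cycle."""
    deps = {}
    for tail, head in channels:
        deps.setdefault(head, set()).add(tail)
    try:
        list(TopologicalSorter(deps).static_order())
        return True
    except CycleError:
        return False

# Figure 19.18: the cycle X -> Y -> W -> X fed by the source through SX and SY.
g18 = [("S", "X"), ("S", "Y"), ("X", "Y"), ("Y", "W"), ("W", "X")]
mf18 = {"X": 2, "Y": 2, "W": 1}          # hand-computed max-flow values (assumption)
g18_tau3 = expand_memoryless(g18, mf18, tau=3)
```

For $\tau = 3$ the expansion has 6 channels of type (1), 6 of type (2) and 10 of type (3), and the topological sort succeeds although the original network is cyclic, in line with Lemma 2.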
19.5.5 Network with memory

In this case we have to slightly modify the associated acyclic network.

Definition 1

Given a communication network (G, S) and a positive integer $\tau$, the associated
communication network with memory, denoted as $(G^{[\tau]}, S)$, is defined as follows. The set
of nodes in $G^{[\tau]}$ includes the node S and all pairs of the type [X, t], where X is a nonsource
node in G and t ranges through the integers 1 to $\tau$. Channels in the network $(G^{[\tau]}, S)$
belong to one of the three types listed below. For any nonsource nodes X and Y in (G, S):

(1) for $t \le \tau$, the multiplicity of the channel from S to [X, t] is the same as that of the
channel SX in the network (G, S);

(2) for $t < \tau$, the multiplicity of the channel from [X, t] to [Y, t + 1] is the same as that
of the channel XY in the network (G, S);

(3) for $t < \tau$, the multiplicity of channels from [X, t] to [X, t + 1] is equal to
$t \times \mathrm{mf}_G(X)$.

The network so defined has the following properties:

(4) The communication network $(G^{[\tau]}, S)$ is acyclic.

(5) Every flow from the source to the node X in the network $(G^{(\tau)}, S)$ corresponds to
a flow with the same volume from the source to the node [X, $\tau$] in the network
$(G^{[\tau]}, S)$.

(6) Every LCM v on the network $(G^{(\tau)}, S)$ corresponds to an LCM u on the network
$(G^{[\tau]}, S)$ such that for all nodes X in G: $\dim[u([X, \tau])] = \dim[v(X)]$.



19.5.6 Construction of a generic LCM on an acyclic network 

Let the nodes in the acyclic network be sequentially indexed as $X_0 = S, X_1, X_2, \ldots, X_n$
such that every channel is from a smaller indexed node to a larger indexed node. The
following procedure constructs an LCM by assigning a vector v(XY) to each channel XY,
one channel at a time.

{
    for all channels XY
        v(XY) = the zero vector;            // initialization
    for (j = 0; j < n; j++)
    {
        arrange all outgoing channels X_jY from X_j in an arbitrary order;
        take one outgoing channel from X_j at a time
        {
            let the channel taken be X_jY;
            choose a vector w in the space v(X_j) such that
                w is not in <v(UZ) : UZ in xi> for any collection xi
                of at most d - 1 channels with
                v(X_j) not contained in <v(UZ) : UZ in xi>;
            v(X_jY) = w;
        }
        v(X_{j+1}) = the linear span by vectors v(XX_{j+1})
            on all incoming channels XX_{j+1} to X_{j+1};
    }
}

The essence of the above procedure is to construct the generic LCM iteratively and make 
sure that in each step the partially constructed LCM is generic. 
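
A brute-force version of the greedy procedure can be written for very small networks. The sketch below works over GF(p) for a prime p, enumerates every candidate vector w and every collection of at most d - 1 already assigned channels, and is therefore exponential in cost; it is meant only to make the selection rule concrete. All function names, the (tail, head) channel representation and the choice p = 2 in the usage lines are assumptions of this illustration.

```python
from itertools import combinations, product

def rank(vectors, p):
    """Rank over GF(p), p prime, by Gaussian elimination."""
    rows = [[x % p for x in v] for v in vectors]
    r = 0
    for c in range(len(rows[0]) if rows else 0):
        pivot = next((i for i in range(r, len(rows)) if rows[i][c]), None)
        if pivot is None:
            continue
        rows[r], rows[pivot] = rows[pivot], rows[r]
        inv = pow(rows[r][c], -1, p)
        rows[r] = [(x * inv) % p for x in rows[r]]
        for i in range(len(rows)):
            if i != r and rows[i][c]:
                f = rows[i][c]
                rows[i] = [(a - f * b) % p for a, b in zip(rows[i], rows[r])]
        r += 1
    return r

def inside(vectors, span, p):
    """True if every vector of `vectors` lies in the linear span of `span`."""
    return rank(list(span) + list(vectors), p) == rank(list(span), p)

def greedy_generic_lcm(order, channels, d, p):
    """order: nodes in acyclic order, source first; channels: (tail, head) pairs."""
    v = {}                                               # channel index -> vector
    space = {order[0]: [tuple(int(i == j) for j in range(d)) for i in range(d)]}
    for node in order:
        if node != order[0]:                             # span of incoming channels
            space[node] = [v[i] for i, (_, h) in enumerate(channels) if h == node]
        for i, (tail, _) in enumerate(channels):
            if tail != node:
                continue
            assigned = list(v.values())
            # collections xi of at most d-1 channels whose span does not contain v(node)
            avoid = [xi for m in range(d) for xi in combinations(assigned, m)
                     if not inside(space[node], xi, p)]
            for coeffs in product(range(p), repeat=len(space[node])):
                w = tuple(sum(c * b[k] for c, b in zip(coeffs, space[node])) % p
                          for k in range(d))
                if all(not inside([w], xi, p) for xi in avoid):
                    v[i] = w
                    break
            else:
                raise ValueError("base field too small for the greedy choice")
    return v, space

butterfly = [("S", "T"), ("S", "U"), ("T", "W"), ("T", "Y"), ("U", "W"),
             ("U", "Z"), ("W", "X"), ("X", "Y"), ("X", "Z")]
v, space = greedy_generic_lcm(list("STUWXYZ"), butterfly, d=2, p=2)
dims = {n: rank(space[n], 2) for n in "TUWXYZ"}
```

On the network of Figure 19.17(b) the resulting dimensions equal the max-flow values (2 at W, Y and Z, and 1 at T, U and X), as Theorem T1 predicts for a generic LCM. For larger networks the base field generally has to be larger, which is what the `ValueError` branch signals.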

19.5.7 Time-invariant LCM and heuristic construction 

In order to handle delays, we can use an element $a(z)$ of $F[(z)]$ to represent the z-transform
of a stream of symbols $a_0, a_1, a_2, \ldots, a_t, \ldots$ that are sent on a channel, one symbol at
a time. The formal variable z is interpreted as a unit-time shift. In particular, the vector
assigned to an outgoing channel from a node is z times a linear combination of the vectors
assigned to incoming channels to the same node. Hence a TILCM is completely determined
by the vectors that it assigns to channels. On the communication network illustrated in
Figure 19.18, define the TILCM v as

$$v(SX) = (1\ 0)^T, \quad v(SY) = (0\ 1)^T, \quad v(XY) = (z\ z^3)^T,$$

$$v(YW) = (z^2\ z)^T \quad \text{and} \quad v(WX) = (z^3\ z^2)^T$$

By formal transformation we have, for example,

$$v(XY) = (z\ z^3)^T = z(1 - z^3)(1\ 0)^T + z(z^3\ z^2)^T = z[(1 - z^3)v(SX) + v(WX)]$$

Thus, v(XY) is equal to z times the linear combination of v(SX) and v(WX) with coefficients
$1 - z^3$ and 1, respectively. This specifies an encoding process for the channel XY that does
not change with time. It can be seen that the same is true for the encoding process of every
other channel in the network. This explains the terminology 'time-invariant' for an LCM.
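
The identity for v(XY) can be verified with elementary polynomial arithmetic. In the sketch below a polynomial in z is a list of integer coefficients, lowest power first. The coefficients used for the channels YW and WX are not stated in the text; they were worked out for this illustration and should be read as an assumption consistent with the vectors given above.

```python
def trim(a):
    """Drop trailing zero coefficients, keeping at least one entry."""
    while len(a) > 1 and a[-1] == 0:
        a = a[:-1]
    return a

def padd(a, b):
    """Sum of two polynomials."""
    n = max(len(a), len(b))
    return trim([(a[i] if i < len(a) else 0) + (b[i] if i < len(b) else 0)
                 for i in range(n)])

def pmul(a, b):
    """Product of two polynomials."""
    out = [0] * (len(a) + len(b) - 1)
    for i, x in enumerate(a):
        for j, y in enumerate(b):
            out[i + j] += x * y
    return trim(out)

def z_pow(k):
    """The monomial z**k."""
    return [0] * k + [1]

ZERO, ONE, Z = [0], [1], z_pow(1)
ONE_MINUS_Z3 = [1, 0, 0, -1]

v = {"SX": (ONE, ZERO), "SY": (ZERO, ONE),
     "XY": (z_pow(1), z_pow(3)),
     "YW": (z_pow(2), z_pow(1)),
     "WX": (z_pow(3), z_pow(2))}

def encode(terms):
    """z times the linear combination sum(coef * v[ch]) over incoming channels."""
    total = (ZERO, ZERO)
    for coef, ch in terms:
        total = tuple(padd(t, pmul(coef, comp)) for t, comp in zip(total, v[ch]))
    return tuple(pmul(Z, t) for t in total)

xy = encode([(ONE_MINUS_Z3, "SX"), (ONE, "WX")])   # identity stated in the text
yw = encode([(ONE_MINUS_Z3, "SY"), (ONE, "XY")])   # worked out here (assumption)
wx = encode([(ONE, "YW")])                         # worked out here (assumption)
```

All three results coincide with the vectors of the TILCM, so each channel on the cycle is indeed z times a fixed combination of the channels entering its tail node.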

To obtain further insight into the physical process write the information vector as
[a(z) b(z)], where

$$a(z) = \sum_{j \ge 0} a_j z^j \quad \text{and} \quad b(z) = \sum_{j \ge 0} b_j z^j$$

belong to $F[(z)]$. The product of the information (row) vector with the (column) vector
assigned to a channel represents the data stream transmitted over that channel:

$$[a(z)\ b(z)] \cdot v(SX) = a(z) \rightarrow (a_0, a_1, a_2, a_3, a_4, a_5, \ldots, a_t, \ldots)$$

$$[a(z)\ b(z)] \cdot v(SY) = b(z) \rightarrow (b_0, b_1, b_2, b_3, b_4, b_5, \ldots, b_t, \ldots)$$

$$[a(z)\ b(z)] \cdot v(XY) = z a(z) + z^3 b(z) \rightarrow (0, a_0, a_1, a_2 + b_0, a_3 + b_1, a_4 + b_2, \ldots, a_{t-1} + b_{t-3}, \ldots)$$

$$[a(z)\ b(z)] \cdot v(YW) = z^2 a(z) + z b(z) \rightarrow (0, b_0, a_0 + b_1, a_1 + b_2, a_2 + b_3, a_3 + b_4, \ldots, a_{t-2} + b_{t-1}, \ldots)$$

$$[a(z)\ b(z)] \cdot v(WX) = z^3 a(z) + z^2 b(z) \rightarrow (0, 0, b_0, a_0 + b_1, a_1 + b_2, a_2 + b_3, \ldots, a_{t-3} + b_{t-2}, \ldots)$$

Adopt the convention that $a_t = b_t = 0$ for all $t < 0$. Then the data symbol flowing over the
channel XY, for example, at the time slot t is $a_{t-1} + b_{t-3}$ for all $t \ge 0$. If infinite loops are
allowed, then the previous definition of TILCM v is modified as follows:



$$v(SX) = (1\ 0)^T, \quad v(SY) = (0\ 1)^T$$

$$v(XY) = (1 - z^3)^{-1}(z\ z^3)^T, \quad v(YW) = (1 - z^3)^{-1}(z^2\ z)^T$$

and

$$v(WX) = (1 - z^3)^{-1}(z^3\ z^2)^T$$

The data stream transmitted over the channel XY, for instance, is represented by

$$\Big( \sum_{i \ge 0} a_i z^i \ \ \sum_{i \ge 0} b_i z^i \Big) \cdot v(XY) = \sum_{i \ge 0} \big( a_i z^{i+1} + b_i z^{i+3} \big) / (1 - z^3)$$

$$= \sum_{i \ge 0} \big( a_i z^{i+1} + b_i z^{i+3} \big) \sum_{j \ge 0} z^{3j} = \sum_{i \ge 0} \sum_{j \ge 0} \big( a_i z^{i+3j+1} + b_i z^{i+3j+3} \big)$$

$$= \sum_{t \ge 0} z^t \big( a_{t-1} + a_{t-4} + a_{t-7} + \cdots + b_{t-3} + b_{t-6} + b_{t-9} + \cdots \big)$$

That is, the data symbol $a_{t-1} + a_{t-4} + a_{t-7} + \cdots + b_{t-3} + b_{t-6} + b_{t-9} + \cdots$ is sent on the
channel XY at the time slot t. This TILCM v, besides being time invariant in nature, is a
'memoryless' one because the following linear equations allow an encoding mechanism
that requires no memory:

$$v(XY) = z\,v(SX) + z\,v(WX)$$

$$v(YW) = z\,v(SY) + z\,v(XY) \qquad (19.60)$$

and

$$v(WX) = z\,v(YW)$$
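
The memoryless encoders of Equation (19.60) can be simulated slot by slot and compared with the closed-form symbol on channel XY. The sketch below uses plain integers in place of base-field elements, assumes that the channels SX and SY carry $a_t$ and $b_t$ in slot t, and treats symbols outside the input range as zero; the function names are illustrative.

```python
def symbol(stream, t):
    """Stream value at slot t, zero outside the transmitted range."""
    return stream[t] if 0 <= t < len(stream) else 0

def simulate(a, b, slots):
    """Each node outputs, in slot t, the sum of what it received in slot t - 1."""
    xy, yw, wx = [0] * slots, [0] * slots, [0] * slots
    for t in range(1, slots):
        xy[t] = symbol(a, t - 1) + wx[t - 1]    # v(XY) = z v(SX) + z v(WX)
        yw[t] = symbol(b, t - 1) + xy[t - 1]    # v(YW) = z v(SY) + z v(XY)
        wx[t] = yw[t - 1]                       # v(WX) = z v(YW)
    return xy, yw, wx

def closed_form_xy(a, b, t):
    """a_{t-1} + a_{t-4} + ... + b_{t-3} + b_{t-6} + ... as given in the text."""
    return sum(symbol(a, t - 1 - 3 * k) + symbol(b, t - 3 - 3 * k)
               for k in range(t // 3 + 1))

a = [1, 2, 3, 4, 5, 6, 7, 8]
b = [10, 20, 30, 40, 50, 60, 70, 80]
xy, yw, wx = simulate(a, b, slots=12)
assert all(xy[t] == closed_form_xy(a, b, t) for t in range(12))
```

No encoder needs to store anything beyond the symbols that arrived in the previous slot; the older terms $a_{t-4}, b_{t-6}, \ldots$ reach XY only because they keep circulating on the cycle.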



19.5.7.1 TILCM construction

There are potentially various ways to define a generic TILCM and to establish desirable
dimensions of the module assigned to every node. In this section we present a heuristic
construction procedure based on graph-theoretical block decomposition of the network. For 
the sake of computational efficiency, the procedure will first remove 'redundant' channels 
from the network before identifying the 'blocks' so that the 'blocks' are smaller. A channel 
in a communication network is said to be irredundant if it is on a simple path starting at 
the source, otherwise, it is said to be redundant. Moreover, a communication network is 
said to be irredundant if it contains no redundant channels. In the network illustrated in 
Figure 19.19, the channels ZX, TX and TZ are redundant. 

Figure 19.19 Redundant channels in a network.

The deletion of a redundant channel from a network results in a subnetwork with the same
set of irredundant channels. Consequently, the irredundant channels in a network define an
irredundant subnetwork. It can also be shown that, if v is an LCM (respectively, a TILCM)
on a network, then v also defines an LCM (respectively, a TILCM) on the subnetwork
that results from the deletion of any redundant channel. In addition we say that two nodes
are equivalent if there exists a directed path leading from one node to the other and vice
versa. An equivalence class under this relationship is called a block in the graph. The source
node by itself always forms a block. When every block 'contracts' into a single node, the
resulting graph is acyclic. In other words, the blocks can be sequentially indexed so that
every interblock channel is from a smaller indexed block to a larger indexed block.
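
The block decomposition is the partition of the nodes into classes of mutually reachable nodes. The sketch below computes it by plain reachability for the network of Figure 19.18, whose channels are known from Example 2. The function names are assumptions of this illustration, and the quadratic search is chosen for brevity rather than efficiency.

```python
def reachable(channels, start):
    """Set of nodes reachable from `start` along directed channels."""
    seen, stack = {start}, [start]
    while stack:
        x = stack.pop()
        for tail, head in channels:
            if tail == x and head not in seen:
                seen.add(head)
                stack.append(head)
    return seen

def blocks(channels):
    """Equivalence classes of mutually reachable nodes."""
    nodes = {n for ch in channels for n in ch}
    reach = {n: reachable(channels, n) for n in nodes}
    found = []
    for n in sorted(nodes):
        if any(n in blk for blk in found):
            continue
        found.append({m for m in nodes if m in reach[n] and n in reach[m]})
    return found

# Figure 19.18: source S feeding the directed cycle X -> Y -> W -> X.
g18 = [("S", "X"), ("S", "Y"), ("X", "Y"), ("Y", "W"), ("W", "X")]
g18_blocks = blocks(g18)
```

For this network the result is the two blocks {S} and {W, X, Y}, and contracting each block to a single node leaves one interblock direction, from the source block to the cycle.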

For the construction of a 'good' TILCM, smaller sizes of blocks tend to facilitate the
computation. The extreme favorable case of the block decomposition of a network is when
the network is acyclic, which implies that every block consists of a single node. The opposite
extreme is when all nonsource nodes form a single block, exemplified by the network
illustrated in Figure 19.19. The removal of redundant channels sometimes serves the purpose
of breaking up a block into pieces. For the network illustrated in Figure 19.19, the removal
of the three redundant channels breaks the block {T, W, X, Y, Z} into the three blocks
{T}, {W, X, Y} and {Z}.

In the construction of a 'good' LCM on an acyclic network, as before, the procedure
takes one node at a time according to the acyclic ordering of nodes and assigns vectors to
outgoing channels from the taken node. For a general network, we can start with the trivial
TILCM v on the network consisting of just the source and then expand it to a 'good' TILCM
v that covers one more block at a time.

The sequential choices of blocks are according to the acyclic order in the block
decomposition of the network. Thus, the expansion of the 'good' TILCM v at each step involves
only incoming channels to nodes in the new block. A heuristic algorithm for assigning
vectors v(XY) to such channels XY is for v(XY) to be z times an arbitrary convenient linear
combination of vectors assigned to incoming channels to X. In this way, a system of linear
equations of the form $Ax = b$ is set up, where A is a square matrix with the dimension
equal to the total number of channels in the network and x is the unknown column vector
whose entries are v(XY) for all channels XY. The elements of A and b are polynomials in z.
In particular, the elements of A are either $\pm 1$, 0, or a polynomial in z containing the factor z.
Therefore, the determinant of A is a formal power series with the constant term (the zeroth
power of z) being $\pm 1$, and hence is invertible in F(z). According to Cramer's rule, a unique
solution exists. This is consistent with the physical intuition because the whole network
is completely determined once the encoding process for each channel is specified. If this
unique solution does not happen to satisfy the requirement for being a 'good' TILCM, then
the heuristic algorithm calls for adjustments of the coefficients of the linear equations on
a trial-and-error basis.

After a 'good' TILCM is constructed on the subnetwork formed by irredundant channels 
in a given network, we may simply assign the zero vectors to all redundant channels. 

Example 

After the removal of redundant channels, the network depicted by Figure 19.19 consists 
of four blocks in the order of {S}, {W, X, Y}, {Z} and {T}. The subnetwork consisting 
of the first two blocks is the same as the network in Figure 19.18. When we expand the 
trivial TILCM on the network consisting of just the source to cover the block {W, X, Y}, 
a heuristic trial would be 

$$v(SX) = (1 \;\; 0)^T, \qquad v(SY) = (0 \;\; 1)^T$$

together with the following linear equations: 

$$v(XY) = z\,v(SX) + z\,v(WX)$$
$$v(YW) = z\,v(SY) + z\,v(XY)$$

and 

$$v(WX) = z\,v(YW).$$

The result is the memoryless TILCM v in the preceding example. This TILCM can be 
further expanded to cover the block {Z} and then the block {T}. 
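
Because every unknown in the linear equations of this example is multiplied by z, the system can be solved coefficient by coefficient as a formal power series. The following Python sketch illustrates this under stated assumptions: the function names and the truncation order `N_TERMS` are hypothetical, and coefficients are handled as plain integers (a simplification that does not model arithmetic in a specific field F). Repeated substitution fixes at least one more power of z per pass.

```python
N_TERMS = 12  # truncation order of the power series (illustrative choice)


def times_z(series):
    """Multiply a truncated power series by z (shift coefficients up by one)."""
    return [0] + series[:-1]


def add(p, q):
    """Add two truncated power series coefficient by coefficient."""
    return [a + b for a, b in zip(p, q)]


def z_times_sum(u, v):
    """Return z * (u + v) for two-component vectors of power series."""
    return tuple(times_z(add(a, b)) for a, b in zip(u, v))


def solve_tilcm(n_terms=N_TERMS):
    """Solve v(XY), v(YW), v(WX) of the example by repeated substitution."""
    zero = [0] * n_terms
    one = [1] + [0] * (n_terms - 1)
    null = (zero, zero)
    v = {"SX": (one, zero), "SY": (zero, one),
         "XY": null, "YW": null, "WX": null}
    for _ in range(n_terms):
        v = {
            "SX": v["SX"],
            "SY": v["SY"],
            "XY": z_times_sum(v["SX"], v["WX"]),  # v(XY) = z v(SX) + z v(WX)
            "YW": z_times_sum(v["SY"], v["XY"]),  # v(YW) = z v(SY) + z v(XY)
            "WX": z_times_sum(null, v["YW"]),     # v(WX) = z v(YW)
        }
    return v


solution = solve_tilcm()
```

Each list holds the coefficients of z^0, z^1, ... for one vector component, so a periodic pattern in a list corresponds to a rational function with denominator 1 - z^3.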

19.6 CAPACITY OF WIRELESS NETWORKS USING MIMO TECHNOLOGY 

In this section an information theoretic network objective function is formulated, which 
takes full advantage of multiple input multiple output (MIMO) channels. The demand for 
efficient data communications has fueled a tremendous amount of research into maximizing 
the performance of wireless networks. Much of that work has gone into enhancing the design 
of the receiver; however, considerable gains can be achieved by optimizing the transmitter. 

A key tool for optimizing transmitter performance is the use of channel reciprocity. 
Exploitation of channel reciprocity makes the design of optimal transmit weights far simpler. It 
has been recognized that network objective functions that can be characterized as functions 
of the output signal-to-interference noise ratio (SINR) are well suited to the exploitation 
of reciprocity, since it permits us to relate the uplink and downlink objective functions 
[48-50]. It is desirable, however, to relate the network objective function to information 
theory directly, rather than ad-hoc SINR formulations, due to the promise of obtaining 
optimal channel capacity. This section demonstrates a link between the information the- 
oretic, Gaussian interference channel [51] and a practical network objective function that 
can be optimized in a simple fashion and can exploit channel reciprocity. The formulation 
of the objective function permits a water filling [52] optimal power control solution, that 
can exploit multipath modes, MIMO channels and multipoint networks. 

Consider the network of transceivers suggested by Figure 19.20. Each transceiver is 
labeled by a node number. The nodes are divided into two groups. The group 1 nodes 
transmit in a given time-slot, while the group 2 nodes receive. In alternate time-slots, group 
2 nodes transmit, while group 1 nodes receive, according to a time division duplex (TDD) 
transmission scheme. 

Figure 19.20 Network layout. 

The channel is assumed to be channelized into several narrow band frequency channels, 
whose bandwidth is small enough so that all multipath is assumed to add coherently. This 
assumption allows the channel between any transmit antenna and any receive antenna to be 
modeled as a single complex gain and is consistent with an orthogonal frequency division 
multiplexing (OFDM) modulation scheme. All transceivers are assumed to have either 
multiple antennas or polarization diversity, for both transmit and receive processing. The 
link connecting nodes may experience multipath reflections. For this analysis we assume 
that the receiver synchronizes to a given transmitter and removes any propagation delay 
offsets. The channel, for a given narrow band, between the transmitter and receiver arrays 
is therefore modeled as a complex matrix multiply. 

Let $\ell_1$ be an index into the group 1 transceivers, and $\ell_2$ an index into the group 2 
transceivers. When node $\ell_2$ is transmitting during the downlink, we model the channel between 
the two nodes as a complex matrix $\mathbf{H}_{12}(\ell_1, \ell_2)$ of dimension 
$M_1(\ell_1) \times M_2(\ell_2)$, where $M_1(\ell_1)$ is the size of the antenna array at node 
$\ell_1$, and $M_2(\ell_2)$ is the size of the array at node $\ell_2$. 
Polarization diversity is treated like additional antennae. In the next TDD time slot, during 
the uplink, node $\ell_1$ will transmit and the channel from $\ell_1$ to $\ell_2$ is described by the 
complex $M_2(\ell_2) \times M_1(\ell_1)$ matrix $\mathbf{H}_{21}(\ell_2, \ell_1)$. For every node pair 
$(\ell_1, \ell_2)$ that forms a communications link in the network, we assign a MIMO channel link 
number, indexed by k or m, in order to label all such connections established by the network. 
Not every node pair is necessarily assigned a link number, only those that actually communicate. We 
also define the mapping from the MIMO link number to the associated group 1 node, $\ell_1(k)$, 
and the associated group 2 node, $\ell_2(k)$, by the association of k with the link 
$[\ell_1(k), \ell_2(k)]$. 
Because each channel is MIMO, a given node will transmit multiple symbols over possibly 
more than one transmission mode. The set of all transmission modes over the entire network 
is indexed by q and p. This index represents all the symbol streams that are transmitted from 
one node to another and therefore represents a lower level link number. Each low-level link 
number maps to its MIMO channel link via the mapping k(q). Because our network will 
exploit channel reciprocity, we assume that every uplink symbol stream indexed by q has 
an associated downlink symbol stream assigned to the same index. The q-th uplink symbol 
stream is spread by an $M_1(q) \times 1$ complex vector $\mathbf{g}_1(q)$, where 
$M_1(q) = M_1\{\ell_1[k(q)]\}$. 
Similarly $\mathbf{g}_2(q)$ is the associated transmit vector for the downlink. 
For each node, we group the transmit vectors into a single matrix, 

$$\mathbf{G}_2(k) = [\mathbf{g}_2(q_1), \mathbf{g}_2(q_2), \ldots, \mathbf{g}_2(q_{M_c(k)})] \tag{19.61}$$

and 

$$\mathbf{G}_1(k) = [\mathbf{g}_1(q_1), \mathbf{g}_1(q_2), \ldots, \mathbf{g}_1(q_{M_c(k)})] \tag{19.62}$$

where $k(q_i) = k$ and there are $M_c(k)$ transmission modes associated with MIMO link k. 
With these conventions the signal model can be written as: 

$$\mathbf{x}_1(n; k) = \mathbf{i}_1(n; k) + \mathbf{H}_{12}(k, k)\mathbf{G}_2(k)\mathbf{d}_2(n; k) \tag{19.63}$$

$$\mathbf{x}_2(n; k) = \mathbf{i}_2(n; k) + \mathbf{H}_{21}(k, k)\mathbf{G}_1(k)\mathbf{d}_1(n; k) \tag{19.64}$$

where $\mathbf{x}_1(n; k)$ is the received complex data vector at sample n and node $\ell_1(k)$, 
$\mathbf{H}_{12}(k, m) \equiv \mathbf{H}_{12}[\ell_1(k), \ell_2(m)]$ is the 
$M_1[\ell_1(k)] \times M_2[\ell_2(m)]$ complex MIMO channel matrix for downlink transmission, 
n is a time/frequency index that represents an independent 
reuse of the channel, either due to adjacent frequency channels (e.g. adjacent OFDM channels) 
or due to multiple independent time samples, $\mathbf{i}_1(n; k)$ is the interference vector seen 
at node $\ell_1(k)$ due to the other transmitting nodes as well as due to background radiation, 
and $\mathbf{d}_2(n; k)$ is the $M_c(k) \times 1$ downlink information symbol vector, transmitted 
for sample n. The analogous model for the uplink case is shown in Equation (19.64). The 
interference vector can be written as 

$$\mathbf{i}_r(n; k) = \sum_{m \neq k} \mathbf{H}_{rt}(k, m)\mathbf{G}_t(m)\mathbf{d}_t(n; m) + \boldsymbol{\varepsilon}_r(n, k) \tag{19.65}$$

where $\boldsymbol{\varepsilon}_r(n, k)$, $r \in \{1, 2\}$, is the background radiation noise vector 
seen by the receiving node $\ell_r(k)$ and $t \in \{2, 1\}$ is the transmission timeslot indicator. 
The convention is adopted that t = 2 if r = 1, and t = 1 if r = 2. 

19.6.1 Capacity metrics 

In this section we will be interested in the asymptotic maximum achievable throughput 
under certain network constraints: 

(1) We will assume that signal energy from all the nodes in the network, other than the 
one from the link being processed, is treated as interference. This is due to difficulty 
of synchronizing a node to multiple interference sources and reduces the complexity 
of the receiver. 

(2) The interference $\mathbf{i}_r(n; k)$ is modeled as complex circular Gaussian noise. The 
superposition of many distant interferers will appear Gaussian by the central limit 
theorem. 

Considering therefore the channel model described by Equations (19.63) and (19.65), we 
can write the mutual information between the source data vector and the received data vector 
as [52, 53]: 

$$I[\mathbf{x}_r(k); \mathbf{d}_t(k)] = \log_2 \left| \mathbf{I} + \mathbf{R}_{i_r i_r}^{-1}(k)\,\mathbf{R}_{s_r s_r}(k) \right| \equiv C_{rt}(k; G_t) \tag{19.66}$$

where $G_t \equiv \{\mathbf{g}_t(q)\}$ represents all transmit weights stacked into a single parameter 
vector and where we neglect the n dependency for the random vectors $\mathbf{x}_r(n; k)$ and 
$\mathbf{d}_t(n; k)$. The interference and the signal covariance matrices are defined as: 

$$\mathbf{R}_{i_r i_r}(k) = \sum_{m \neq k} \mathbf{A}_{rt}(k, m)\mathbf{A}_{rt}^H(k, m) + \mathbf{R}_{\varepsilon_r \varepsilon_r}(k) \tag{19.67}$$

$$\mathbf{A}_{rt}(k, m) = \mathbf{H}_{rt}(k, m)\mathbf{G}_t(m) \tag{19.68}$$

$$\mathbf{R}_{s_r s_r}(k) = \mathbf{A}_{rt}(k, k)\mathbf{A}_{rt}^H(k, k), \tag{19.69}$$

and where $\mathbf{R}_{\varepsilon_r \varepsilon_r}(k)$ is the covariance of the background noise vector 
$\boldsymbol{\varepsilon}_r(n, k)$. The covariance of the source statistics is assumed to be governed 
by the transmit weights $\mathbf{G}_t(k)$; therefore the covariance of $\mathbf{d}_t(k)$ is assumed 
to be the identity matrix. 

Let us now consider the introduction of complex linear beam-forming weights at the 
receiver. For each low level link q, we assign a receive weight vector $\mathbf{w}_r(q)$, 
$r \in \{1, 2\}$, for use at receiver $\ell_r[k(q)]$. The weights, in theory, are used to copy the 
transmitted information symbol, 

$$\hat{d}_t(n; q) = \mathbf{w}_r^H(q)\,\mathbf{x}_r(n; k) \tag{19.70}$$
$$\hat{d}_t(n; q) = \mathbf{w}_r^H(q)\,\mathbf{i}_r(n; k) + \mathbf{w}_r^H(q)\,\mathbf{A}_{rt}(k, k)\,\mathbf{d}_t(n; k) \tag{19.71}$$

where k(q) = k, 

$$\mathbf{d}_t(n; k) = [d_t(q_1), d_t(q_2), \ldots, d_t(q_{M_c(k)})]^T \tag{19.72}$$

and $k(q_j) = k$. The vector version of this can be written as: 

$$\hat{\mathbf{d}}_t(n; k) = \mathbf{W}_r^H(k)\,\mathbf{i}_r(n; k) + \mathbf{W}_r^H(k)\,\mathbf{A}_{rt}(k, k)\,\mathbf{d}_t(n; k) \tag{19.73}$$

where 

$$\mathbf{W}_r(k) = [\mathbf{w}_r(q_1), \mathbf{w}_r(q_2), \ldots, \mathbf{w}_r(q_{M_c(k)})] \tag{19.74}$$

and $k(q_j) = k$. The structure of the transceiver channel model is illustrated in Figure 19.21 
for the uplink case, highlighting the first $M_c(k)$ low-level links, all assumed to be associated 
with the same MIMO channel link. The MIMO channel link index k and sample index n are 
omitted for simplicity. The downlink case is the same after the 1 and 2 indices are swapped. 

Owing to the additional processing [52], the mutual information either remains the same 
or is reduced by the application of the linear receive weights, 

$$I[\hat{\mathbf{d}}_t(k); \mathbf{d}_t(k)] \le I[\mathbf{x}_r(k); \mathbf{d}_t(k)] \tag{19.75}$$
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Figure 19.21 Uplink transceiver channel model. 

In addition to the inequality in Equation (19.75), it can also be shown [54, 55] that, if the 
source symbols $d_t(q)$ are mutually independent, then 

$$I[\hat{\mathbf{d}}_t(k); \mathbf{d}_t(k)] \ge \sum_{q:\,k(q)=k} I[\hat{d}_t(q); d_t(q)] \tag{19.76}$$

The lower bound in Equation (19.76) is referred to as the decoupled capacity and can be 
written as: 

$$D_{rt}(k; W_r, G_t) \equiv \sum_{q:\,k(q)=k} I[\hat{d}_t(q); d_t(q)] \tag{19.77}$$

where the collection of all receive weights is stacked into the parameter vector 
$W_r \equiv \{\mathbf{w}_r(q)\}$. Equations (19.75) and (19.76) demonstrate that 

$$C_{rt}(k; G_t) \ge D_{rt}(k; W_r, G_t) \tag{19.78}$$

The decoupled capacity is implicitly a function of the transmit weights $\mathbf{g}_t(q)$ and the 
receive weights $\mathbf{w}_r(q)$. Optimizing over these weights can always achieve the full capacity 
of the Gaussian interference channel and hence the upper bound in Equation (19.78). 
First we write $\hat{d}_t(n; q)$ as 

$$\hat{d}_t(n; q) = \mathbf{w}_r^H(q)\,\mathbf{a}_r(q)\,d_t(n; q) + \mathbf{w}_r^H(q)\,\mathbf{i}_r(n; q),$$
$$\mathbf{i}_r(n; q) = \sum_{p \neq q} \mathbf{H}_{rt}[k, k(p)]\,\mathbf{g}_t(p)\,d_t(n; p) + \boldsymbol{\varepsilon}_r(n, k),$$
$$\mathbf{a}_r(q) = \mathbf{H}_{rt}(k, k)\,\mathbf{g}_t(q), \quad k = k(q) \tag{19.79}$$

Now the mutual information on the right-hand side of Equation (19.76) can be written as 

$$I[\hat{d}_t(q); d_t(q)] = \log_2 \left( 1 + \frac{|\mathbf{w}_r^H(q)\,\mathbf{a}_r(q)|^2}{\mathbf{w}_r^H(q)\,\mathbf{R}_{i_r i_r}(q)\,\mathbf{w}_r(q)} \right) \tag{19.80}$$

$$\mathbf{R}_{i_r i_r}(q) = \mathbf{R}_{i_r i_r}(k) + \sum_{p \neq q,\; k(p)=k} \mathbf{a}_r(p)\,\mathbf{a}_r^H(p) \tag{19.81}$$

where k = k(q). In the following we will also use the fact that scrambling the transmit 
weights by an orthonormal scrambling matrix does not change the values of the mutual 
information for the Gaussian interference channel. In other words, if 

$$\tilde{\mathbf{G}}_t(k) = \mathbf{G}_t(k)\,\mathbf{V}_t(k), \qquad \mathbf{V}_t(k)\,\mathbf{V}_t^H(k) = \mathbf{I} \tag{19.82}$$

then for all k, 

$$C_{rt}(k; \tilde{G}_t) = C_{rt}(k; G_t) \tag{19.83}$$

where $\tilde{G}_t \equiv \{\tilde{\mathbf{g}}_t(q)\}$ is the parameter vector of all stacked, scrambled 
transmit weights $\tilde{\mathbf{g}}_t(q)$, drawn from the columns of $\tilde{\mathbf{G}}_t(k)$. The 
proof is based on the fact that the mutual information is completely determined by the statistics 
$\mathbf{R}_{i_r i_r}(k)$ and $\mathbf{R}_{s_r s_r}(k)$. From Equations (19.67) and (19.69), these 
depend only on the outer products $\mathbf{G}_t(m)\mathbf{G}_t^H(m)$, which are invariant 
with respect to orthonormal scrambling because 
$\tilde{\mathbf{G}}_t(m)\tilde{\mathbf{G}}_t^H(m) = \mathbf{G}_t(m)\mathbf{G}_t^H(m)$. Therefore 
replacing $\mathbf{G}_t(m)$ with $\tilde{\mathbf{G}}_t(m)$ does not change the mutual information. 
Based on this we have the following relation. For any set of network wide transmit weights $G_t$, 
there exists a set of receive weights $\mathbf{w}_r(q)$ and transmit weights 
$\tilde{\mathbf{g}}_t(q)$ such that for all k, 

$$C_{rt}(k; G_t) = C_{rt}(k; \tilde{G}_t) = D_{rt}(k; W_r, \tilde{G}_t) \tag{19.84}$$

where $W_r \equiv \{\mathbf{w}_r(q)\}$ and $\tilde{G}_t \equiv \{\tilde{\mathbf{g}}_t(q)\}$. Given the 
transmit weights $\tilde{G}_t$, the receive weights can be found from 

$$\mathbf{w}_r(q) = \arg\max_{\mathbf{w}_r(q)} I[\hat{d}_t(q); d_t(q)] = \arg\max_{\mathbf{w}_r(q)} \gamma_r(q) = \mathbf{R}_{i_r i_r}^{-1}(q)\,\mathbf{a}_r(q) \tag{19.85a}$$

where $\gamma_r(q)$ is the output signal to interference noise ratio (SINR) given by 

$$\gamma_r(q) \equiv \frac{|\mathbf{w}_r^H(q)\,\mathbf{a}_r(q)|^2}{\mathbf{w}_r^H(q)\,\mathbf{R}_{i_r i_r}(q)\,\mathbf{w}_r(q)} \tag{19.85b}$$
For the formal proof see References [54-57]. Equations (19.84)— (19.85) are significant 
because they allow us to optimize the Gaussian interference channel using decoupled ca- 
pacity. The decoupled capacity can be shown to obey the reciprocity theorem [54, 55], 
which permits us to relate uplink network metrics to downlink network metrics. We can ei- 
ther consider minimizing total power subject to a channel capacity constraint, or maximizing 
channel capacity subject to a transmit power constraint. 
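
As a small numerical illustration of the receive-weight rule in Equation (19.85), the Python sketch below computes the weights w = R^{-1} a for a hypothetical two-antenna receiver and compares the resulting output SINR with that of a plain matched filter (w = a). The channel vector `a`, the interference covariance `R` and all function names are assumptions chosen for the example; they do not come from References [54-57].

```python
import math


def inv2(R):
    """Invert a 2 x 2 complex matrix given as nested lists."""
    (a, b), (c, d) = R
    det = a * d - b * c
    return [[d / det, -b / det], [-c / det, a / det]]


def matvec(M, v):
    """Matrix-vector product for nested-list matrices."""
    return [sum(M[i][j] * v[j] for j in range(len(v))) for i in range(len(M))]


def hdot(u, v):
    """Hermitian inner product u^H v."""
    return sum(x.conjugate() * y for x, y in zip(u, v))


def sinr(w, a, R):
    """Output SINR |w^H a|^2 / (w^H R w) for receive weights w."""
    return abs(hdot(w, a)) ** 2 / hdot(w, matvec(R, w)).real


def max_sinr_weights(a, R):
    """Receive weights w = R^{-1} a that maximize the output SINR."""
    return matvec(inv2(R), a)


# Hypothetical effective channel vector and interference-plus-noise covariance.
a = [1 + 0j, 1j]
R = [[2 + 0j, 0.5 + 0j], [0.5 + 0j, 2 + 0j]]

w_opt = max_sinr_weights(a, R)
gamma_opt = sinr(w_opt, a, R)          # SINR with the optimized weights
gamma_mf = sinr(a, a, R)               # SINR with a matched filter
rate_opt = math.log2(1 + gamma_opt)    # per-stream term of the decoupled capacity
```

With these weights the SINR equals the quadratic form a^H R^{-1} a, which is the largest value any linear receiver can reach for the given a and R.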

In References [54, 55] a technique called locally enabled global optimization (LEGO) 
is designed to fully exploit the reciprocity theorem. This technique transforms the 
optimization over the transmit powers to one over the set of achievable output SINRs 
in Equation (19.85). The LEGO algorithm can be efficiently implemented using local 
information, and requires only an estimate of the post-beamforming interference 
power, $i_r(q) = \mathbf{w}_r^H(q)\,\mathbf{R}_{i_r i_r}(q)\,\mathbf{w}_r(q)$, and an estimate of the 
post-beamforming channel gain, 
$T_{rt}(q, q) = |\mathbf{w}_r^H(q)\,\mathbf{H}_{rt}[k(q), k(q)]\,\mathbf{g}_t(q)|^2$. If a collection 
of links through which we desire to compute a channel capacity is $Q(m) = \{q : k(q) = m\}$, then 
the algorithm can be summarized as below. 



Algorithm 1: LEGO 

(1) Update the remote receiver weights for every link during downlink transmission: 

$$\mathbf{w}_1(p) = \arg\max_{\mathbf{w}_1(p)} \gamma_1(p)$$

(2) Set the remote transmit weights to a scaled version of the conjugated receive 
weights, 

$$\mathbf{g}_1(p) = \mathbf{w}_1^*(p)\sqrt{\pi_1(p) / [\mathbf{w}_1^H(p)\,\mathbf{R}_{\varepsilon_r \varepsilon_r}\,\mathbf{w}_1(p)]}$$

(3) For each link p, estimate the associated post-beamforming interference power $i_1(p)$ 
and relay this information back to the base station. 

(4) Update the base receiver weights for every link during uplink transmission: 

$$\mathbf{w}_2(p) = \arg\max_{\mathbf{w}_2(p)} \gamma_2(p)$$

(5) For each link p, estimate the associated post-beamforming interference power $i_2(p)$. 

(6) Update the target SINR for each link, by optimizing the linearized model over each 
aggregate set m: 

$$\hat{\gamma}(q) = \arg\min_{\gamma(q)} \sum_{q \in Q(m)} \pi_1(q) \quad \text{s.t.} \quad c(m) \le \sum_{q \in Q(m)} \log[1 + \gamma(q)],$$

$$\gamma'(p) = \alpha[\hat{\gamma}(p) - \gamma(p)] + \gamma(p), \quad \text{for some } 0 < \alpha \le 1 \ (\alpha \text{ is initially set to } 1).$$

(7) Update the downlink transmit powers $\pi_2(q)$: 

$$\pi_2'(q) = \gamma(q)\,i_1(q) / T_{12}(q, q)$$

(8) Update the uplink transmit powers $\pi_1(q)$ and relay them back to the remote units: 

$$\pi_1'(q) = \gamma(q)\,i_2(q) / T_{21}(q, q)$$

(9) Set the base transmit weights to a scaled version of the conjugated receive weights, 

$$\mathbf{g}_2(p) = \mathbf{w}_2^*(p)\sqrt{\pi_2(p) / [\mathbf{w}_2^H(p)\,\mathbf{R}_{\varepsilon_r \varepsilon_r}\,\mathbf{w}_2(p)]}$$

(10) Return to step 1. 

References [54, 55] provide a numerical example of a four-cell wireless network as illus- 
trated in Figure 19.22. Each 1 km radius cell contains two remote units (RU), each of which 
communicates to two base stations. The base stations have eight antennae and the RUs two 
antennae. There are eight independent 50 kHz channels and thus each RU has 16 transmit 
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modes [i.e. each Q(m) contains 16 links]. The performance of LEGO is compared with a 
standard power management algorithm, that seeks to transmit a constant power for each 
link from each base station, and a single antenna network, that uses frequency division 
multiplexing to isolate each RU into a separate channel. Background radiation is assumed 
to be thermal white noise at room temperature, with an added 10 dB noise figure. As can be 
seen from the figure, the performance improvement of the LEGO algorithm is significant. 

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Figure 19.22 (a) Cell network geometry; (b) LEGO performance: worst case capacity vs 
worst transmit power. (Reproduced by permission of IEEE [55].) 

19.7 CAPACITY OF SENSOR NETWORKS WITH 
MANY-TO-ONE TRANSMISSIONS 



In sensor networks the many-to-one throughput capacity is the per source data throughput, 
when all or many of the sources are transmitting to a single fixed receiver or sink [58-63]. 
Earlier in this chapter we have shown that the achievable per node throughput in a wireless 
network is $\Theta\left(W/\sqrt{n \log n}\right)$, where W is the transmission capacity and n is the 
total number of nodes in the network. 
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The result was based on the assumption that communications are one-to-one, and that 
sources and destinations are randomly chosen. It does not apply to scenarios where there are 
communication hot spots in the network. Since many-to-one communication causes the sink 
to become a point of traffic concentration, the throughput achievable per source node in 
this case is reduced. In this section we are only interested in the case where every source 
gets an equal (on average) amount of original data (not including relayed data) across to the 
sink. This is because otherwise throughput can be maximized by having only the sensors 
closest to the destination transmit. Equal share of throughput from every sensor is desired 
for applications like imaging where each sensor represents a certain region of the whole 
field and data from each part are equally important. When distributed data compression is 
used, this is again approximately the case. However, when conditional coding is used this 
may no longer be true, since the amount of processed data can vary from source to source. 
In order to achieve the above goal we use the following assumptions about the network 
architecture. 



19.7.1 Network architecture 

(1) The network is deployed in a field of circular shape. There are n nodes, sources (we 
will use nodes, sources and sensors interchangeably in subsequent discussions) de- 
ployed in a network. A sink/destination is located at the center of the network/circle. 
Each node is not only a source of data, but also a relay for some other sources to 
reach the sink. 

(2) A network where the nodes are randomly placed following a uniform distribution 
will be referred to as a randomly deployed network or a random network. In such a 
network we have no direct control over the exact location of the nodes. A network 
where we can determine the exact locations of the nodes will be referred to as an 
arbitrary network. 

(3) Two network organizational architectures are considered: (a) a flat architecture where 
nodes communicate with the sink via possibly multi-hop routes by using peer nodes 
as relays; and (b) hierarchical architecture, where clusters are formed so that sources 
within a cluster send their data (via a single hop or multihop depending on the 
size of the cluster) to a designated node known as the clusterhead. The cluster- 
head can potentially perform data aggregation and processing and then forward 
data to the sink. In this study, we will assume that the clusterheads serve as simple 
relays and no data aggregation is performed. We will also assume that the com- 
munication between nodes and clusterheads and communication between cluster- 
heads and the sink are on separate frequency channels so that the two layers do not 
interfere. 

(4) Throughout the section we will assume that the sources transmit following a schedule 
that consists of time slots. 

(5) To simplify the resulting expressions, we assume the field has an area of 1. Nodes 
share a common wireless channel using omnidirectional antennas. We assume nodes 
use a fixed transmission power and achieve a fixed transmission range. We adopt the 
commonly used interference model, as earlier in this chapter. Let $X_i$ and $X_j$ be two 
sources with distance $d_{ij}$ between them. Then the transmission from $X_i$ to $X_j$ will 
be successful if and only if 

$$d_{ij} \le r \quad \text{and} \quad d_{kj} > r + \Delta, \quad \Delta \ge 0 \tag{19.76}$$

for any source $X_k$ that is simultaneously transmitting. In the following r will be 
referred to as the transmission range. There are two interference concepts here. A 
node may interfere with another node that is transmitting if it is within distance 
$2r + \Delta$ of that node. To see this consider two transmitting nodes. If one node is 
within $2r + \Delta$ of the other node there will be an overlap between a circle of radius 
r around the first transmitting node and a circle of radius $r + \Delta$ around the second 
transmitting node. If the intended receiver is located within the overlapping area 
the transmission will fail because of interference. Therefore the two nodes need 
to be at least $2r + \Delta$ apart. On the other hand, a node will interfere with another 
node that is receiving if it is within distance $r + \Delta$ of that node. In the 
following both will be generally referred to as interference range. The distinction 
will be clear from the context. Also note that this interference model, Equation 
(19.76), essentially implies that no node can receive more than one transmission 
at a time. We will also assume that no node can transmit and receive at the same 
time. 

(6) The network scenario is depicted in Figure 19.23. The sink is placed at the center of 
this field. It receives all data generated by sources in the network. 

(7) In the following W refers to the transmission capacity of the channel in a flat network. 
In a hierarchical network W refers to the transmission capacity of the channel used 
within clusters, and W' refers to the transmission capacity of the channel used from the 
heads to the sink. The capacity is derived as a function of the transmission range, 
assuming the transmission range can provide connectivity. 
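
The interference model of assumption (5) can be checked directly from node coordinates. The Python sketch below is only an illustration: the function name, the coordinates and the parameter values are hypothetical. A transmission succeeds when the receiver is within the transmission range r of its transmitter and every other simultaneous transmitter is farther than r plus the guard distance from that receiver.

```python
import math


def transmission_succeeds(tx, rx, other_tx, r, delta):
    """Check the interference model for one transmitter-receiver pair.

    tx, rx    -- (x, y) positions of the transmitter and intended receiver
    other_tx  -- positions of all other simultaneously transmitting sources
    r         -- transmission range
    delta     -- guard distance added to r for interfering transmitters
    """
    if math.dist(tx, rx) > r:
        return False  # receiver is out of transmission range
    return all(math.dist(k, rx) > r + delta for k in other_tx)


# Hypothetical layout in the unit-area field.
r, delta = 0.2, 0.05
ok = transmission_succeeds((0.0, 0.0), (0.1, 0.0), [(0.5, 0.0)], r, delta)
blocked = transmission_succeeds((0.0, 0.0), (0.1, 0.0), [(0.3, 0.0)], r, delta)
too_far = transmission_succeeds((0.0, 0.0), (0.3, 0.0), [], r, delta)
```

In this layout the interferer at distance 0.4 from the receiver is harmless, while the one at distance 0.2 lies inside the interference range of 0.25 and blocks the reception.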
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Figure 19.23 Many-to-one network scenario. 
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19.7.2 Capacity results 

In this section we summarize capacity results for the network defined above. For formal 
proofs of the results the reader is referred to References [58-63]. 

19.7.2.1 Capacity in a flat network 

(1) The maximum per node throughput $\lambda$ in the network is upper bounded by W/n. 

(2) $\lambda = W/n$ can be achieved when every source can directly reach the sink. 

(3) $\lambda = W/n$ is not achievable if not every source can directly reach the destination and 
$\Delta > r$. 

(4) $\lambda = W/n$ may be achieved in an arbitrary network when not every source can directly 
reach the destination and $\Delta < r$. 

When the sink cannot directly receive from every source in the network, and assuming that 
the channel allocation does not take into account differences in traffic load, then 
$\lambda = W/n$ is not achievable with high probability regardless of the value of $\Delta$. In this 
case the upper bound on throughput is obtained by deriving the maximum number of simultaneous 
transmissions. 

Denote by $A_r$ the area of a circle of radius r, i.e. $A_r = \pi r^2$. Let the random variable $V_r$ 
denote the number of nodes within an area of size $A_r$ and assume a total area of 1. We then 
have [58-63]: 

(5) In a randomly deployed network with n nodes, 

$$\Pr\left(nA_r - \sqrt{\alpha_n n} \le V_r \le nA_r + \sqrt{\alpha_n n}\right) \to 1 \quad \text{as } n \to \infty$$

where the sequence $\{\alpha_n\}$ is such that $\lim_{n \to \infty} \alpha_n/n = \varepsilon$, 
$\varepsilon$ being positive but arbitrarily small. 

(6) If a network has randomly deployed sources and the transmission range r is such that 
not all sources can directly reach the sink, then with high probability the throughput 
upper bound $\lambda = W/n$ is not achievable. 

(7) A randomly deployed network using multihop transmission for many-to-one 
communication can achieve throughput 

$$\lambda \ge \frac{W\left(\pi r^2 - \sqrt{\varepsilon}\right)}{n\left(4\pi r^2 + 4\pi r\Delta + \pi\Delta^2 + \sqrt{\varepsilon}\right)}$$

with high probability, when no knowledge of the traffic load is assumed and $\varepsilon$ is as 
given in (5). 
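
To get a feel for how far the multihop bound in result (7) sits below the upper bound W/n, the two expressions can be evaluated numerically. The Python sketch below is an illustration only: the function names and parameter values are hypothetical, and the denominator is written using the identity 4πr² + 4πrΔ + πΔ² = π(2r + Δ)², where Δ is the guard distance of the interference model.

```python
import math


def upper_bound(W, n):
    """Upper bound W/n on the per node throughput."""
    return W / n


def multihop_lower_bound(W, n, r, delta, eps):
    """Throughput guaranteed with high probability without traffic-load knowledge."""
    numerator = math.pi * r ** 2 - math.sqrt(eps)
    # pi * (2r + delta)^2 = 4 pi r^2 + 4 pi r delta + pi delta^2
    denominator = math.pi * (2 * r + delta) ** 2 + math.sqrt(eps)
    return W * numerator / (n * denominator)


# Hypothetical parameters: unit capacity, 100 sources, range 0.1 in a unit-area field.
W, n, r = 1.0, 100, 0.1
bound_no_guard = multihop_lower_bound(W, n, r, delta=0.0, eps=0.0)
bound_with_guard = multihop_lower_bound(W, n, r, delta=0.05, eps=1e-6)
ceiling = upper_bound(W, n)
```

With no guard distance and a vanishing ε the lower bound reduces to W/(4n), a quarter of the upper bound, and it decreases further as the guard distance grows.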

In the following we will use a concept of virtual sources. As an example consider a simple 
network consisting of three sources and a sink, shown in Figure 19.24(a). The distance 
between adjacent nodes is r. Regardless of the value of $\Delta$, when one source transmits, it 
interferes with all other sources in this network. Therefore only one source can transmit 
at a time. The number of interfering neighbors for any of the sources is two, which is the 
highest degree of the graph that represents the interference relationship in this network. 
Thus a schedule of length 3 allows all sources to transmit once during the schedule. The 
load on the source closest to the sink, source 3, is $3\lambda$, since it carries the traffic of all 
three sources. The achievable throughput is then calculated as $3\lambda = W/3$, thus 
$\lambda = W/9$. 

Figure 19.24 (a) Chain network; (b) virtual sources. 

The way the schedule was calculated previously assigned the same share of the resources 
(time) to all the sources. Since we used the source with the highest need of resource (the one 
carrying the most traffic) to calculate the amount of resource needed, every other source is 
wasting resource. In our example we are giving every source the possibility of making three 
transmissions. Source 3 does indeed need all three transmissions, but source 2 only needs 
two and source 1 only needs one, hence a total of six transmissions. Now consider a similar 
network, only this time we have three sources that can reach the sink, shown in Figure 
19.24(b). We create a schedule where each one of the sources gets to transmit once and 
once only. However this time not all sources generate data. Using labels shown in Figure 
19.24(b), source 1 generates a packet and transmits it to source 2a. Source 2a relays the 
packet to source 3a, which then relays it to the sink. Then source 2b generates a packet 
and transmits it to source 3b, which relays it to the sink. Finally source 3c generates and 
transmits a packet to the destination. We can view each row of sources in this network as an 
equivalent of a single source in the previous example, i.e. 2a and 2b combined are equivalent 
to 2 in Figure 19.24(a); 3a, 3b and 3c combined are equivalent to 3, in terms of interference 
and traffic load. We will define sources 2a, 2b and 3a, 3b, 3c as virtual sources in the sense that 
they each represent one actual source in the network but they are co-located in one physical 
source. Adopting this concept, in this network the highest number of interfering neighbors 
is five (with a total of six virtual sources all in one interference area) and therefore there 
exists a schedule of length 6 that enables every virtual source to transmit once. Since the 
traffic load is the same for all virtual sources, the resources will be shared equally and no 
source will be wasting its share. In this case we get $\lambda = W/6$. Note that this is the largest 
$\lambda$ that could be obtained for the example in Figure 19.24(a). This concept allows us to define 
a 'traffic load-aware' schedule in the following way. 

(1) For each source node, create one virtual source for every source node whose traffic 
goes through this node, including itself. 

(2) Counting all the virtual sources we can determine the number of interfering neighbors 
(virtual sources) k. The new maximum degree of the interference graph is then k - 1. 

(3) A schedule of length $s \le k$ exists which is equally shared among virtual sources. 

(4) The achievable throughput per node is simply the share obtained by any virtual 
source in the network, i.e. $\lambda = W/s \ge W/k$. 
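
The four steps above can be traced on the three-source chain of Figure 19.24(a). The Python sketch below is a minimal illustration under a strong assumption: all sources lie in a single interference area, as in that example, so the schedule length equals the total number of virtual sources. The routing-tree representation and the function names are hypothetical.

```python
from fractions import Fraction


def virtual_source_counts(parent):
    """Count, for each node, the sources whose traffic passes through it (itself included)."""
    counts = {node: 0 for node in parent}
    for source in parent:
        node = source
        while node != "sink":
            counts[node] += 1
            node = parent[node]
    return counts


def load_aware_throughput(parent, W=1):
    """Per source throughput when the schedule is shared among virtual sources."""
    k = sum(virtual_source_counts(parent).values())
    return Fraction(W, k)


def equal_share_throughput(parent, W=1):
    """Per source throughput when every actual source gets the same number of slots."""
    counts = virtual_source_counts(parent)
    return Fraction(W, len(parent) * max(counts.values()))


# Chain of Figure 19.24(a): source 1 -> source 2 -> source 3 -> sink.
chain = {"1": "2", "2": "3", "3": "sink"}
counts = virtual_source_counts(chain)
aware = load_aware_throughput(chain)
naive = equal_share_throughput(chain)
```

For the chain this gives one, two and three virtual sources at sources 1, 2 and 3, so the load-aware schedule has length 6 while the equal-share schedule effectively needs 9 slots per delivered packet from each source.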

The concept of virtual source is used in References [58-63] to prove the following 
theorems: 



(8) A randomly deployed network using multihop transmission for many-to-one 
communication can achieve 

$$\lambda \ge \frac{W}{\sum_{h=1}^{2 + \lceil \Delta/r \rceil} l_h^{+}\,n_h^{+}}$$

with high probability, when knowledge of the traffic load is assumed. $l_h^{+}$ and $n_h^{+}$ are 
the upper bounds on the number of virtual sources per actual source and the number of actual 
sources respectively, that are h hops away from the sink with high probability. 

(9) A randomly deployed network using multihop transmission for many-to-one 
communication can achieve a throughput arbitrarily close to $W/\{n(2 - \pi r^2)\}$, when 
knowledge of the traffic load is assumed and $\Delta = 0$. 

In a hierarchical network with H clusters (heads) we assume that each cluster head 
creates a cluster containing the sources closest to it. Within each cluster the communication 
is either via a single hop or via multihop, while the communication from clusterheads to the 
sink is assumed to be done via a single hop on a different channel. We assume that cluster 
heads cannot transmit and receive simultaneously. In order to avoid boundary problems, 
we will assume there is at least a distance of $2(2r + \Delta)$ between any two clusterheads. We 
will also assume that each cluster covers an area of the same size, though not necessarily 
the same shape. Following these two assumptions and using result (5), we have with high 
probability that the number of nodes in each cluster is within $\sqrt{\alpha_n n}$ of n/H, where 
$\alpha_n$ is such that $\lim_{n \to \infty} \alpha_n/n = \varepsilon$. Therefore the clusters essentially 
form a Voronoi tessellation of the field, where every cluster (or Voronoi cell) contains a circle 
of radius $2r + \Delta$. Consequently 
sources located near the boundary between two clusters will not have a higher number of 
interfering neighbors (in terms of virtual sources), due to low traffic load, than the ones 
closer to the clusterheads. Thus previous results are directly applicable and we do not have 
to be concerned with the boundary. 

The question of interest is whether there exists an appropriate number of clusters H that 
would allow the network to achieve $\lambda = W/n$ with high probability using clustering, when 
clusterheads have the same transmission capacity W as the sources. That W/n remains 
the upper bound is again clear from the fact that the sink cannot receive from 
more than one node at a time (at rate W), and that there are n sources in the network. In 
References [58-63] the following results are proven: 

(10) In a network using clustering, where cluster heads have the same transmission 
capacity W as the sources, there exists an appropriate number of clusters H and 
an appropriate range of transmission r that would allow the network to achieve 
$\lambda = W/n$ with high probability as $n \to \infty$. The range of transmission r must 
satisfy 

$$\frac{20r^4 + 36\Delta r^3 + 25\Delta^2 r^2 + 8\Delta^3 r + \Delta^4}{r^2 - \sqrt{\varepsilon}\left(4r^2 + 4r\Delta + \Delta^2 - \frac{1}{\pi}\right)} < \frac{1}{\pi} \quad \text{and} \quad H \ge 3 - \pi r^2$$

In the case of $\Delta = 0$ and letting $\sqrt{\varepsilon} \to 0$, we need $r < \sqrt{1/(20\pi)}$. 

(11) In a network using clustering, where cluster heads have transmission capacity 
W', there exists an appropriate number of clusters H and an appropriate range of 
transmission r, as $n \to \infty$, that allows the network to achieve $\lambda = W'/n$ with 
high probability. W'/n is also the upper bound on throughput in this scenario. The 
condition is $H \ge (3 - \pi r^2)\,W'/W$. 
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Energy-efficient 
Wireless Networks 



20.1 ENERGY COST FUNCTION 

In Chapter 5 we discussed the impact of MAC layer protocols on energy efficiency, including 
TCP controlled retransmissions. In this chapter, we extend this analysis to the network layer 
and focus on routing algorithms. We discuss how the error rate associated with a link affects 
the overall probability of reliable delivery, and consequently the energy associated with the 
reliable transmission of a single packet. For any particular link (i, j) between a transmitting
node i and a receiving node j, let $T_{i,j}$ denote the transmission power and $p_{i,j}$ represent
the packet error probability. Assuming that all packets are of a constant size, the energy
involved in a packet transmission, $E_{i,j}$, is simply a fixed multiple of $T_{i,j}$.

Any signal transmitted is affected by two different factors: attenuation due to the medium,
and interference with ambient noise at the receiver. The attenuation is proportional to $D^K$,
where D is the distance between the receiver and the transmitter. The bit error rate associated
with a particular link is essentially a function of the ratio of the received signal power to the
ambient noise. In the constant-power scenario, $T_{i,j}$ is independent of the characteristics of
the link (i, j) and is a constant. In this case, a receiver located further away from a transmitter
will suffer greater signal attenuation (proportional to $D^K$) and will, accordingly, be subject
to a larger bit-error rate. In the variable-power scenario, a transmitter node adjusts $T_{i,j}$ to
ensure that the strength of the (attenuated) signal received by the receiver is independent of
D and is above a certain threshold level Th. The minimum transmission power associated
with a link of distance D in the variable-power scenario is $T_m = \mathit{Th} \times \gamma \times D^K$, where $\gamma$ is
a constant and K is the coefficient of channel attenuation ($K \ge 2$). Since Th is typically a
technology-specific constant, we can see that the minimum transmission energy over such
a link varies as $E_m(D) \propto D^K$.

If links are considered error-free, then minimum hop paths are the most energy-efficient
for the fixed-power case. Similarly, in the absence of transmission errors, paths with a large
number of small hops are typically more energy efficient in the variable power case. However,
in the presence of link errors, none of the above choices may give optimal energy efficient
paths. We now analyze the consequences of this behavior for the variable-power scenario 
and end-to-end (EER) and hop-by-hop (HHR) packet retransmission techniques. The anal- 
ysis for the fixed-power scenario is simpler, and is a special case of the variable-power 
scenario. Energy consumption for additional signal processing in the transmitter/receiver 
(modulation/demodulation) will be neglected. Modification of the models to include these 
losses is straightforward. 

In the EER case, a transmission error on any link leads to an end-to-end retransmission
over the path. Given the variable-power formulation of $E_m$, it is easy to see why breaking up a
link of distance D into two shorter links of distance $D_1$ and $D_2$ such that $D_1 + D_2 = D$
always reduces the total $E_m$. To elaborate on this, let us consider communication between
a sender (S) and a receiver (R) separated by a distance D. Let N represent the total number
of hops between S and R, so that $N - 1$ represents the number of forwarding nodes i,
$i \in \{2, \ldots, N\}$, with node i referring to the $(i - 1)$th intermediate hop in the forwarding
path. Node 1 refers to S and node $N + 1$ refers to R. In this case, the total energy spent
in simply transmitting a packet once (without considering whether or not the packet was
reliably received) from the sender to the receiver over the $N - 1$ forwarding nodes is:

$$E_t = \sum_{i=1}^{N} E_{i,i+1} = \sum_{i=1}^{N} \alpha D_{i,i+1}^{K} \qquad (20.1)$$

where $D_{i,j}$ refers to the distance between nodes i and j and $\alpha$ is a proportionality constant.
To understand the tradeoffs associated with the choice of $N - 1$, we compute the lowest
possible value of $E_t$ for any given value of $N - 1$. Using very simple symmetry arguments,
it is easy to see that the minimum transmission energy case occurs when each of the hops
is of equal length D/N. In that case, $E_t$ is given by

$$E_t = \sum_{i=1}^{N} \alpha \frac{D^K}{N^K} = \frac{\alpha D^K}{N^{K-1}}$$

We now consider how the choice of N affects the probability of transmission errors and the 
consequent need for retransmissions. Clearly, increasing the number of intermediate hops 
increases the likelihood of transmission errors over the entire path. Assuming that each of
the N links has an independent packet error rate of $p_{\mathrm{link}}$, the probability of a transmission
error over the entire path, denoted by p, is given by $p = 1 - (1 - p_{\mathrm{link}})^N$.

The number of transmissions (including retransmissions) necessary to ensure the successful
transfer of a packet between S and R is then a geometrically distributed random
variable X, such that $\Pr\{X = k\} = p^{k-1} (1 - p), \forall k$. The mean number of individual
packet transmissions for the successful transfer of a single packet is thus $1/(1 - p)$. Since
each such transmission uses total energy $E_t$ given above, the total expected energy required
in the reliable transmission of a single packet is given by:

$$E_t(\mathrm{EER}) = \alpha \frac{D^K}{N^{K-1}} \cdot \frac{1}{1 - p} = \frac{\alpha D^K}{N^{K-1} (1 - p_{\mathrm{link}})^N} \qquad (20.2)$$

Figure 20.1 Total energy costs (K = 2, EER).

By treating N as a continuous variable and differentiating, it follows that the optimal value
of the number of hops, $N_{\mathrm{opt}}$, is given by:

$$N_{\mathrm{opt}} = -(K - 1) / \log(1 - p_{\mathrm{link}}) \qquad (20.3)$$

The existence of the optimum value is demonstrated in Figure 20.1. 
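
As a numerical illustration of Equations (20.2) and (20.3), the following Python sketch evaluates the EER energy for equal-length hops and compares the continuous optimum with a brute-force search over integer hop counts. The function names, the normalization of the proportionality constant to 1 and the search limit max_hops are assumptions made for this example only; the logarithm in Equation (20.3) is taken as the natural logarithm, which is what the differentiation yields.

```python
import math


def eer_energy(n_hops, distance, p_link, k=2.0, alpha=1.0):
    """Expected energy of Equation (20.2) for n_hops equal-length hops."""
    single_pass = alpha * distance ** k / n_hops ** (k - 1)  # one traversal of the path
    path_success = (1.0 - p_link) ** n_hops                  # every hop must succeed
    return single_pass / path_success


def eer_optimal_hops(p_link, k=2.0):
    """Continuous optimum of Equation (20.3)."""
    return -(k - 1.0) / math.log(1.0 - p_link)


def best_integer_hops(distance, p_link, k=2.0, max_hops=500):
    """Brute-force search over integer hop counts, for comparison."""
    return min(range(1, max_hops + 1),
               key=lambda n: eer_energy(n, distance, p_link, k))
```

Calling best_integer_hops(10.0, 0.03) and eer_optimal_hops(0.03) should give values that differ by less than one hop, since the integer optimum can only be one of the two neighbors of the continuous one. Note that the optimum does not depend on D or on the proportionality constant.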

In the case of the HHR model, the number of transmissions on each link is independent
of the other links and is geometrically distributed. The total energy cost for the HHR case
with N hops, each of distance D/N and with a link packet error rate of $p_{\mathrm{link}}$, is

$$E_t(\mathrm{HHR}) = \sum_{i=1}^{N} \alpha \frac{D_{i,i+1}^{K}}{1 - p_{i,i+1}} = \frac{\alpha D^K}{N^{K-1} (1 - p_{\mathrm{link}})} \qquad (20.4)$$

In this case, it is easy to see that the total energy required always decreases with increasing 
N. One should be aware that in a practical system at some point when N is sufficiently 
large, the signal processing energy will become comparable with the energy spent for 
transmissions. 
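
The behavior described above can be checked with a short Python sketch of Equation (20.4). The second function adds a fixed per-hop processing energy e_proc, which is not part of the model in this section; it is a hypothetical term introduced here only to illustrate why, in practice, the number of hops cannot grow without bound.

```python
def hhr_energy(n_hops, distance, p_link, k=2.0, alpha=1.0):
    """Expected energy of Equation (20.4): each hop is retried on its own."""
    per_attempt = alpha * (distance / n_hops) ** k   # one attempt over a hop of length D/N
    return n_hops * per_attempt / (1.0 - p_link)     # 1/(1 - p_link) attempts per hop on average


def total_energy(n_hops, distance, p_link, k=2.0, alpha=1.0, e_proc=0.0):
    """HHR energy plus an assumed processing cost e_proc per hop (illustrative only)."""
    return hhr_energy(n_hops, distance, p_link, k, alpha) + n_hops * e_proc
```

With e_proc = 0 the cost decreases monotonically in N, as stated in the text. With any positive e_proc the total cost has a finite minimum, because the processing term grows linearly in N while the transmission term shrinks.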



20.2 MINIMUM ENERGY ROUTING 

Energy-aware routing protocols typically compute the shortest-cost path, where the cost 
associated with each link is some function of the transmission (and/or reception) energy 
associated with the corresponding nodes. To adapt such minimum cost route determination 
algorithms (such as Dijkstra's or the Bellman-Ford algorithm) for energy-efficient reliable 
routing, the link cost must now be a function of not just the associated transmission energy, 
but the link error rates as well. A link is assumed to exist between node pair (i, j) as long
as node j lies within the transmission range of node i. This transmission range is uniquely
defined for the constant-power case. For the variable-power case, this range is really the
maximum permissible range corresponding to the maximum transmission power of a sender.
Let $E_{i,j}$ be the energy associated with the transmission of a packet over link (i, j), and $p_{i,j}$ be
the link packet error probability associated with that link. In the fixed-power scenario, $E_{i,j}$
is independent of the link characteristics; in the variable-power scenario, $E_{i,j}$ is a function
of the distance between nodes i and j. Now, the routing algorithm's job is to compute the
path from a source to the destination that minimizes the sum of the energy costs
over each constituent link.

Choosing path P for communication between S and D implies that the total energy cost
for HHR is

$$E_P = \sum_{i=1}^{N} \frac{E_{i,i+1}}{1 - p_{i,i+1}} \qquad (20.5)$$

Choosing a minimum-cost path from node 1 to node $N + 1$ is thus equivalent to choosing
the path P that minimizes Equation (20.5). It is thus easy to see that the corresponding link
cost for link (i, j), denoted $C_{i,j}$, is given by $C_{i,j} = E_{i,j}/(1 - p_{i,j})$. Ad hoc routing protocols,
discussed in Chapter 13, such as AODV, DSR and TORA, can use this link cost to compute
the appropriate energy-efficient routes. Some of the existing energy-efficient routing
techniques, e.g. PARO, can also be easily adapted to use this new link cost formulation to
compute minimum-energy routes. Thus, in such a modified version of the PARO algorithm,
an intermediate node C would offer to interject itself between two nodes A and B if the
sum of the link costs $C_{A,C} + C_{C,B}$ was less than the 'direct' link cost $C_{A,B}$.

In end-to-end retransmissions, the total energy cost along a path contains a multiplicative
term involving the packet error probabilities of the individual constituent links. In fact,
assuming that transmission errors on a link do not stop downstream nodes from relaying
the packet, the total energy cost can now be expressed as:

$$E_P = \frac{\sum_{i=1}^{N} E_{i,i+1}}{\prod_{i=1}^{N} (1 - p_{i,i+1})} \qquad (20.5a)$$

Given this form, the total cost of the path cannot be expressed as a linear sum of individual
link costs, thereby making the exact formulation inappropriate for traditional
minimum-cost path computation algorithms. In [1] a heuristic cost function for a link was
suggested as

$$C_{i,j} = \frac{E_{i,j}}{(1 - p_{i,j})^L} \qquad (20.6)$$

where L = 2,3, ... , and is chosen to be identical for all links. Clearly, if the exact path 
length is known and all nodes on the path have identical link error rates and transmission 
costs, L should be chosen equal to that path length. However, in accordance with current 
routing schemes, we require that a link should associate only a single link cost with itself, 
irrespective of the lengths of specific routing paths that pass through it. Therefore, we 
need to fix the value of L independent of the different paths that cross a given link. If better 
knowledge of the network paths is available, then L should be chosen to be the average path 
length of this network. Higher values of L impose progressively stiffer penalties on links 
with non-zero error probabilities. Given this formulation of the link cost, the minimum-cost
path computation effectively computes the path with the minimum 'approximate' energy
cost given by:

$$E_P \approx \sum_{i=1}^{N} \frac{E_{i,i+1}}{(1 - p_{i,i+1})^L} \qquad (20.7)$$

As before, protocols like AODV, DSR, TORA and PARO can use this new link cost function
to make their routing decisions.
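
To make the role of the exponent L concrete, the following Python sketch runs a standard Dijkstra search with the link cost of Equation (20.6). It is a minimal illustration, not the implementation used in [1]: the data layout (a dictionary mapping a directed link (i, j) to its energy and error probability) and all function names are assumptions of this example.

```python
import heapq


def link_cost(energy, p_err, L=1):
    """Equation (20.6); L = 1 gives the exact hop-by-hop cost E/(1 - p)."""
    return energy / (1.0 - p_err) ** L


def min_energy_path(links, source, dest, L=1):
    """Dijkstra over directed links {(i, j): (E_ij, p_ij)}; returns (cost, path)."""
    graph = {}
    for (i, j), (energy, p_err) in links.items():
        graph.setdefault(i, []).append((j, link_cost(energy, p_err, L)))
    best = {source: 0.0}
    heap = [(0.0, source, [source])]
    while heap:
        cost, node, path = heapq.heappop(heap)
        if node == dest:
            return cost, path
        if cost > best.get(node, float("inf")):
            continue  # stale heap entry, a cheaper route to this node was already found
        for nxt, c in graph.get(node, []):
            new_cost = cost + c
            if new_cost < best.get(nxt, float("inf")):
                best[nxt] = new_cost
                heapq.heappush(heap, (new_cost, nxt, path + [nxt]))
    return float("inf"), []


# Hypothetical three-node example: a clean direct link versus two short but lossy hops.
example_links = {
    ("S", "D"): (4.0, 0.05),
    ("S", "A"): (1.0, 0.40),
    ("A", "D"): (1.0, 0.40),
}
```

With L = 1 (hop-by-hop retransmissions) the two short hops are cheaper and the relay A is used. With L = 3 the lossy links are penalized more heavily and the direct link is selected, which is the behavior the heuristic is meant to produce for end-to-end retransmissions.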



20.3 MAXIMIZING NETWORK LIFETIME 

We now discuss how we can include the retransmission-aware formulation of the link cost
in an algorithm that attempts to increase the operational lifetime of multihop wireless
networks. Unlike previous protocols, maximum reliable packet carrying capacity (MRPC) 
considers both the node characteristics (residual battery energy at the transmitting node) and 
the link characteristics (link distance and link error rates), while evaluating the suitability 
of alternative paths. Given the current battery power levels at the different nodes, MRPC 
selects a route that has the maximum reliable packet carrying capacity among all possible 
paths, assuming no other cross-traffic passes through the nodes on that path. 

To formalize the algorithm, let us assume that the residual battery power at a certain
instant of time at node i is $B_i$. As before, let the transmission energy required by node i
to transmit a packet over link (i, j) to node j be $E_{i,j}$. Let the source and destination nodes
for a specific session (route) be S and D respectively. If the route-selection algorithm then
selects a path P from S to D that includes the link (i, j), then the maximum number of
packets that node i can forward over this link is clearly $B_i / E_{i,j}$. Accordingly, we can define
a node-link metric, $M_{i,j}$, for the link (i, j) as:

$$M_{i,j} = \frac{B_i}{E_{i,j}} \qquad (20.8)$$

The key point in this formulation is that the cost metric includes both a node-specific
parameter (the battery power) and a link-specific parameter (the packet transmission energy
for reliable communication across the link). Clearly, the 'lifetime' of the chosen path P,
defined by the maximum number of packets that may be potentially forwarded between S
and D using path P, is determined by the weakest intermediate node - one with the smallest
value of $M_{i,j}$. Accordingly, the 'lifetime' $\Im_P$ associated with route P is:

$$\Im_P = \min_{(i,j) \in P} \{M_{i,j}\} \qquad (20.9)$$

The MRPC algorithm then selects the candidate route $P_c$ that maximizes the 'lifetime' of
communication between S and D. Formally, the chosen route is such that:

$$P_c = \arg\max \{\Im_P \mid P \in \text{all possible routes}\} \qquad (20.10)$$

Given the cost and lifetime formulations for MRPC it is then easy to use a modified version 
of Dijkstra's minimum cost algorithm for decentralized route computation. 

To apply Dijkstra's algorithm for determining the minimum-cost path, the distance metric
from any node to the given destination should be defined as the value of $\Im_P$ over the optimal
path from that node to D. Now consider a node A that sees advertisements from its neighbors,
{X, Y, Z, ...}, with corresponding distance metrics $\Im_X, \Im_Y, \Im_Z, \ldots$ for a given destination
D. Node A can then compute the best path to D (using its optimal neighbor) by using the
following simple algorithm:

(1) For each of the neighboring nodes ($j \in \{X, Y, Z, \ldots\}$), compute the link cost $M_{A,j}$
using Equation (20.8).

(2) For each of the neighboring nodes ($j \in \{X, Y, Z, \ldots\}$), compute the potential new
value of $\Im_{\mathrm{pot}}$ using $\Im_{\mathrm{pot}}(A, j) = \min\{M_{A,j}, \Im_j\}$.

(3) Select as the next-hop neighbor towards D the node which results in the maximum
value of $\Im_{\mathrm{pot}}$, i.e. choose node k such that $k = \arg\max_{j \in \{X,Y,Z,\ldots\}} \{\Im_{\mathrm{pot}}(A, j)\}$ and
assign $\Im_A = \Im_{\mathrm{pot}}(A, k)$.

Using this recursive formulation allows all nodes in the ad hoc network to iteratively 
build their optimal route towards a specific destination D. The distance-vector formulation
presented here can easily be incorporated in protocols, such as AODV, DSR and TORA, 
that are specifically designed for ad hoc mobile environments. 
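
The three steps above amount to a max-min ('widest path') variant of Dijkstra's algorithm. The sketch below is one possible centralized rendering in Python, given only for illustration: it grows the solution outwards from the destination and applies steps (1) to (3) at each relaxation. The dictionaries battery and energy, and the function name, are assumptions of this example rather than part of any protocol specification.

```python
import heapq


def mrpc_next_hops(battery, energy, dest):
    """Max-min variant of Dijkstra towards dest.

    battery: {node: residual energy B_i}
    energy:  {(i, j): E_ij} for every directed link
    Returns {node: (lifetime, next_hop)}; the destination maps to (inf, None).
    """
    incoming = {}
    for (i, j), e in energy.items():
        # Step (1): node-link metric M_ij of Equation (20.8)
        incoming.setdefault(j, []).append((i, battery[i] / e))

    life = {dest: (float("inf"), None)}
    heap = [(-float("inf"), dest)]        # max-heap emulated with negated lifetimes
    done = set()
    while heap:
        neg_life, node = heapq.heappop(heap)
        if node in done:
            continue
        done.add(node)
        for prev, m in incoming.get(node, []):
            pot = min(m, -neg_life)       # step (2): potential lifetime via this neighbor
            if pot > life.get(prev, (0.0, None))[0]:
                life[prev] = (pot, node)  # step (3): keep the neighbor with the largest value
                heapq.heappush(heap, (-pot, prev))
    return life


# Hypothetical example: S can reach D through relay A or relay B.
example_battery = {"S": 100.0, "A": 10.0, "B": 6.0}
example_energy = {("S", "A"): 1.0, ("A", "D"): 2.0,
                  ("S", "B"): 1.0, ("B", "D"): 1.0}
```

In this example relay A has more residual energy than relay B, but its link to D is twice as expensive, so it can forward only 5 packets against 6 for B; the route through B is therefore preferred.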

The basic MRPC formulation for power-aware routing does not need to specify the value
of the transmission energy cost associated with a specific link. Note that Equation (20.8)
is expressed as a function of a generic link cost $E_{i,j}$. Accordingly, by specifying different
forms of $E_{i,j}$ it is possible to tailor the MRPC mechanism for specific technologies and/or
scenarios.

For the fixed-power scenario, the energy involved in a single packet transmission attempt,
$E_{i,j}$, is a constant for all (i, j) and is independent of the distance between neighboring
nodes i and j. For the variable-power scenario, $E_{i,j}$ will typically be $\propto D_{i,j}^{K}$, where $D_{i,j}$
is the distance between nodes i and j. A routing algorithm for reliable packet transfer
should include the link's packet error probability in formulating the transmission energy
cost. By ignoring the packet error probability, the link cost concentrates (wrongly) only on
the energy spent in transmitting a single packet. The correct metric is the effective packet
transmission energy for reliable transmission, which includes the energy spent in one or more
re-transmissions that might be necessary in the face of link errors. A transmission energy
metric of the form $C_{i,j} = E_{i,j}/(1 - p_{i,j})^L$ was suggested, where $p_{i,j}$ is the link's packet
error probability and $L \ge 1$. For hop-by-hop re-transmissions L should be chosen to be 1. In
the absence of hop-by-hop re-transmissions (i.e. re-transmissions are only performed end-to-end),
the transmission cost is well approximated by $L \in [3, 5]$. Power-aware routing has
been studied in a number of papers [1-42].

The conditional MRPC (CMRPC) algorithm is the MRPC equivalent of the conditional
max-min battery capacity routing (CMMBCR) algorithm presented in Toh et al. [2].
The CMMBCR algorithm is based on the observation that using residual battery energy as
the sole metric throughout the lifetime of the ad hoc network can actually lower the overall
lifetime, since it never attempts to minimize the total energy consumption. Accordingly,
the CMMBCR algorithm uses regular minimum-energy routing as long as there is even one
candidate path in which the remaining battery power level in all the constituent nodes lies
above a specified threshold $\gamma$. When no such path exists, CMMBCR switches to MMBCR,
i.e. it picks the path with the maximum residual capacity on the 'critical node'.

The CMRPC algorithm differs from CMMBCR in that the cost-functions at all times
include the link-specific parameters (e.g. error rates) as defined earlier in this chapter. The
algorithm can thus be specified as follows. Let $\Psi$ be the set of all possible paths between
the source S and destination D and let $\Omega$ represent the set of paths such that, for any route
$Q \in \Omega$, $\Im_Q > \gamma$. In other words, $\Omega$ represents the set of paths whose most critical nodes
have a lifetime greater than a specified threshold. The routing scheme thus consists of the
following actions:

(1) If $\Omega \ne \emptyset$ (there are one or more paths with $\Im > \gamma$), the algorithm selects a path
$Q \in \Omega$ that minimizes the total transmission energy for reliable transfer, i.e.

$$Q = \arg\min_{P \in \Omega} \sum_{(i,j) \in P} \frac{E_{i,j}}{(1 - p_{i,j})^L} \qquad (20.11)$$

(2) Otherwise, switch to the MRPC algorithm, i.e. select Q such that

$$Q = \arg\max \{\Im_Q \mid Q \in \Psi\}$$

The threshold $\gamma$ is a parameter of the CMRPC algorithm. A lower value of $\gamma$ implies a
smaller protection margin for nodes nearing battery power exhaustion. Accordingly, the
performance of the CMRPC algorithm will be a function of $\gamma$.
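
The two-case rule can be sketched in Python as follows. For clarity the example enumerates all loop-free paths, which is practical only for very small topologies; a real protocol would use the distributed computation described earlier. The function names and data layout are assumptions of this example, and the lifetime is computed with the retransmission-aware link cost, i.e. $M_{i,j} = B_i / C_{i,j}$, which is our reading of the statement that the cost functions always include the link error rates.

```python
def simple_paths(links, src, dst, path=None):
    """Yield every loop-free path from src to dst as a list of nodes."""
    path = (path or []) + [src]
    if src == dst:
        yield path
        return
    for (i, j) in links:
        if i == src and j not in path:
            yield from simple_paths(links, j, dst, path)


def cmrpc_route(links, battery, src, dst, gamma, L=1):
    """links: {(i, j): (E_ij, p_ij)}; battery: {i: B_i}; gamma: lifetime threshold."""
    def cost(i, j):                       # link cost of Equation (20.6)
        e, p = links[(i, j)]
        return e / (1.0 - p) ** L

    def hops(path):
        return list(zip(path, path[1:]))

    def lifetime(path):                   # Equation (20.9) with M_ij = B_i / C_ij (assumed)
        return min(battery[i] / cost(i, j) for i, j in hops(path))

    def energy(path):                     # objective of case (1)
        return sum(cost(i, j) for i, j in hops(path))

    candidates = list(simple_paths(links, src, dst))
    if not candidates:
        return None
    safe = [p for p in candidates if lifetime(p) > gamma]
    if safe:
        return min(safe, key=energy)      # case (1): minimum energy among protected paths
    return max(candidates, key=lifetime)  # case (2): fall back to MRPC


# Hypothetical example: the route via A is cheaper, the route via B lives longer.
example_links = {("S", "A"): (1.0, 0.0), ("A", "D"): (1.0, 0.0),
                 ("S", "B"): (2.0, 0.0), ("B", "D"): (2.0, 0.0)}
example_battery = {"S": 1000.0, "A": 5.0, "B": 40.0}
```

With a low threshold both routes qualify and the cheaper route through A is used. Once the threshold exceeds the lifetime of the route through A, traffic moves to B, and when no route qualifies the rule degenerates to MRPC.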

The performance example is based on the network topology shown in Figure 20.2. The
corner nodes and the mid-points of each side of the rectangular grid were chosen as traffic
sources and destinations; the bold lines in the figure show the session end-points [1].

Each (source, destination) pair had two simultaneous sessions activated in opposite
directions, giving rise to a total of 16 different sessions. For the results reported here, each
session consisted of UDP traffic generated by a CBR source whose inter-packet gap was
distributed uniformly between 0.1 and 0.2 s. The error rate on each link was independently
distributed uniformly over $(0.05, p_{\max})$. Varying values of $p_{\max}$ were used. Routes were
recomputed at 2 s intervals in these simulations to capture the effect of changes in the
residual packet capacity on the link metrics.

Whenever a node died (i.e. its battery power was completely drained) during the course
of a simulation, the simulation code would check whether the graph became partitioned. The
simulations were run until each of the 16 sessions failed to find any route from its source
to the corresponding destination. To avoid the termination of a simulation due to battery
power exhaustion at source or destination nodes, all source and sink nodes were configured
to have 'infinite' power resources. All the other 'intermediate' nodes were configured with
identical initial battery power levels.

Figure 20.2 Simulation scenario.

Figure 20.3 Expiration sequence for different algorithms, R = 1.5. (Reproduced by permission
of IEEE [1].)

To study the performance of the various algorithms, experiments were performed where 
the maximum transmission radius, R, of each node was varied. Figure 20.2 shows the set 
of neighboring nodes for a corner node when the transmission radius is set to 1.5. The 
expiration sequence and the node expiry times were noted for each simulation.
The expiration sequence (sorted in ascending order of the expiration times) provides a 
useful indicator of how each algorithm affects the lifetime of the individual nodes, and the 
entire network. In addition to the expiration sequence, the total packet throughput was also 
calculated by counting the total number of packets successfully received at the destination 
nodes, and the energy costs per packet by dividing the total energy expenditure by the total 
packet throughput. Except for the expiration sequences, all other metrics were obtained 
by averaging over multiple runs. The results are shown in Figures 20.3-20.5. From these 
results one can see that CMRPC/MRPC outperforms other options.

Figure 20.4 (a) Total packet throughput; (b) average transmission energy per received
packet (UDP sources), R = 1.5. (Reproduced by permission of IEEE [1].)

Figure 20.5 CMRPC: (a) total packet throughput; (b) average transmission energy per
received packet vs the protection threshold. (Reproduced by permission of
IEEE [1].)



20.4 ENERGY-EFFICIENT MAC IN SENSOR NETWORKS

Among the requirements for MACs in wireless sensor networks, energy efficiency is typically
the primary goal. In these systems, idle listening is identified as a major source of
energy wastage. Measurements show that idle listening consumes nearly the same power as
receiving. Since in sensor network applications traffic load is very light most of the time, it
is often desirable to turn off the radio when a node does not participate in any data delivery.
Some schemes put (scheduled) idle nodes in power-saving mode (SMAC) and switch nodes
to full active mode when a communication event happens. Although a low duty cycle MAC
is energy-efficient, it has three side-effects.

(1) It increases the packet delivery latency. At a source node, a sampling reading may
occur during the sleep period and has to be queued until the active period. An
intermediate node may have to wait until the receiver wakes up before it can forward
a packet received from its previous hop. This is called sleep latency in SMAC, and it
increases in proportion to the number of hops, with a slope equal to the schedule
length (active period plus sleep period).

(2) A fixed duty cycle does not adapt to the varying traffic rate in a sensor network. A
fixed duty cycle for the highest traffic load results in significant energy wastage
when traffic is low, while a duty cycle for low traffic load results in low message
delivery and long queuing delay. Therefore it is desirable to adapt the duty cycle
to the varying traffic load.

(3) A fixed synchronous duty cycle may increase the possibility of collision. If neighboring
nodes turn to active state at the same time, all may contend for the channel,
making a collision very likely.

There are several possibilities to reduce sleep delay and adjust the duty cycle to the traffic
load. Those mechanisms are either implicit, in which nodes remain active on overhearing an
ongoing transmission, or explicit, in which there are direct duty cycle adjusting messages.
In adaptive listening, a node that overhears its neighbor's transmission wakes up for a short
period of time at the end of the transmission, so that if it is the next hop of its neighbor, it
can receive the message without waiting for its scheduled active time. A node also can keep
listening and potentially transmitting as long as it is in an active period. An active period
ends when no activation event has occurred for a certain time. The activation
events include reception of any data, the sensing of communication on the radio, the
end-of-transmission of a node's own data packet or acknowledgement, etc.

If the number of buffered packets for an intended receiver exceeds a threshold L, the sender 
can signal the receiver to remain on for the next slot. A node requested to stay awake sends 
an acknowledgement to the sender, indicating its willingness to remain awake in the next 
slot. The sender can then send a packet to the receiver in the following slot. The request is 
renewed on a slot-by-slot basis. 

However, in previous mechanisms (whether explicit or implicit), not all nodes beyond 
one hop away from the receiver can overhear the data communication, and therefore packet 
forwarding will stop after a few hops. This data forwarding interruption problem causes 
sleep latency for packet delivery. 

DMAC employs a staggered active/sleep schedule to solve this problem and enable
continuous data forwarding on the multihop path. In DMAC, data prediction is used to
enable active slot request when multiple children of a node have packets to send in the same
sending slot, while the more-to-send (MTS) packet is used when nodes on the same level of the
data gathering tree with different parents compete for channel access.

20.4.1 Staggered wakeup schedule 

For a sensor network application with multiple sources and one sink, the data delivery paths 
from sources to sink are in a tree structure, a data gathering tree. Flows in the data gathering 
tree are unidirectional from sensor nodes to sink. There is only one destination, the sink. All 
nodes except the sink will forward any packets they receive to the next hop. The key insight 
in designing a MAC for such a tree is that it is feasible to stagger the wake-up scheme so 
that packets flow continuously from sensor nodes to the sink. DMAC is designed to deliver 
data along the data gathering tree, aiming at both energy efficiency and low latency.
In DMAC, the activity schedule of nodes on the multihop path is staggered to wake
up sequentially like a chain reaction. Figure 20.6 shows a data gathering tree and the
staggered wake-up scheme. An interval is divided into receiving, sending and sleep periods.
In receiving state, a node is expected to receive a packet and send an ACK packet back to
the sender. In the sending state, a node will try to send a packet to its next hop and receive
an ACK packet. In sleep state, nodes will turn off radio to save energy. The receiving and
sending periods have the same length of $\mu$, which is enough for one packet transmission
and reception. Depending on its depth d in the data gathering tree, a node skews its wake-up
scheme $d\mu$ ahead from the schedule of the sink. In this structure, data delivery can only be
done in one direction towards the root. Intermediate nodes have a sending slot immediately
after the receiving slot.
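
The staggering rule can be expressed in a few lines of Python. The sketch below is illustrative only: the tree is given as a hypothetical parent map, times are measured relative to the start of the sink's receiving slot, and the function name is an assumption of this example.

```python
def wake_offsets(parent, mu):
    """parent: {node: parent, or None for the sink}; mu: slot length.

    Returns {node: (depth, rx_start, tx_start)}, with times relative to the
    start of the sink's receiving slot (negative means earlier).
    """
    def depth(node):
        d = 0
        while parent[node] is not None:
            node = parent[node]
            d += 1
        return d

    schedule = {}
    for node in parent:
        d = depth(node)
        rx_start = -d * mu          # a node at depth d wakes d*mu ahead of the sink
        tx_start = rx_start + mu    # the sending slot follows the receiving slot
        schedule[node] = (d, rx_start, tx_start)
    return schedule


# Hypothetical data gathering tree: sink <- a <- b, and sink <- a <- c.
example_tree = {"sink": None, "a": "sink", "b": "a", "c": "a"}
```

The property that makes continuous forwarding possible is that the sending slot of every node coincides with the receiving slot of its parent, so a packet can travel one hop per slot. The sending slot computed for the sink has no meaning, since the sink only receives.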

A staggered wake-up schedule has four advantages: (1) since nodes on the path wake 
up sequentially to forward a packet to next hop, sleep delay is eliminated if there is no 
packet loss due to channel error or collision; (2) a request for longer active period can be 
propagated all the way down to the sink, so that all nodes on the multihop path can increase 
their duty cycle promptly to avoid data stuck in intermediate nodes; (3) since the active 
periods are now separated, contention is reduced; and (4) only nodes on the multihop path 
need to increase their duty cycle, while the other nodes can still operate on the basic low 
duty cycle to save energy. The simulation results for the three different protocols are shown
in Figure 20.7. DMAC demonstrates good performance. In the simulation the following
parameters were used, as in Lu et al. [43]: radio bandwidth 100 kbps, radio transmission
range 250 m, radio interference range 550 m, packet length 100 bytes, transmit power
0.66 W, receive power 0.395 W and idle power 0.35 W. The sleeping power consumption
is set to 0. An MTS (more to send) packet is 3 bytes long. According to the parameters of
the radio and packet length, the receiving and sending slot $\mu$ is set to 10 ms for DMAC and
11 ms for DMAC/MTS. The active period is set to 20 ms for SMAC with adaptive listening.
All schemes have the basic duty cycle of 10%. This means a sleep period of 180 ms for
DMAC and SMAC, 198 ms for DMAC/MTS.
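
The sleep periods quoted above follow directly from the duty cycle, as the following Python fragment shows. It assumes that the active part of a DMAC interval is one receiving slot plus one sending slot, i.e. $2\mu$; the function names are chosen for this example only.

```python
def sleep_period_ms(active_ms, duty_cycle_percent):
    """Sleep time per interval such that active / (active + sleep) equals the duty cycle."""
    return active_ms * (100 - duty_cycle_percent) / duty_cycle_percent


def air_time_ms(packet_bytes, bandwidth_kbps):
    """Time on air of one packet; 1 kbit/s is the same as 1 bit/ms."""
    return packet_bytes * 8 / bandwidth_kbps


dmac_sleep = sleep_period_ms(2 * 10, 10)       # DMAC: mu = 10 ms
dmac_mts_sleep = sleep_period_ms(2 * 11, 10)   # DMAC/MTS: mu = 11 ms
smac_sleep = sleep_period_ms(20, 10)           # SMAC: 20 ms active period
data_air_time = air_time_ms(100, 100)          # 100-byte packet at 100 kbps
```

A 100-byte packet occupies the channel for 8 ms at 100 kbps, which leaves the remainder of a 10 ms slot for the acknowledgement and channel access overhead.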
Figure 20.7 (a) Mean packet latency on each hop under low traffic load; (b) total energy
consumption on each hop under low traffic load.



21

Quality-of-Service Management
QoS has been the main criterion in the analysis of the schemes presented so far in the book. 
However, in the last chapter we present some additional solutions that will be of interest in 
4G networks. 



21.1 BLIND QoS ASSESSMENT SYSTEM

In this section we present a method to blindly estimate the quality of a multimedia com- 
munication link using digital fragile watermarking. Data hiding by digital watermarking is 
usually employed for multimedia copyright protection, authenticity verification or similar 
purposes. However, watermarking is here adopted as a technique to provide a blind measure 
of the quality of service in multimedia communications [1-28]. The watermark embedding 
procedure is sketched in Figure 21.1. It consists of embedding a watermark sequence, which 
is usually binary, into host data by means of a key. In the detection phase, the key is used 
to verify the presence of the embedded sequence. With regard to the domain where the 
watermark embedding occurs, we can distinguish methods operating in the spatial domain 
[15], in the discrete cosine transform (DCT) domain [16-19], in the Fourier transform domain
[20], and in the wavelet transform domain [1-5]. 

When unwanted modifications of the watermarked data affect even the extracted water- 
mark, the embedding scheme is known as fragile. Fragile watermarking [6-8] can be used 
to obtain information about the tampering process. In fact, it indicates whether or not the 
data has been altered and supplies localization information as to where the data was altered. 

Here, an unconventional use of a fragile watermark to evaluate the QoS in multimedia
mobile communications is presented. Specifically, a known watermark is superimposed onto
the host data. The rationale behind this approach is that, by transmitting the watermarked
data onto a channel, the mark undergoes the same alterations as suffered by the data. At
the receiving side, the watermark is estimated and compared with the original. Since the
alterations endured by the watermark are also likely to be suffered by the entire data, as they
follow the same communication link, the watermark degradation can be used to estimate
the overall alterations endured by the data.

Figure 21.1 Watermark embedding process.

Figure 21.2 Principle scheme of tracing watermarking for coder-channel quality assessment
in multimedia communications.

The tracing watermarking procedure for coder-channel quality assessment is given in 
Figure 21.2. The watermark embedding is performed by resorting to the spread-spectrum 
technique proposed in Cox et al. [17] for still images and applied to video sequences in 
Hartung and Girod [18]. The watermark is a narrowband low-energy signal. It is then spread 
so that it has a larger bandwidth. Consequently, the watermark energy contribution for each 
host frequency bin is negligible, which makes the watermark imperceptible. 

The application addressed here is a tracing system, embedded into the data stream, that 
tracks the degradations introduced by the coder-channel cascade without perceptually 
affecting the data themselves. 

A set of uncorrelated pseudorandom noise (PN) matrices is multiplied by the reference 
watermark. Both the PN matrices and the embedded watermark are known at the receiving 
side. The watermark is the same for each video sequence frame, whereas the PN matrices 
are different for each frame. This ensures that the spatial localization of the mark differs 
from frame to frame, so that the visual persistence of the watermark is negligible. 

After the randomization of the watermark by the PN matrices, the embedding of the 
tracing marks is performed in the DCT domain. The watermark is embedded in the DCT 
middle-band frequencies of the whole image. After the inverse DCT (IDCT) has been 
performed on each frame, the whole sequence is coded by a video coder and, finally, 
transmitted. 

At the receiving side, the video is first decoded; then, from the DCT of each received 
frame of the sequence, a matched filter extracts the spread watermark, which is finally 
despread using the known PN matrices. The extracted watermark is then compared with 
the reference one, which is known at the receiving side, and the mean-square error (MSE) 
between the original mark and the received one is used as an index of the degradation 
affecting the received watermark. 



21.1.1 System modeling 

A two-dimensional video sequence will be represented as $\{f_i[n_1, n_2],\ i = 1, 2, \ldots, M\}$. 
The sequence is composed of $M$ frames $f_i[n_1, n_2]$ of $N_1 \times N_2$ pixels. Let $\omega[k_1, k_2]$ 
be the employed watermark, having dimensions $K_1 \times K_2$, and $\{p_i[k_1, k_2],\ i = 1, 2, \ldots, M\}$ 
the $M$ PN matrices of dimensions $K_1 \times K_2$. The watermark $\omega[k_1, k_2]$ is a visual pattern, 
like a logo, consisting of binary pixels. 

The PN matrices $\{p_i[k_1, k_2],\ i = 1, 2, \ldots, M\}$ are employed to spread a narrowband 
signal (watermark) over a much larger bandwidth signal (host frame) in such a way that 
the watermark energy is undetectable in any single frequency of the host image. This 
can be expressed as follows: $\omega_i^{(s)}[k_1, k_2] = \omega[k_1, k_2] \cdot p_i[k_1, k_2]$, $i = 1, 2, \ldots, M$, where 
$\omega_i^{(s)}[k_1, k_2]$ is the spread version of the watermark to be embedded in the $i$th frame. Let 
$F_i[k_1, k_2] = \mathrm{DCT}\{f_i[n_1, n_2]\}$ be the DCT transform of the $i$th frame $f_i[n_1, n_2]$. The spread 
watermark is embedded, in the DCT domain, in the middle-high frequency region $S$ of 
$F_i[k_1, k_2]$. 

The embedding is performed in the DCT domain according to the following rule: 

$$F_i^{(\omega)}[k_1, k_2] = \begin{cases} F_i[k_1, k_2] + \alpha\,\omega_i^{(s)}[k_1, k_2], & (k_1, k_2) \in S \\ F_i[k_1, k_2], & (k_1, k_2) \notin S \end{cases} \tag{21.1}$$

where $F_i^{(\omega)}[k_1, k_2]$ represents the DCT of the $i$th watermarked frame $f_i^{(\omega)}[n_1, n_2]$, and $\alpha$ is a 
scaling factor that determines the watermark strength. Parameter $\alpha$ must be chosen as a 
compromise between not degrading the picture on one side and being detectable by the 
tracing algorithm on the other. 

The $i$th watermarked frame is then obtained by performing the IDCT transform 
$f_i^{(\omega)}[n_1, n_2] = \mathrm{IDCT}\{F_i^{(\omega)}[k_1, k_2]\}$. Finally, the whole sequence is coded and then 
transmitted through a noisy channel. If $\{\hat{f}_i^{(\omega)}[n_1, n_2],\ i = 1, 2, \ldots, M\}$ is the received video 
sequence, then the DCT transform in the receiver gives $\hat{F}_i^{(\omega)}[k_1, k_2] = \mathrm{DCT}\{\hat{f}_i^{(\omega)}[n_1, n_2]\}$. 

The middle-high frequency region of embedding $S$ is selected. Then, the corresponding 
portion of $\hat{F}_i^{(\omega)}[k_1, k_2]$ is multiplied by the watermark $\omega[k_1, k_2]$, which is known at the 
receiving side, giving an estimation $\hat{\omega}_i^{(s)}[k_1, k_2]$ of the spread version of the watermark 
embedded in the $i$th frame, $\hat{\omega}_i^{(s)}[k_1, k_2] = \hat{F}_i^{(\omega)}[k_1, k_2] \cdot \omega[k_1, k_2]$. The despreading operation 
for the generic $i$th frame gives $\hat{\omega}_i[k_1, k_2] = \hat{\omega}_i^{(s)}[k_1, k_2] \cdot p_i[k_1, k_2]$. Finally, the watermark 
is estimated by averaging the despread watermark over the $M$ transmitted frames: 

$$\hat{\omega}[k_1, k_2] = \frac{1}{M} \sum_{i=1}^{M} \hat{\omega}_i[k_1, k_2] \tag{21.2}$$

The QoS is evaluated by comparing the extracted watermark with the original one. Formally 
this can be represented as 

$$MSE_i = \frac{1}{K_1 K_2} \sum_{k_1=1}^{K_1} \sum_{k_2=1}^{K_2} \left( \omega[k_1, k_2] - \hat{\omega}_i[k_1, k_2] \right)^2 \tag{21.3}$$

and 

$$MSE = \frac{1}{M} \sum_{i=1}^{M} MSE_i \tag{21.4}$$
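
The embedding, despreading and MSE steps above can be illustrated with a small numerical sketch. It is not the implementation used in the cited experiments: the frame size, the position of the region S, the bipolar (+1/-1) form of the watermark and PN matrices, the synthetic host frames, the additive Gaussian channel (standing in for the coder-channel cascade) and the function names are all assumptions made for illustration. The receiver is also simplified: it despreads with the PN matrix only and normalizes by the scaling factor before computing the MSE.

```python
import numpy as np

def dct_matrix(n):
    """Orthonormal DCT-II matrix C, so that C @ x @ C.T is the 2-D DCT of x."""
    k = np.arange(n)[:, None]
    m = np.arange(n)[None, :]
    c = np.sqrt(2.0 / n) * np.cos(np.pi * (2 * m + 1) * k / (2 * n))
    c[0, :] = np.sqrt(1.0 / n)
    return c

def watermark_mse(noise_std, M=40, N=32, K=8, alpha=2.0, seed=0):
    """Embed a K x K bipolar watermark in M synthetic frames, pass them through
    an additive-noise channel and return the MSE of the averaged estimate."""
    rng = np.random.default_rng(seed)
    C = dct_matrix(N)
    w = rng.choice([-1.0, 1.0], size=(K, K))        # reference watermark (assumed bipolar)
    S = (slice(8, 8 + K), slice(8, 8 + K))          # assumed mid-band embedding region
    ramp = np.add.outer(np.arange(N), np.arange(N)).astype(float)
    acc = np.zeros((K, K))
    for _ in range(M):
        p = rng.choice([-1.0, 1.0], size=(K, K))    # PN matrix, different for each frame
        frame = ramp + rng.normal(0.0, 1.0, (N, N)) # synthetic host frame
        F = C @ frame @ C.T                         # forward DCT
        F[S] += alpha * w * p                       # embedding rule, as in Eq. (21.1)
        marked = C.T @ F @ C                        # inverse DCT
        received = marked + rng.normal(0.0, noise_std, (N, N))
        F_hat = C @ received @ C.T
        acc += F_hat[S] * p                         # simplified despreading
    w_hat = acc / (M * alpha)                       # frame average, as in Eq. (21.2)
    return float(np.mean((w - w_hat) ** 2))         # MSE against the reference mark

if __name__ == "__main__":
    for std in (0.0, 2.0, 5.0):
        print(std, watermark_mse(std))
```

In this toy setting the watermark MSE grows with the channel noise level, which is the property the tracing approach relies on.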

In the experiments presented in References [26, 28], the dimensions of the video sequences 
were chosen to simulate a multimedia service in a UMTS scenario. Therefore, QCIF 
(144 x 176) video sequences, which match well the limited dimensions of a mobile 
terminal's display, were employed. Sample results are shown in Figures 21.3 and 21.4. 
As shown in the figures, the quality degradation of the watermark embedded into the host 
video has the same behavior as the one affecting the video. 

Figure 21.3 Watermark MSE and video sequence MSE (normalized to 1) vs the BER for 
the sequence 'Akiyo' MPEG-2 coded at 600 kb/s. 

Figure 21.4 Watermark MSE and video sequence MSE (normalized to 1) vs the BER for 
the sequence 'Akiyo' MPEG-2 coded at 200 kb/s. 

21.2 QoS PROVISIONING IN WLAN 

As an example system, the 802.11 WLAN consists of basic service sets (BSS), each of 
which is composed of wireless stations (STA). The WLAN can be configured as an ad hoc 
network (an independent BSS) or an infrastructure network (composed of an access point 
and the associated STAs). 

The channel access for the STAs in a BSS is under the control of a coordination function. 
The 802.11 MAC protocol provides two coordination functions: distributed coordination 
function (DCF) and point coordination function (PCF). The DCF is a contention-based 
access scheme using carrier sense multiple access with collision avoidance (CSMA/CA). 

Priority levels for access to the channel are provided through the use of interframe 
spaces such as short interframe space (SIFS) and distributed interframe space (DIFS). 
The backoff procedure is used for collision avoidance, where each STA waits for a backoff 
time (a random time interval in units of slot times) before each frame transmission. The PCF 
provides contention-free frame transmission in an infrastructure network using the point 
coordinator (PC), operating at the access point (AP), to determine which STA currently 
gets the channel access. The DCF and the PCF can coexist by alternating the contention 
period (CP), during which the DCF is performed, and the contention-free period (CFP), 
during which the PCF is performed. A CFP and a CP are together referred to as a repetition 
interval or a superframe. Different aspects of the 802.11 WLAN are discussed in References 
[29-40]. The performance analysis of the DCF was studied in References [29, 30, 35]. The 
performance of the DCF degrades under high traffic loads due to frequent collisions. The 
CSMA/CA is not suitable for data traffic at higher channel speeds [29] due to the large waste on the 
backoff time. The influence of various sizes of the backoff time on the channel throughput 
and the optimal setting of the backoff time was studied in Cali et al. [31]. A common 
technique is to adjust the backoff time according to the traffic priority. The PCF based on a 
polling scheme is suitable for time-bounded real-time traffic. The simple polling schedules 
for voice traffic and video traffic are presented in Crow et al. [35]. Some complex polling 
schedules are proposed in References [32-34, 36, 39]. 

To expand support for applications with QoS requirements, the IEEE 802.11e task group 
[37] is proceeding to build the QoS enhancements of the 802.11 MAC. The techniques for 
providing prioritized and parameterized QoS data deliveries have been discussed in the task 
group. An enhanced DCF was proposed, where each traffic flow is assigned a different 
backoff time whose value decreases with increasing traffic priority, to achieve the prioritized 
QoS data delivery. To guarantee the bounded delay requirements in the parameterized QoS 
data delivery, the hybrid coordination function (HCF) was proposed [42]. In the HCF, 
an STA can be guaranteed to issue the frame transmission even during the CP using the 
contention-free burst (CFB). The CFB can be considered a temporary CFP during which 
the transmissions of STAs are coordinated by the polling scheme, as in the PCF. Moreover, 
a multipolling mechanism (called contention-free multipoll, CF-Multipoll) was proposed 
[41] to reduce the polling overhead that the traditional PCF suffers from. In the multipoll, 
the PC can poll more than one STA simultaneously using a single polling frame. 

In this section, we consider how to efficiently serve real-time traffic in the IEEE 802.11 
WLAN by using the multipolling mechanism. The multipolling mechanism can increase the 
channel utilization and is robust in mobile and error-prone environments. The mechanism 
can be used in the PCF and the HCF. Moreover, a polling schedule is provided to guarantee 
the bounded delay requirements of real-time flows. 



21.2.1 Contention-based multipolling 

In the PCF or the HCF, each STA in the polling list takes a polling frame when polled. This 
polling scheme is called SinglePoll. The number of polling frames for a polling list can 
be reduced if a multipolling mechanism is used. Here, we discuss an efficient multipolling 
mechanism that has the advantages of high channel utilization and low implementation 
overhead. This multipolling mechanism, referred to as contention period multipoll (CP- 
Multipoll), incorporates the DCF access scheme into the polling scheme. 

In the DCF, any contending STA for the channel will select a backoff time in units of 
slot times and execute the backoff procedure as follows: if the channel is sensed idle for a 
DIFS period, an STA starts the transmission immediately. Otherwise, an STA should defer 
until the channel becomes idle for a DIFS period and then the backoff time is decreased. 
If the channel is idle for a slot time, an STA decreases the backoff time by one or else 
freezes the backoff time. When the backoff time becomes zero, an STA begins the frame 
transmission. If a collision occurs, the STA doubles the backoff time used in the last 
transmission and executes the backoff procedure again. In the DCF, the virtual carrier 
sensing using the Network Allocation Vector (NAV) is performed at the MAC sublayer. The 
information on the duration of a frame exchange sequence for one STA is included in the 
frame header (duration field) and is announced to other STAs. Other contending STAs will 
wait for the completion of the current frame exchange by updating their NAVs according to 
the announced duration information. For a communication between two STAs, the RTS 
(request to send) and CTS (clear to send) frames are exchanged before the data frame 
transmission. Other STAs defer their channel access by setting their NAVs according to the 
duration field in the RTS, the CTS, or the data frame. The exchange of RTS/CTS frames can 
also avoid the hidden terminal problem [35]. 

The basic idea of CP-Multipoll is to transform the polling order into the contending 
order, which indicates the order of winning the channel contention. Different backoff 
time values are assigned to the flows in the polling group and the corresponding STAs execute 
the backoff procedures after receiving the CP-Multipoll frame. The contending order of 
these STAs is the same as the ascending order of the assigned backoff time values. Therefore, 
to maintain the polling order in a polling group, we can assign the backoff time value 
incrementally according to the expected polling order. 
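
The mapping from polling order to contending order can be sketched in a few lines. This is a simplified slot-level model written for illustration only: the function names are hypothetical, backoff values are assumed to start at one slot and increase by one per poll record, and frame durations, DIFS deferral and collisions are not modeled.

```python
def assign_backoffs(polling_order):
    """Assign incrementally increasing backoff values (in slots) following the
    expected polling order; the first polled STA gets the smallest value."""
    return {sta: i + 1 for i, sta in enumerate(polling_order)}

def simulate_contention(backoffs, silent=()):
    """Count down the backoff timers slot by slot and return the order in which
    STAs win the channel, plus the number of idle slots consumed.

    STAs listed in `silent` missed the multipoll frame and never transmit, so
    the remaining STAs simply keep counting down without waiting for them."""
    counters = dict(backoffs)
    order = []
    idle_slots = 0
    while counters:
        for sta in counters:            # one idle slot elapses for every STA
            counters[sta] -= 1
        idle_slots += 1
        ready = [s for s, c in counters.items() if c == 0]
        for sta in ready:               # timers are frozen while a STA transmits
            del counters[sta]
            if sta not in silent:
                order.append(sta)
    return order, idle_slots

if __name__ == "__main__":
    b = assign_backoffs(["C", "A", "B"])
    print(simulate_contention(b))                 # contending order follows polling order
    print(simulate_contention(b, silent={"A"}))   # a missing STA costs only one idle slot
```

Because the assigned values are distinct, at most one timer expires in any slot, so the order of channel access reproduces the polling order without further polling frames.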

The CP-Multipoll has the following advantages: a polled STA can hold the channel 
access flexibly depending on the size of its locally buffered data, so it becomes easy to deal 
with data bursts; if a polled STA makes no response to the CP-Multipoll, the other STAs in 
the same polling group will detect the idle channel right away and advance the start of 
channel contention. Therefore, the CP-Multipoll can reduce the waste of channel time 
and can tolerate polling errors. 

21.2.2 Polling efficiency 

We consider the environment with overlapping BSSs. In the overlapping BSS, we distinguish 
the STAs associated with one BSS from the STAs associated with neighboring BSSs by 
using the terms 'internal STAs' and 'external STAs', respectively. Moreover, an STA that 
will cause an internal collision is called a nonbehaved STA; otherwise, the STA is called a 
behaved one. 

The following terminology is used in the performance analysis: 

• frame_num, maximum number of data frames allowed to be transmitted in a TXOP 
(transmission opportunity); 

• $l_{type}$, number of bits in a 'type' frame; 

• $t_{type}$, transmission time on the channel for a 'type' frame; 

• single-poll, the single polling frame in the single poll (i.e. CF-Poll frame); 

• $n$, poll size (i.e. the number of poll records in a multipolling frame); 

• $n$-poll, the multipolling frame with $n$ poll records; 

• $ERR_{type}$, probability that a 'type' frame is dropped due to bit errors; 

• $\alpha$, probability that an STA has no data to send when polled; 

• $\beta$, probability that an STA becomes a nonbehaved STA; 

• $h$, number of PCs operating in the overlapping space; 

• InitBT, initial backoff time; 

• $E$, polling efficiency. 

The following assumptions will be used in the analysis: 

(1) The PC (point coordinator) always performs the initial backoff before sending any 
polling frame. 

(2) The data frame is transmitted without any acknowledgment. When an STA is polled, 
the STA either sends half of frame_num data frames on average or sends a null data 
frame if there is no data to send. 

(3) There is an equal probability BER for a bit error to occur due to the channel noise 
(interference, fading or multipath). Hence, $ERR_{type}$ can be expressed as 
$1 - (1 - BER)^{l_{type}}$. 

(4) An STA becomes a nonbehaved one if the STA fails to receive the CTS frame from 
the PC. That is, $\beta = ERR_{CTS}$. 

Figure 21.5 The state diagram of the multipolling mechanism. (Reproduced by permission 
of IEEE [43].) 

If AvgD and AvgT denote the total number of bits in the data frames successfully sent 
from the polled STAs and the average completion time in time units for a poll, respectively, 
then the polling efficiency is defined as $E = AvgD / AvgT$. This represents the average uplink 
data rate during a poll. 

In the single poll, a polled STA contributes data frames if the STA successfully receives 
a CF-Poll frame and has pending data frames to be successfully transmitted. The polled 
STA may suffer frame error due to interference from external STAs. Therefore, 

$$AvgD = (1 - \alpha) \cdot \frac{frame\_num}{2} \cdot l_{data\_frame} \cdot (1 - ERR_{data\_frame}) \cdot (1 - ERR_{single\_poll}) \tag{21.5}$$

The polled STA will give a response to the PC after an SIFS period for a successful CF-Poll. If 
a polled STA does not respond to the PC after a PIFS period for a failed CF-Poll, the PC takes 
over the channel control and may send the next CF-Poll frame. In the HCF, the RTS/CTS 
frames should be exchanged before the data transmission to prevent the interference from 
other STAs. Since the HCF is being substituted for the PCF in the 802.11e, we consider 
the single poll of the HCF in our analysis. Therefore, 

$$\begin{aligned} AvgT = {} & InitBT + (t_{single\_poll} + PIFS) \cdot ERR_{single\_poll} \\ & + \Big[ t_{single\_poll} + SIFS + t_{RTS} + t_{CTS} + 2\,SIFS + (1 - \alpha) \cdot \frac{frame\_num}{2} \cdot (t_{data\_frame} + SIFS) \\ & \quad + \alpha \cdot (t_{null\_frame} + SIFS) \Big] \cdot (1 - ERR_{single\_poll}) \end{aligned} \tag{21.6}$$
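
As a numerical illustration of Equations (21.5) and (21.6), the sketch below evaluates the single-poll efficiency E = AvgD/AvgT. The function names, the dictionary keys and the example numbers in the usage part are hypothetical; frame lengths are assumed to be given in bits and all times in the same unit.

```python
def frame_error_prob(ber, bits):
    """ERR_type = 1 - (1 - BER)^l_type, from assumption (3)."""
    return 1.0 - (1.0 - ber) ** bits

def single_poll_efficiency(ber, alpha, frame_num, l, t, init_bt, sifs, pifs):
    """Polling efficiency of the single poll, following Eqs. (21.5)-(21.6).

    l and t map the frame types 'data', 'null', 'poll', 'rts', 'cts' to their
    length in bits and transmission time; only l['data'] and l['poll'] are used
    because only those two error probabilities appear in the equations."""
    err_data = frame_error_prob(ber, l["data"])
    err_poll = frame_error_prob(ber, l["poll"])
    avg_d = ((1 - alpha) * frame_num / 2 * l["data"]
             * (1 - err_data) * (1 - err_poll))
    # Time spent when the CF-Poll frame is received correctly.
    ok_time = (t["poll"] + sifs + t["rts"] + t["cts"] + 2 * sifs
               + (1 - alpha) * frame_num / 2 * (t["data"] + sifs)
               + alpha * (t["null"] + sifs))
    avg_t = init_bt + (t["poll"] + pifs) * err_poll + ok_time * (1 - err_poll)
    return avg_d / avg_t

if __name__ == "__main__":
    # Illustrative numbers only (not the values of Table 21.1).
    l = {"data": 1000, "null": 100, "poll": 100, "rts": 100, "cts": 100}
    t = {"data": 10.0, "null": 1.0, "poll": 1.0, "rts": 1.0, "cts": 1.0}
    for ber in (0.0, 1e-5, 1e-4):
        print(ber, single_poll_efficiency(ber, 0.2, 3, l, t, 3.0, 1.0, 2.0))
```

With an error-free channel the expression reduces to the useful payload divided by the fixed protocol overhead, and the efficiency falls as the BER rises.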

Next, we use a state diagram to represent the situation after sending a multipolling frame 
with poll size $n$. In Figure 21.5, the state $m$ : fail ($0 \le m \le n$) represents the event that 
there are $m$ STAs in the polling group which failed to receive the multipolling frame. Let 
$P_{n,m}$ denote the probability of the state $m$ : fail. Let $D_{n,m}$ and $T_{n,m}$ denote the amount of 
uplink data frames in bits and the total time duration in time units under the state $m$ : fail, 
respectively. Generally, we have the following values: 

$$\begin{aligned} P_{n,m} &= \binom{n}{m} \cdot (ERR_{n\text{-}poll})^{m} \cdot (1 - ERR_{n\text{-}poll})^{n-m} \\ AvgD &= \sum_{m=0}^{n} P_{n,m} \cdot D_{n,m} \\ AvgT &= InitBT + \sum_{m=0}^{n} P_{n,m} \cdot T_{n,m} \end{aligned} \tag{21.7}$$

For the CF-Multipoll, the external STAs in the overlapping BSS will have their TXOPs 
overlap the ones allocated to the internal STAs. We assume that the interference from 
external STAs has been included in the parameter BER. Hence, 

$$D_{n,m} = (n - m) \cdot (1 - \alpha) \cdot \frac{frame\_num}{2} \cdot l_{data\_frame} \cdot (1 - ERR_{data\_frame})$$

Also, each successive TXOP starts an SIFS period after the predecessor's TXOP limit 
expires in the CF-Multipoll. Note that the time spent by a polled STA is fixed regardless of 
the number of pending data frames. Hence, 

$$T_{n,m} = t_{n\text{-}poll} + n \cdot SIFS + n \cdot frame\_num \cdot (t_{data\_frame} + SIFS) \tag{21.8}$$
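
The state-probability weighting of Equation (21.7), combined with the CF-Multipoll expressions for D and T in Equation (21.8), can be evaluated as in the following sketch. The function and argument names are hypothetical, and the numbers in the usage part are illustrative rather than taken from Table 21.1.

```python
from math import comb

def frame_error_prob(ber, bits):
    """ERR_type = 1 - (1 - BER)^l_type."""
    return 1.0 - (1.0 - ber) ** bits

def state_probs(n, err_npoll):
    """P_{n,m} for m = 0..n: binomial probability that m of the n polled STAs
    fail to receive the multipolling frame."""
    return [comb(n, m) * err_npoll ** m * (1 - err_npoll) ** (n - m)
            for m in range(n + 1)]

def cf_multipoll_efficiency(n, ber, alpha, frame_num, l_data, l_npoll,
                            t_data, t_npoll, sifs, init_bt):
    """E = AvgD / AvgT for the CF-Multipoll, following Eqs. (21.7)-(21.8)."""
    err_data = frame_error_prob(ber, l_data)
    probs = state_probs(n, frame_error_prob(ber, l_npoll))
    avg_d = 0.0
    avg_t = init_bt
    for m, p in enumerate(probs):
        d = (n - m) * (1 - alpha) * frame_num / 2 * l_data * (1 - err_data)
        # The TXOPs are fixed, so the time does not depend on m.
        t = t_npoll + n * sifs + n * frame_num * (t_data + sifs)
        avg_d += p * d
        avg_t += p * t
    return avg_d / avg_t

if __name__ == "__main__":
    for n in (1, 5, 10):
        print(n, cf_multipoll_efficiency(n, 1e-5, 0.2, 3, 1600, 16 * 8 + 64 * n,
                                         10.0, 1.0, 1.0, 3.0))
```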

In the CP-Multipoll, there are two processing phases: one is for the normal multipoll and the 
other is for the error recovery. In the normal phase, there are $(n - m)$ STAs successfully 
receiving the multipolling frame under the state $m$ : fail. Among these STAs, $(1 - \beta)(n - m)$ 
STAs are behaved STAs and $\beta(n - m)$ STAs are nonbehaved ones. Each nonbehaved STA 
is assumed to destroy one data frame transmitted by a behaved one. In the recovery phase, 
the $m$ STAs that failed to receive the multipolling frame and the $\beta(n - m)$ nonbehaved STAs 
will be served individually using the CP-Multipoll with poll size one. The analysis of this 
phase is similar to the one in the single poll. Hence, $D_{n,m}$ has the following value: 

$$\begin{aligned} D_{n,m} &= D_{n,m}^{normal} + D_{n,m}^{recovery} \\ D_{n,m}^{normal} &= \left[ (1 - \beta) \cdot (n - m) \cdot \frac{frame\_num}{2} - \beta (n - m) \right] \cdot (1 - \alpha) \cdot l_{data\_frame} \cdot (1 - ERR_{data\_frame}) \\ D_{n,m}^{recovery} &= \left[ m + \beta (n - m) \right] \cdot (1 - \alpha) \cdot \frac{frame\_num}{2} \cdot l_{data\_frame} \cdot (1 - ERR_{data\_frame}) \cdot (1 - ERR_{1\text{-}poll}) \end{aligned} \tag{21.9}$$

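The duration of the normal phase is dominated by the transmission time of the 
$(1 - \beta)(n - m)$ behaved STAs. The total backoff time consumed in the normal phase is 
$h \times n + 1$ slots, regardless of the value $m$. Hence, 

$$\begin{aligned} T_{n,m} = {} & T_{n,m}^{normal} + T_{n,m}^{recovery} \\ T_{n,m}^{normal} = {} & t_{n\text{-}poll} + (h \cdot n + 1) \cdot Slot + (1 - \beta) \cdot (n - m) \cdot \Big( t_{RTS} + t_{CTS} + 2\,SIFS \\ & + (1 - \alpha) \cdot \frac{frame\_num}{2} \cdot (t_{data\_frame} + SIFS) + \alpha \cdot t_{null\_frame} \Big) \\ T_{n,m}^{recovery} = {} & \left[ m + \beta (n - m) \right] \cdot \Big[ (t_{1\text{-}poll} + 2\,Slot) \cdot ERR_{1\text{-}poll} + \Big( t_{1\text{-}poll} + 2\,Slot + t_{RTS} + t_{CTS} + 2\,SIFS \\ & + (1 - \alpha) \cdot \frac{frame\_num}{2} \cdot (t_{data\_frame} + SIFS) + \alpha \cdot t_{null\_frame} \Big) \cdot (1 - ERR_{1\text{-}poll}) \Big] \end{aligned} \tag{21.10}$$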

The superpoll can poll a group of STAs together, as can the CF-Multipoll and the CP-Multipoll. 
In the superpoll, the polled STA will attach the poll records of those polled STAs whose 
polling orders are after it to its current transmitted data frame. This scheme can be considered 
as one with replicated poll records. To keep the correct polling order, each polled STA in the 
polling group should monitor the channel and check whether the previous STA has finished 
sending a data frame. If a polled STA fails to receive its poll record, the next following STA 
in the polling group will wait for a timeout before its own frame transmission. However, the 
situation where some STAs cannot listen to other STAs' channel activities is not considered. 
This may cause inconsistent setting of timers among polled STAs and may cause internal 
collisions. 

Table 21.1 System and poll-related parameters. (Reproduced by permission of IEEE [43]) 

Parameter                                              Value 

Channel rate                                           11 Mb/s 
PHY header                                             192 b 
MAC header                                             272 b 
Slot                                                   20 µs 
SIFS                                                   10 µs 
DIFS                                                   50 µs 
PIFS                                                   30 µs 
Superframe                                             25 ms 
CFP_Max_Duration                                       20 ms 
frame_num                                              3 
l_data_frame (octets)                                  200 (default), 400, 600, 800 
l_null_frame (octets)                                  34 
l_single_poll (octets)                                 34 
l_n_poll (octets)                                      16 + 8n 
n                                                      1-20 
BER                                                    10^-3, 10^-4, 10^-5 (default), 10^-6 
α                                                      0.2 (default), 0.4, 0.6, 0.8 
h, the number of PCs operating in the 
overlapping space                                      1, 2 (default), 3, 4 
InitBT (µs)                                            90 (default), 110, 130, 150 



21.2.2.1 Performance example 

The parameter settings related to the system and the polling schemes are listed in Table 21.1 
[43]. The performance improvement of CP-Multipoll in percentage over other schemes is 
shown in Figure 21.6. 

Figure 21.6 Comparison between single poll and CP-Multipoll. 



21.3 DYNAMIC SCHEDULING ON RLC/MAC LAYER 

In wireless networks, the fading characteristics of the wireless physical channel may 
introduce location-dependent channel errors. A scheduling algorithm, referred to as channel- 
condition independent packet fair queueing (CIF-Q), is presented in Ng et al. [44] to 
solve the problem of location-dependent channel errors in wireless networks by suspending 
the transmission service of a connection of a mobile station (MS) when the MS is in a high 
BER region. To compensate for the service loss of the MS in a high BER region, the CIF-Q 
scheduling algorithm increases the service priority after the MS returns to a low BER region 
to fulfill its QoS specification. However, this suspension may cause long delays for the 
connection of the MS, because the duration that an MS resides within a high interference 
region may be unpredictably long. In many services, such as video transmissions, minor 
errors are acceptable to the receiver-side applications, but a long delay is not. 

In Yars et al. [45], a scheduling scheme at the RLC/MAC layer is proposed, referred to 
as a dynamic scheduling mechanism for mobile communication (DSMC). The scheme is 
discussed in the context of GPRS applications. DSMC is a dynamic scheduling architecture 
that can conform to a variety of QoS requirements in the networks without changing the 
coding rate on the RLC layer. The DSMC scheduling scheme, which is based on the self-clocked 
fair queuing (SCFQ) algorithm [46], can dynamically adjust the service rate (weight) for a 
particular connection in accordance with the channel quality. It will use the low service rate 
when an MS is within a high interference (high-IF) region to reduce bandwidth waste due 
to retransmissions, and use the high service rate when the MS is within a low interference 
(low-IF) region to fulfill the QoS requirements (e.g. delay bound and loss ratio) of the MS. 
The high and low service rates are both determined at connection setup time. Thus, the 
complexity of the scheduling algorithm in wireless networks can be reduced. 



21.3.1 DSMC functional blocks 

An MS issues a connection request to the CAC controller by specifying its QoS requirements 
$(D_{e,k}, R_p, R_m)$, its priority level $e$, as well as the number of blocks $m$ to be transmitted. $D_{e,k}$ 
is the delay bound required for the requested session $k$ with a priority of $e$ on the air interface, 
$U_m$. $R_p$ and $R_m$, respectively, denote the peak and the minimum data rates requested by 
the session $k$. In other words, the MS requests that $m$ blocks of the session $k$ should be 
scheduled in a queue of priority level $e$, and transmitted at a minimum rate higher than $R_m$ 
or a peak rate not higher than $R_p$ before the delay bound $D_{e,k}$ is reached. The minimum data 
rate $R_m$ is used as the low service rate when the MS is within a high interference region. On 
the other hand, the peak data rate $R_p$ is not taken for granted as the high service rate when 
the MS is within a low interference region. Instead, the high service rate is determined by 
a simple calculation performed by the service rate (SR) calculator inside the radio resource 
manager. The SR calculator determines the admissible peak data rate (high service rate) $R'_p$ 
that can be supported by the BSS according to the delay bound $D_{e,k}$ and the minimum data 
rate $R_m$ under a hypothetical interference model of the MS. 

As in many examples throughout the book, the interference model is based on a two- 
state Markov chain with transition probabilities $\alpha$ and $\beta$. These parameters $\alpha$ and $\beta$ can 
be collected from a user behavior profile and can be updated dynamically. After receiving 
$R'_p$ from the SR calculator, the CAC controller can optionally accept, reject or renegotiate 
with the MS. If the connection request is accepted, the CAC controller stores the parameters 
$(R'_p, R_m, m, e)$ to a parameter database. The rate selector can then select and send the 
current data rate $R$ to the CCU (channel codec unit) associated with the MS according to 
the interference level measured by the interference monitor. 
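
A minimal sketch of the two-state interference model and of the rate selection it drives is given below. The direction of the transitions is an assumption made here (alpha is taken as the probability of moving from the low- to the high-interference state and beta as the probability of moving back), and the function names and rate values are hypothetical.

```python
def stationary_probs(alpha, beta):
    """Stationary probabilities (P_high, P_low) of a two-state Markov chain,
    ASSUMING alpha = P(low -> high) and beta = P(high -> low)."""
    p_high = alpha / (alpha + beta)
    return p_high, 1.0 - p_high

def select_rate(high_interference, r_min, r_peak_admissible):
    """Rate selector: low service rate R_m in the high-interference region,
    admissible peak rate R'_p in the low-interference region."""
    return r_min if high_interference else r_peak_admissible

def mean_service_rate(alpha, beta, r_min, r_peak_admissible):
    """Long-run average service rate under the stationary distribution."""
    p_high, p_low = stationary_probs(alpha, beta)
    return p_high * r_min + p_low * r_peak_admissible

if __name__ == "__main__":
    print(stationary_probs(0.1, 0.3))
    print(select_rate(True, 8.0, 16.0), select_rate(False, 8.0, 16.0))
    print(mean_service_rate(0.1, 0.3, 8.0, 16.0))
```

If the opposite convention for alpha and beta applies, the two stationary probabilities are simply swapped.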

The DSMC scheduling architecture can be situated in a BSS of the GPRS networks 
and applied to either uplink or downlink transmission. The system will adopt a certain 
number of priority levels. As an example, in GPRS, block transmission is classified into 
five scheduling priority levels, including four data priority levels and a signal priority level, 
which is the highest priority level [47]. The DSMC scheduling architecture follows the 
specification but includes a new data priority level for the retransmission blocks. The DSMC 
scheduling architecture [45] consists of five scheduling servers for data block transmission, 
one server for each priority level. Each server is responsible for scheduling data blocks 
transmitting through a single packet data channel (PDCH). The highest priority level, P1, 
is for the retransmission blocks and the lowest priority level, P5, is for the best-effort data 
blocks. Both P1 and P5 schedule data blocks in a first-come-first-served order. However, 
the scheduling servers of priority levels P2-P4 adopt the SCFQ scheduling algorithm 
in scheduling data blocks of QoS-specific connections. In other words, multiple queues 
may exist in each priority of P2-P4. The queues with the same priority will be served in 
accordance with the SCFQ scheduling algorithm, explained below. 

The block requests of a particular priority cannot be served until all the blocks of the 
higher priority have been served. The transmission of a block is nonpreemptive. 

The SCFQ scheduling algorithm is basically a packet-based general processor sharing 
scheme [48, 49] without the complex virtual clock tracking mechanism. The elimination 
of the virtual clock tracking mechanism makes SCFQ easier to implement on a high-speed 
network. In SCFQ, an arriving block request is tagged with a service finish time (FT) before 
it is placed in a queue. The service FT tag of a block request is computed from the service 
time and the service starting time of the block as 

$$FT = \frac{\text{block length}}{\text{service rate}} + \max(\text{FT of the tail block},\ \text{FT of the serving block})$$

The service starting time of the block can be the FT of the tail block of the queue if the 
queue is nonempty, or it is the FT of the serving block. The block requests among the heads 
of the queues will be picked up to be served one by one in accordance with the increasing 
order of FT tags, and in a round-robin fashion if two or more heading blocks have the 
same FT tag. 
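
The tagging and selection rules of SCFQ can be sketched as follows. This is a simplified illustration with hypothetical class and method names: it covers a single priority level, and ties between equal FT tags are broken by arrival order rather than by the round-robin rule described above.

```python
from collections import deque

class SCFQ:
    """Minimal self-clocked fair queueing sketch for one priority level."""

    def __init__(self, rates):
        self.rates = dict(rates)                    # session -> service rate
        self.queues = {s: deque() for s in self.rates}
        self.last_tag = {s: 0.0 for s in self.rates}
        self.serving_tag = 0.0                      # FT of the block in service
        self.seq = 0                                # arrival counter (tie-break)

    def enqueue(self, session, length):
        """Tag the block with FT = length/rate + max(tail FT, serving FT)."""
        start = max(self.last_tag[session], self.serving_tag)
        tag = start + length / self.rates[session]
        self.last_tag[session] = tag
        self.queues[session].append((tag, self.seq, length))
        self.seq += 1
        return tag

    def dequeue(self):
        """Serve the heading block with the smallest FT tag; None if idle."""
        heads = [(q[0][0], q[0][1], s) for s, q in self.queues.items() if q]
        if not heads:
            return None
        tag, _, session = min(heads)
        self.queues[session].popleft()
        self.serving_tag = tag
        return session, tag

if __name__ == "__main__":
    sched = SCFQ({"A": 1.0, "B": 2.0})
    for s in ["A", "B", "A", "B"]:
        sched.enqueue(s, 2.0)
    while (item := sched.dequeue()) is not None:
        print(item)
```

Session B, which has twice the service rate of session A, receives smaller finish tags for equal-length blocks and is therefore served first and more often.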



21.3.2 Calculating the high service rate 

In order to determine the service rate, we need to first obtain the total delay of a block. 
The total delay depends on the service discipline and the input traffic. Owing to the bursty 
nature of multimedia traffic streams, a leaky-bucket regulator, as discussed in Chapter 8, 
is assumed at the network interface of the sender site to regulate the block request flow of 
each session. 

In the DSMC scheduling scheme, a block of a connection will experience queueing 
delay, block transmission delay, and retransmission delay. A block at the head of a queue 
(henceforth referred to as a 'heading block') may experience a priority delay, which, in turn, 
consists of an interpriority delay caused by all block transmissions in the higher priority 
levels and an intrapriority delay resulting from the SCFQ scheduling. 

Each preceding block of the newly arrived block will experience a priority delay and a 
block transmission delay when the preceding block becomes a heading block. In addition, 
the newly arrived block will experience an intrapriority delay when it becomes the heading 
block of the queue itself. Hence, the queueing delay of a block is the sum of the priority 
delays and the block transmission delays experienced by all preceding blocks, plus the 
intrapriority delay experienced by the block itself. In the analytical modeling the following 
notation will be used: 

• $s_j$, session $j$; 

• $L$, block length (b); 

• $P_e^{hi}$, block error probability when the block is transmitting within a high-interference 
region; 

• $P_e^{li}$, block error probability when the block is transmitting within a low-interference 
region; 

• $P_{hi}$, stationary probability that the MS is within a high-interference region; 

• $P_{li}$, stationary probability that the MS is within a low-interference region; 

• $(r_{e,k})^{hi}$, low service rate of session $k$ of priority $e$; 

• $(r_{e,k})^{li}$, high service rate of session $k$ of priority $e$; 

• $(HD_{e,k})^{hi}$, heading-block delay of session $k$ of priority $e$ within a high-interference region; 

• $(HD_{e,k})^{li}$, heading-block delay of session $k$ of priority $e$ within a low-interference region; 

• $b_{e,k}$, mean queue length of the session $k$ of priority $e$; 

• $w_{e,k}$, mean queueing delay of the session $k$ of priority $e$; 

• $dp$, data priorities. 

The interpriority delay is the time that the selected heading block waits for the blocks 
from higher priority levels to be served. In the following, we refer to the priority level 
under discussion as the priority $e$. Let $A_{p,s}[t_1, t_2]$ denote the amount of blocks arriving to 
a session $s$ with a priority $p$ during a time interval $(t_1, t_2)$, and $W_{p,s}[t_1, t_2]$ the number of 
blocks served for a session $s$ with a priority $p$ during a time interval $(t_1, t_2)$. As shown in 
Figure 21.7, after the $(i - 1)$th selected heading block has been served, the $i$th heading 
block selected by the server of the priority $e$ may encounter an interpriority delay, which 
is the time period $(t_1', t_2)$. Each session $j$ with a priority $g$ higher than $e$ must be served 
up within the time period $(t_1', t_2)$. Thus, the amount of traffic served during the time period 
$(t_1', t_2)$, $W_{g,j}[t_1', t_2]$, is equal to the amount of arrival traffic to the session during the time 
period $(t_1, t_2)$, $A_{g,j}[t_1, t_2] = A_{g,j}[t_1, t_1' + \Delta t_e]$, as in Equation (21.11). Here, $t_1'$ is the epoch 
at which the service of the $(i - 1)$th heading block selected from the priority $e$ finishes, and 
$\Delta t_e$ is the interpriority delay encountered by each heading block selected from the priority $e$: 

$$W_{g,j}[t_1', t_2] = A_{g,j}[t_1, t_2], \quad \forall t_2 > t_1' > t_1,\ t_1 > 0 \tag{21.11}$$

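Let $H$ represent the set of all priority levels higher than $e$. Summing up Equation (21.11) 
for all sessions of the priorities in $H$, we have: 

$$\sum_{g \in H} \left[ \sum_{s_j \in g} W_{g,j}[t_1', t_2] \right] = \sum_{g \in H} \left[ \sum_{s_j \in g} A_{g,j}[t_1, t_2] \right], \quad \forall t_2 > t_1' > t_1,\ t_1 > 0 \tag{21.12}$$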

where $s_j$ stands for session $j$. The traffic $A_{g,j}[t, t + \tau]$ during a time period $(t, t + \tau)$ has 
an upper bound since the DSMC scheduling architecture uses a leaky bucket to regulate 
the flow of each session [8]. Let $A^*_{g,j}(\tau)$ denote the upper bound of $A_{g,j}[t, t + \tau]$. From 
Chapter 8 and also Cruz [50], we have the following inequality according to the leaky-bucket 
constrained envelope function: 

$$A_{g,j}[t, t + \tau] \le A^*_{g,j}(\tau) = \sigma_{g,j} + \rho_{g,j} \cdot \tau, \quad \forall t > 0,\ \forall \tau > 0 \tag{21.13}$$

where $\sigma_{g,j}$ is the leaky-bucket size and $\rho_{g,j}$ is the token arrival rate for the session $j$ of the 
priority $g$. Therefore, from the above leaky-bucket constraint, we can derive the following 
inequality: 

$$\sum_{g \in H} \left[ \sum_{s_j \in g} A_{g,j}[t_1, t_2] \right] \le \sum_{g \in H} \left[ \sum_{s_j \in g} A^*_{g,j}(t_2 - t_1) \right], \quad \forall t_2 > t_1' > t_1,\ t_1 > 0 \tag{21.14}$$

Figure 21.7 Delay caused by higher priority session to the particular priority. (Reproduced 
by permission of IEEE [45].) 



Since the retransmission of data blocks has the highest priority and there is only one
retransmission queue, we can denote the arrival traffic to the retransmission queue during
the period $(t_1, t_2)$ as $A_{1,1}[t_1, t_2]$. The retransmission traffic is the aggregated
traffic of the retransmission of all sessions. By observing the system over a substantially long
period of time, we can find that $A_{1,1}[t_1, t_2]$ also conforms to the leaky-bucket constraint,
and we rewrite the above inequality, Equation (21.14), as

$$A_{1,1}[t_1, t_2] + \sum_{g \in H,\, g \ne 1} \Big[ \sum_{s_j \in g} A_{g,j}[t_1, t_2] \Big] \le A^*_{1,1}(t_2 - t_1) + \sum_{g \in H,\, g \ne 1} \Big[ \sum_{s_j \in g} A^*_{g,j}(t_2 - t_1) \Big], \quad \forall t_2 > t_1' > t_1,\ t_1 > 0 \qquad (21.15)$$

By using the notation specified earlier we can derive the equation for $A^*_{1,1}(t_2 - t_1)$ as

.Y.sjesVsJ + PS.j-ih-h) 



Ai,dh,t 2 ] < a* x (t 2 - h) = l x \ r- 

se{dp) 



1 X X Phi 



+ -^o^ + A,r(^-M i x ^ x ft 



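(21.15a)

By combining Equation (21.12) and inequality (21.15), we have

$$\sum_{g \in H} \Big[ \sum_{s_j \in g} W_{g,j}[t_1', t_2] \Big] \le A^*_{1,1}(t_2 - t_1) + \sum_{g \in H,\, g \ne 1} \Big[ \sum_{s_j \in g} A^*_{g,j}(t_2 - t_1) \Big], \quad \forall t_2 > t_1' > t_1,\ t_1 > 0$$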



Finally, if $C_i$ is the link capacity of the radio band (channel) $i$, by applying the inequality
in Equation (21.13) we have

$$C_i \times \Delta t_e \le A^*_{1,1}(\Delta t_e + t_1' - t_1) + \sum_{g \in H,\, g \ne 1} \Big[ \sum_{s_j \in g} \big( \sigma_{g,j} + \rho_{g,j} \times (\Delta t_e + t_1' - t_1) \big) \Big], \quad \forall t_2 > t_1' > t_1,\ t_1 > 0 \qquad (21.16)$$

From the above inequality, we calculate the maximum value of $\Delta t_e$.
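
As a rough illustration of how the bound in Equation (21.16) can be solved for the delay, the sketch below assumes that the retransmission envelope also has the leaky-bucket form (a bucket size plus a rate times the interval), which the text suggests but does not state explicitly. With the bucket sizes summed to one value, the token rates summed to another, and d = t_1' - t_1, the inequality rearranges to a delay of at most (sum of sizes + sum of rates x d) / (C_i - sum of rates), provided the capacity exceeds the total token rate. The function name and the numbers are illustrative only.

```python
def max_interpriority_delay(capacity, buckets, d):
    """Upper bound on the interpriority delay implied by Equation (21.16).

    capacity -- link capacity C_i (b/s)
    buckets  -- iterable of (sigma, rho) pairs, one per higher-priority
                session (retransmission queue included, by assumption)
    d        -- elapsed time t_1' - t_1 (s)
    """
    sigma_total = sum(sigma for sigma, _ in buckets)
    rho_total = sum(rho for _, rho in buckets)
    if capacity <= rho_total:
        # The bound is finite only if the link outruns the token rates.
        raise ValueError("capacity must exceed the total token rate")
    return (sigma_total + rho_total * d) / (capacity - rho_total)


# Hypothetical numbers: a 1 Mb/s band and two regulated sessions.
bound = max_interpriority_delay(
    1_000_000, [(8_000, 100_000), (4_000, 150_000)], d=0.01
)
```

The bound grows with the total burst size and shrinks as the spare capacity (capacity minus total token rate) increases, which is why tight leaky-bucket parameters keep the interpriority delay short.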

The intrapriority delay is the delay contributed by SCFQ scheduling. It is the delay during
which a heading block of a session waits for the heading blocks of other sessions with the
same priority to be served. Following the results presented by Golestani [51], we can derive
the maximum intrapriority delay of the heading block of a session $k$ with priority $e$ as
$(|K_e| - 1) \cdot (L/C_i)$, where $C_i$ represents the capacity (b/s) of link (band) $i$, $K_e$
represents the set of backlogged sessions for priority $e$, and $|K_e|$ represents the number of
backlogged sessions in the priority $e$.

21.3.3 Heading-block delay 

A heading block will encounter an intrapriority delay, an interpriority delay, and a block
transmission delay of its own. The heading-block delay, denoted HD, for an MS depends
on the transmission rates and can be derived as

$$(HD_{e,k})^{li} = |K_e| \cdot \Delta t_e + (|K_e| - 1) \cdot \frac{L}{C_i} + \frac{L}{(r_{e,k})^{li}} \qquad (21.17)$$

$$(HD_{e,k})^{hi} = |K_e| \cdot \Delta t_e + (|K_e| - 1) \cdot \frac{L}{C_i} + \frac{L}{(r_{e,k})^{hi}} \qquad (21.18)$$

where $(r_{e,k})^{li}$ and $(r_{e,k})^{hi}$ are the service rates (b/s) of session $k$ of priority $e$,
i.e. the high and the low service rate, within the low-IF region and the high-IF region,
respectively. The rightmost term in each of the above equations represents the transmission
delay of the heading block of the session $k$ with the priority $e$. Since $\Delta t_e$ is likely
to be short under the control of the leaky-bucket regulator, HD is relatively small compared with
the mean duration that an MS will stay in an interference region. Therefore, it was assumed that
the interference condition will not change during a heading-block delay.

The queueing delay of a data block is the time during which the block waits until it is
selected for transmission. Clearly, a data block newly arrived in a queue cannot be served
by the corresponding SCFQ server until all the preceding data blocks in the same queue have
been served. Therefore, the queueing delay of a data block includes the time that the block
waits to become a heading block itself, plus the interpriority and intrapriority delays that
the block encounters when it becomes a heading block.

If $b_{e,k}$ represents the mean queue length of a session $k$ with a priority $e$ then, from
Little's result (Chapter 6), the mean queue length encountered by a newly arrived data block is
equal to the block arrival rate multiplied by the mean heading-block delay time. With
$r^{a}_{e,k}$ denoting the block arrival rate, parameter $b_{e,k}$ is given as

$$b_{e,k} = r^{a}_{e,k} \cdot \big[ (HD_{e,k})^{li} \cdot P_{li} + (HD_{e,k})^{hi} \cdot P_{hi} \big] \qquad (21.19)$$

Now, the mean queueing delay $w_{e,k}$ for the session $k$ with a priority $e$ can be calculated as

$$w_{e,k} = r^{a}_{e,k} \cdot \big[ (HD_{e,k})^{li} \cdot P_{li} + (HD_{e,k})^{hi} \cdot P_{hi} \big]^2 + |K_e| \cdot \Delta t_e + (|K_e| - 1) \cdot \frac{L}{C_i} \qquad (21.20)$$
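
The delay terms of this subsection can be evaluated numerically. The following sketch assumes that the heading-block delay decomposes into an interpriority share, the intrapriority bound and the block transmission time, with a fixed block length in bits and a block arrival rate in blocks per second; these parameter names and all numeric values are assumptions made for illustration, not values from the source.

```python
def heading_block_delay(n_backlog, dt_e, block_bits, capacity, service_rate):
    """Heading-block delay: |K_e|*dt_e + (|K_e|-1)*L/C_i + L/r."""
    inter = n_backlog * dt_e                        # interpriority share
    intra = (n_backlog - 1) * block_bits / capacity  # SCFQ waiting bound
    transmit = block_bits / service_rate             # own transmission time
    return inter + intra + transmit


def mean_queueing_delay(arrival_rate, hd_li, hd_hi, p_li, p_hi,
                        n_backlog, dt_e, block_bits, capacity):
    """Return (mean queue length b, mean queueing delay w)."""
    mean_hd = hd_li * p_li + hd_hi * p_hi
    b = arrival_rate * mean_hd  # Little's result
    w = b * mean_hd + n_backlog * dt_e + (n_backlog - 1) * block_bits / capacity
    return b, w


# Hypothetical session: 3 backlogged sessions, 1000-bit blocks, 1 Mb/s band.
hd_li = heading_block_delay(3, 0.002, 1000, 1e6, 2e5)  # high rate, low-IF region
hd_hi = heading_block_delay(3, 0.002, 1000, 1e6, 1e5)  # low rate, high-IF region
b, w = mean_queueing_delay(20.0, hd_li, hd_hi, 0.75, 0.25, 3, 0.002, 1000, 1e6)
```

The queueing delay is the time spent behind the b blocks already queued (each costing one mean heading-block delay) plus the delays the block meets once it reaches the head of its own queue.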

21.3.4 Interference model 

We assume that the general interference model is an interrupted Poisson process with
transition probabilities of $\alpha$ and $\beta$. Hence, the duration that an MS is within the
low-interference region or the high-interference region can be represented, respectively, by an
exponential distribution $1 - e^{-\alpha t}$, denoted $L(t)$, or $1 - e^{-\beta t}$, denoted $H(t)$.

The interference state in which a block is served is determined by the starting state and
the waiting time of the block. Therefore, we use an alternating renewal process to calculate
the probability of the interference state in which a block is served. In this alternating renewal
process, the block waiting time can be divided into several renewal intervals. A renewal
interval consists of two exponential distributions, $L(t)$ and $H(t)$. Let $F(t)$ be the
convolution sum of $L(t)$ and $H(t)$. We use the notation $P_{lh}(W)$ to represent the probability
that a block with a waiting time $W$ arrives when an MS is within a low-interference (IF) region
and is served when the MS is within a high-IF region. Following the same convention, the
notations of probabilities $P_{ll}(W)$, $P_{hl}(W)$ and $P_{hh}(W)$ should be self-explanatory.
These probabilities can be derived as [52]:



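$$P_{ll}(W) = [1 - L(W)] + \int_0^W [1 - L(W - y)] \, d\Big( \sum_{n=1}^{\infty} F_n(y) \Big)$$

$$P_{lh}(W) = 1 - P_{ll}(W)$$

$$P_{hh}(W) = [1 - H(W)] + \int_0^W [1 - H(W - y)] \, d\Big( \sum_{n=1}^{\infty} F_n(y) \Big)$$

$$P_{hl}(W) = 1 - P_{hh}(W) \qquad (21.21)$$

where $F_n$ denotes the $n$-fold convolution of $F$ with itself.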
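
For exponentially distributed sojourn times the alternating renewal process is a two-state continuous-time Markov chain, and the renewal integrals then have a well-known closed form. The sketch below uses that closed form; it treats the two parameters as the rates of leaving the low-IF and the high-IF region (an interpretation assumed here, since the text calls them transition probabilities), and the function name is illustrative.

```python
import math


def interference_probabilities(alpha, beta, w):
    """State probabilities after a waiting time w for exponential sojourns.

    alpha -- rate of leaving the low-IF region,  L(t) = 1 - exp(-alpha * t)
    beta  -- rate of leaving the high-IF region, H(t) = 1 - exp(-beta * t)
    Returns (P_ll, P_lh, P_hh, P_hl).
    """
    total = alpha + beta
    decay = math.exp(-total * w)
    # Stationary share of each state plus a transient that decays with w.
    p_ll = beta / total + (alpha / total) * decay
    p_hh = alpha / total + (beta / total) * decay
    return p_ll, 1.0 - p_ll, p_hh, 1.0 - p_hh
```

For a waiting time of zero the block is served in the state in which it arrived; for long waiting times the probabilities approach the stationary shares of the two regions, independent of the starting state.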



21.3.5 Normal delay of a newly arrived block 

Both the MS-terminated downlink data block and the MS-originated uplink block transmission
requests may arrive at an SCFQ queue when the MS is within either a low-IF region
with a probability $P_{li}$ or a high-IF region with a probability $P_{hi}$. A block may be served
when the MS is within a high-IF region or a low-IF region. So, the normal delay (without
retransmissions) of a newly arrived data block is equal to $(b_{e,k} + 1) \cdot HD_{e,k}$, where
$HD_{e,k}$ can be $(HD_{e,k})^{li}$ or $(HD_{e,k})^{hi}$. By using the above Equation (21.21), we
can obtain the normal delay $(ND_{e,k})$ of a block with a mean waiting time of $w_{e,k}$ as

$$ND_{e,k} = P_{li} \cdot \big[ (b_{e,k} + 1) \cdot (HD_{e,k})^{li} \cdot P_{ll}(w_{e,k}) + (b_{e,k} + 1) \cdot (HD_{e,k})^{hi} \cdot P_{lh}(w_{e,k}) \big]$$
$$\qquad + P_{hi} \cdot \big[ (b_{e,k} + 1) \cdot (HD_{e,k})^{hi} \cdot P_{hh}(w_{e,k}) + (b_{e,k} + 1) \cdot (HD_{e,k})^{li} \cdot P_{hl}(w_{e,k}) \big] \qquad (21.22)$$

The retransmission delay of each retransmission consists of two delays, the waiting time
$T_a$ of a selective ARQ request, and the retransmission time $T_r$. The transmission of a data
block and the retransmission of this block may occur in either interference condition. In
other words, a retransmission cycle may start and end in either interference region. Let $A$
denote the mean retransmission delay starting from a low-IF region, whereas $B$ denotes
the mean retransmission delay starting from a high-IF region. Hence, we can describe the
mean retransmission delays $A$ and $B$ by two cross recursive equations, as shown below. As
a consequence, the retransmission delay $(RD_{e,k})$ of a block can be obtained as

$$A = P^{li}_{be} \{ (T_a + T_r) + A \cdot P_{ll}(T_a + T_r) + B \cdot P_{lh}(T_a + T_r) \}$$

$$B = P^{hi}_{be} \{ (T_a + T_r) + B \cdot P_{hh}(T_a + T_r) + A \cdot P_{hl}(T_a + T_r) \}$$

$$RD_{e,k} = P_{li} \cdot \Big[ A \cdot P_{ll}\Big( w_{e,k} + \frac{L}{(r_{e,k})^{li}} \Big) + B \cdot P_{lh}\Big( w_{e,k} + \frac{L}{(r_{e,k})^{li}} \Big) \Big]$$
$$\qquad + P_{hi} \cdot \Big[ B \cdot P_{hh}\Big( w_{e,k} + \frac{L}{(r_{e,k})^{hi}} \Big) + A \cdot P_{hl}\Big( w_{e,k} + \frac{L}{(r_{e,k})^{hi}} \Big) \Big] \qquad (21.23)$$

21.3.6 High service rate of a session

By summing up the normal delay, Equation (21.22), and the retransmission delay, Equation
(21.23), of a newly arrived block, the SR calculator can calculate the total delay $(TD_{e,k})$
for a newly arrived block in a session $k$ with a priority $e$. For simplicity it is assumed that
the low service rate of a session when the MS is within a high-IF region is a predefined
value. The value of this low service rate can be chosen by a user application in accordance
with the required characteristics of the media stream used in the application. For example,
the low service rate can be assigned as the minimum tolerable decoding rate of a Moving
Picture Experts Group (MPEG) video stream.

If we assume that a session $k$ with a priority $e$ has a QoS specification for $m$ block
transmissions with air-interface delay bound $D_{e,k}$, then the SR calculator can calculate the
high service rate of the session $k$ with a priority $e$ under the constraint of the inequality
$TD_{e,k} \le D_{e,k}/m$.

21.3.6.1 Performance example 

The three algorithms are compared by simulation with the same parameters as in Yang et 
al. [45]. The results are shown in Figure 21.8. 

21.4 QoS IN OFDMA-BASED BROADBAND WIRELESS ACCESS SYSTEMS 

Vector orthogonal frequency division multiplexing (VOFDM) is considered as a base for
BWA systems by the Broadband Wireless Internet Forum (BWIF) [1]. The BWA system is
used with existing wireless LAN technologies such as IEEE 802.11 (a, b), and the IEEE 802.16
Group aims to unify the BWA solutions [56]. The 802.16 Group issued standards in the 10-66
GHz bands, and the IEEE 802.16a Group was formed to develop standards to operate in the 2-11
GHz bands, in which channel impairments, multipath fading and path loss become more
significant with the increase in the number of subscribers.

System performance at high transmission rates depends on the ability of the BWA system
to provide efficient and flexible resource allocation. Recent studies [55,57] on resource
allocation demonstrate that significant performance gains can be obtained if frequency
hopping and adaptive modulation are used in subcarrier allocation, assuming knowledge of
the channel gain in the transmitter.

The resource allocation problem has been considered in many studies. Almost all of them
define the problem as a real-time resource allocation problem in which QoS requirements
are fixed by the application. The QoS requirement is defined as achieving a specified data
transmission rate and BER for each user in each transmission. In this regard, the problem
differs from the water-filling schemes wherein the aim is to achieve Shannon capacity
under the power constraint [57].

Figure 21.8 System performance. (Reproduced by permission of IEEE [45].)

Therefore, in this section we consider the problem where $K$ users are involved in the
OFDMA system to share $N$ subcarriers. Each user is allocated a nonoverlapping set of
subcarriers $S_k$, where the number of subcarriers per user is $J(k)$. In the following, $X_k(l)$
represents the $l$th subcarrier of the FFT block belonging to the $k$th user. $X_k(l)$ is obtained
by coding the assigned $c$ bits with the corresponding modulation scheme. In the downlink the
$X_k(l)$ are multiplexed to form the OFDM symbol of length $(N + L)$ with the appended guard
prefix $L$ in order to eliminate ISI. At the uplink, the equivalent overall OFDM symbol (with
a synchronization error) has the form

$$x(l) = \sum_{k=0}^{K-1} \sum_{n=0}^{J(k)-1} X_k(n) \, e^{j(2\pi/N)[I_k(n)]l} \qquad (21.24)$$

where $l = -L, \ldots, N - 1$, and $I_k(n)$ denotes the subcarrier assigned to the $k$th user.
The resource allocation mechanism associates the set of subcarriers to the users with different
bits loaded into them. The received signal from the $j$th user can be represented as

$$y_j(l) = x(l) \otimes h_j(l) + w(l) \qquad (21.25)$$

where $h_j(l)$ is the baseband impulse response of the channel between BS and the $j$th user.
$y_j(l)$ is the received signal $y_j(t)$ sampled at rate $1/T$. The first $L$ samples are
discarded and the $N$-point FFT is computed. The data of the $j$th user is

$$Y_j(n) = \begin{cases} X_j(n) H_j(i_j(n)) + W(n), & \text{if } i_j(n) \in S_j \\ 0, & \text{otherwise} \end{cases}$$

where $H_j(n) := \sum_i h_j(i) \exp[j(2\pi/N)ni]$ is the frequency response of the channel of the
$j$th user.

The allocation module of the transmitter assigns subcarriers to each user according to
some QoS criteria. QoS metrics in the system are rate and BER. Each user's bit stream
is transmitted using the assigned subcarriers and adaptively modulated for the number of
bits assigned to the subcarrier. The power level of the modulation is adjusted to meet QoS
for given fading of the channel. The transmission power for the AWGN channel can be
predicted. In addition, the channel gain of subcarrier $n$ to the corresponding user $k$ should be
known. The channel gain of the subcarrier is defined as $\alpha_{k,n} = H_k(n) \cdot PL_k$, where
$PL_k$ is the path loss, defined by $PL_k = PL(d_0) + 10\alpha \log_{10}(d_k/d_0) + X_\sigma$.
Parameter $d_0$ is the reference distance, $d_k$ is the distance between transmitter and receiver,
$\alpha$ is the path loss exponent and $X_\sigma$ is a Gaussian random variable for shadowing with
a standard deviation $\sigma$ [58, 59].
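
The log-distance path loss with shadowing can be evaluated directly. The sketch below assumes that all loss terms are expressed in dB and that shadowing is drawn from a zero-mean Gaussian; the function name, the argument names and the numbers are illustrative and do not come from the source.

```python
import math
import random


def path_loss_db(pl_d0_db, exponent, d_k, d_0, sigma_db, rng=random):
    """PL_k = PL(d_0) + 10 * exponent * log10(d_k / d_0) + X_sigma, in dB.

    pl_d0_db -- path loss at the reference distance d_0 (dB)
    exponent -- path loss exponent
    sigma_db -- standard deviation of the shadowing term (dB); 0 disables it
    rng      -- object with a gauss(mu, sigma) method (default: random module)
    """
    shadowing = rng.gauss(0.0, sigma_db) if sigma_db > 0 else 0.0
    return pl_d0_db + 10.0 * exponent * math.log10(d_k / d_0) + shadowing


# Hypothetical user 1 km from the BS, 100 m reference distance, no shadowing.
loss = path_loss_db(40.0, 3.5, 1000.0, 100.0, sigma_db=0.0)
```

Each decade of distance beyond the reference point adds ten times the path loss exponent in dB, so users at the cell edge need considerably more transmit power for the same number of bits.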

With the known channel information, the objective of the resource allocation problem can
be defined as maximizing the throughput subject to a given total power constraint, while
respecting the users' QoS requirements.

If $\gamma_{k,n}$ is the indicator of allocating the $n$th subcarrier to the $k$th user, the
transmission power allocated to the $n$th subcarrier of the $k$th user can be expressed as
$P_{k,n} = f_k(c_{k,n}, BER_k)/\alpha_{k,n}^2$, where $f_k(c_{k,n})$ is the required received power
with unity channel gain for reliable reception of $c_{k,n}$ bits per symbol. Therefore, the
resource allocation problem with an imposed power constraint can be formulated as

$$\max_{c_{k,n},\, \gamma_{k,n}} R_k = \sum_{n=1}^{N} c_{k,n} \gamma_{k,n} \quad \text{for all } k$$

$$\text{subject to } P_T = \sum_{k=1}^{K} \sum_{n=1}^{N} \frac{f_k(c_{k,n}, BER_k)}{\alpha_{k,n}^2} \gamma_{k,n} \le P_{\max} \qquad (21.27)$$

The limit on the total transmission power is expressed as $P_{\max}$ for all
$n \in \{1, \ldots, N\}$, $k \in \{1, \ldots, K\}$ and $c_{k,n} \in \{1, \ldots, M\}$. If there is
no power constraint, Equation (21.27) is changed in order to minimize $P_T$ subject to allocating
$R_k$ bits for all $k$. In other words, the problem is to find the values of $\gamma_{k,n}$ and the
corresponding $c_{k,n}$ while minimizing $P_T$.

The optimal solution in a multiuser environment with multiple modulation techniques
is complicated since it needs to pick the subcarriers in balance. The problem can be
classified according to each set of bits assigned to a subcarrier. For a user $k$,
$f_k(c_{k,n}) \in \{f_k(1, BER_k), \ldots, f_k(M, BER_k)\}$, and $M$ power matrices $\{P^c\}$ of
size $[K \times N]$ can be constructed, one for each $c$. For a constant $c$, $\{f(c)\}$ can be
computed and the transmission power requirement can be found. The dimension of the indicator
function is incremented; it is now represented by $\gamma_{k,n,c}$ and defined as

$$\gamma_{k,n,c} = \begin{cases} 1, & \text{if } \gamma_{k,n} = 1 \text{ and } c_{k,n} = c \\ 0, & \text{otherwise} \end{cases} \qquad (21.28)$$

The above problem can be solved with integer programming (IP). We refer to the IP approach
as the optimal solution to the resource allocation problem. There are $K \cdot N \cdot M$
indicator variables and $M$ power matrices, where the entries of each matrix for a given $c$ can
be found from

$$P^c_{k,n} = \frac{f_k(c, BER_k)}{\alpha_{k,n}^2} \qquad (21.29)$$

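Using Equation (21.29) as an input, the cost function now can be written as

$$P_T = \sum_{k=1}^{K} \sum_{n=1}^{N} \sum_{c=1}^{M} P^c_{k,n} \gamma_{k,n,c} \qquad (21.30)$$

and the description of the IP problem is

$$\min_{\gamma_{k,n,c}} P_T, \quad \text{for } \gamma_{k,n,c} \in \{0, 1\} \qquad (21.31)$$

subject to

$$R_k = \sum_{n=1}^{N} \sum_{c=1}^{M} c \cdot \gamma_{k,n,c} \ \text{ for all } k, \quad \text{and} \quad 0 \le \sum_{k=1}^{K} \sum_{c=1}^{M} \gamma_{k,n,c} \le 1 \ \text{ for all } n$$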

Although the optimal solution gives the exact results, from an implementation point of
view it is too complex. This leads to a search for suboptimal solutions that are fast and close
to the optimal solution. Most attempts to simplify the resource allocation decompose the
problem into two procedures: subcarrier allocation with fixed modulation, and bit loading.
Subcarrier allocation with fixed modulation deals with one $P^c$ matrix with fixed $c$ and then,
by using a bit loading scheme, the number of bits is incremented.

Subcarrier allocation is based on the fact that $f_k(x, y)$ is a convex function [57]. We
can start with $P^1_{k,n}$ and define new rate requirements $R_k$ with $\sum_{k=1}^{K} R_k = N$,
which can be obtained by decrementing $R_k$ properly. This problem can then be solved with
linear programming or with the Hungarian algorithm.



Linear programming

For simplicity, we briefly restate the problem description,

$$P_T = \min \sum_{k=1}^{K} \sum_{n=1}^{N} P^1_{k,n} \rho_{k,n}, \quad \rho_{k,n} \in [0, 1] \qquad (21.32)$$

subject to

$$\sum_{n=1}^{N} \rho_{k,n} = R_k \ \ \forall k \in \{1, \ldots, K\} \quad \text{and} \quad \sum_{k=1}^{K} \rho_{k,n} = 1 \ \ \forall n \in \{1, \ldots, N\}$$

After linear programming, the $[K \times N]$ allocation matrix has entries ranging between 0 and
1. The entries are converted to integers by selecting the $R_k$ highest nonzero values among the
columns for each $k$ and assigning those subcarriers to the $k$th user.



Hungarian algorithm

The problem described above can also be solved by an assignment method such as the
Hungarian algorithm [58]. The Hungarian algorithm works with square matrices. The square
matrix can be formed by repeating the row of each user $k$ $R_k$ times. The problem
formulation is

$$P_T = \min \sum_{k=1}^{N} \sum_{n=1}^{N} P^1_{k,n} \rho_{k,n}, \quad \rho_{k,n} \in \{0, 1\} \qquad (21.33)$$

and the constraints become

$$\sum_{n=1}^{N} \rho_{k,n} = 1 \ \ \forall k \in \{1, \ldots, N\} \quad \text{and} \quad \sum_{k=1}^{N} \rho_{k,n} = 1 \ \ \forall n \in \{1, \ldots, N\}$$

Although the Hungarian method has computation complexity $O(n^4)$ in the allocation
problem with fixed modulation, it may serve as a base for adaptive modulation.
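
The row-repetition step that turns the K x N power matrix into a square assignment problem can be sketched as follows. For brevity the sketch solves the resulting assignment by exhaustive search over permutations, which is only a stand-in for the Hungarian method and is practical only for very small N; the function names and the example matrix are hypothetical.

```python
from itertools import permutations


def expand_to_square(power, rates):
    """Repeat row k of the K x N power matrix rates[k] times (sum must be N)."""
    n_sub = len(power[0])
    if sum(rates) != n_sub:
        raise ValueError("rates must sum to the number of subcarriers")
    rows, owners = [], []
    for k, (row, r_k) in enumerate(zip(power, rates)):
        rows.extend([list(row)] * r_k)
        owners.extend([k] * r_k)
    return rows, owners


def assign_subcarriers(power, rates):
    """Minimum-power one-to-one assignment found by exhaustive search."""
    rows, owners = expand_to_square(power, rates)
    n = len(rows)
    best = min(permutations(range(n)),
               key=lambda perm: sum(rows[i][perm[i]] for i in range(n)))
    total = sum(rows[i][best[i]] for i in range(n))
    allocation = {k: [] for k in range(len(power))}
    for i, sub in enumerate(best):
        allocation[owners[i]].append(sub)
    return {k: sorted(subs) for k, subs in allocation.items()}, total


# Hypothetical 2-user, 3-subcarrier power matrix; user 0 needs 2 subcarriers.
allocation, total = assign_subcarriers([[1.0, 4.0, 2.0], [3.0, 1.0, 5.0]], [2, 1])
```

Because each repeated row carries the same costs, any optimal permutation maps back to the same set of subcarriers per user, which is what the allocation dictionary reports.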

The bit loading algorithm (BLA) is used after the subcarriers are assigned to users that
have at least $R_k$ bits assigned. The bit loading procedure is as simple as incrementing bits
of the assigned subcarriers of the users as long as $P_T \le P_{\max}$. If $\Delta P_{k,n}(c)$ is
the additional power needed to increment one bit of the $n$th subcarrier of the $k$th user,
$\Delta P_{k,n}(c_{k,n}) = [f(c_{k,n} + 1) - f(c_{k,n})]/\alpha_{k,n}^2$, then the bit loading
algorithm assigns one bit at a time with a greedy approach to the subcarrier
$\arg\min_{k,n} \Delta P_{k,n}(c_{k,n})$.

BL algorithm

(1) For all $n$, set $c_{k,n} = 0$, compute $\Delta P_{k,n}(c_{k,n})$, and set $P_T = 0$;

(2) Select $\hat{n} = \arg\min_n \Delta P_{k,n}(c_{k,n})$;

(3) Set $P_T = P_T + \Delta P_{k,\hat{n}}(c_{k,\hat{n}})$ and $c_{k,\hat{n}} = c_{k,\hat{n}} + 1$;

(4) Update $\Delta P_{k,\hat{n}}(c_{k,\hat{n}})$;

(5) Check $P_T \le P_{\max}$ and $R_k$ for $\forall k$; if not satisfied GOTO STEP 2.

(6) Finish.
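
A minimal sketch of the greedy bit loading idea is given below. It assumes the subcarriers have already been assigned, takes the squared channel gains and the power function f as inputs, and stops as soon as the next cheapest bit would exceed the power budget; the per-user rate check of step (5) is omitted for brevity. All names and numbers are illustrative.

```python
def bit_loading(alpha_sq, f, p_max, max_bits):
    """Greedy bit loading over already-assigned subcarriers.

    alpha_sq -- dict {(k, n): squared channel gain} of assigned subcarriers
    f        -- f(c): required received power for c bits, with f(0) == 0
    p_max    -- total transmit power budget
    max_bits -- largest number of bits M allowed on one subcarrier
    Returns (bits per subcarrier, total power used).
    """
    bits = {key: 0 for key in alpha_sq}
    p_total = 0.0
    while True:
        # Incremental power of one more bit on every subcarrier not yet full.
        candidates = {key: (f(c + 1) - f(c)) / alpha_sq[key]
                      for key, c in bits.items() if c < max_bits}
        if not candidates:
            break
        key = min(candidates, key=candidates.get)
        if p_total + candidates[key] > p_max:
            break  # even the cheapest extra bit exceeds the budget
        p_total += candidates[key]
        bits[key] += 1
    return bits, p_total


# Hypothetical user 0 with two subcarriers and f(c) proportional to 2^c - 1.
loaded, used = bit_loading({(0, 0): 1.0, (0, 1): 0.5},
                           lambda c: 2 ** c - 1, p_max=7.0, max_bits=4)
```

Because f is convex in c, each extra bit on the same subcarrier costs more than the previous one, so the greedy choice naturally spreads bits towards subcarriers with better gains first.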

The Hungarian approach and LP approach with bit loading appear as two different subop- 
timal solutions to the resource allocation with adaptive modulation. In the sequel they will 
be referred to as Greedy Hungarian (GH) and Greedy LP (GLP). 

21.4.1 Iterative solution 

The GreedyLP and GreedyHungarian methods both first determine the subcarriers and then
increment the number of bits on them according to the rate requirements of users. This may
not be a good schedule in some cases, such as a user with only one good subcarrier and a low
rate requirement. The best solution for that user is to allocate its good subcarrier with a high
number of bits. However, if GreedyLP or GreedyHungarian is used, the user may be allocated more
than one subcarrier with a lower number of bits and, in some cases, its good subcarrier is never
selected. Consider another scenario where a user does not have any good subcarrier (i.e. it
may have a bad channel or be at the edge of the cell). In this case, rather than pushing more
bits and allocating fewer subcarriers, as in GreedyLP and GreedyHungarian, the opposite
strategy is preferred, since fewer bits in a higher number of subcarriers give a better result.
Another difficulty arises in providing fairness. Since GreedyLP and GreedyHungarian are
based on a greedy approach, the user in the worst condition usually suffers. In any event,
these are complex schemes and simpler schemes are needed to finish the allocation within
the coherence time. To cope with these challenges, in the following a simple, efficient and
fair subcarrier allocation scheme is introduced with iterative improvement [53].

The scheme is composed of two modules, referred to as the scheduling and improvement
modules. In the scheduling module, bits and subcarriers are distributed to the users and passed
to the improvement module, where the allocation is improved iteratively by bit swapping
and subcarrier swapping algorithms.

The fair scheduling algorithm starts the allocation procedure with the highest level of
modulation scheme. In this way, it tries to find the best subcarrier of a user to allocate the
highest number of bits. In Koutsopoulos and Tassiulas [55] the strategy is described by an
analogy: 'The best strategy to fill a case with stone, pebble and sand is as follows. First filling
the case with the stones and then filling the gap left from the stones with pebbles and in
the same way, filling the gap left from pebbles with sand. Since filling in opposite direction
may leave the stones or pebbles outside'. With this strategy more bits can be allocated and
the scheme becomes immune to uneven QoS requirements. The fair scheduling algorithm
(FSA) runs the greedy release algorithm (GRA) if there are nonallocated subcarriers after the
lowest modulation turn and the rate requirement is not satisfied. GRA decrements one bit
of a subcarrier to gain power reduction, which is used to assign a higher number of bits to
the users on the whole. FSA is described as follows.

FS algorithm 

(1) Set $c = M$, select a $k$, and set $P_T = 0$;

(2) Find $\hat{n} = \arg\min_n P^c_{k,n}$;

(3) Set $R_k = R_k - c$ and $\rho_{k,\hat{n}} = 1$, update $P_T$, shift to the next $k$;

(4) If $P_T > P_{\max}$, step out and set $c = c - 1$, GOTO STEP 2.

(5) If $\forall k$, $R_k < c$, set $c = c - 1$, GOTO STEP 2.

(6) If $\{c == 1\}$, $\sum_{k=1}^{K} \sum_{n=1}^{N} \rho_{k,n} < N$, $P_T > P_{\max}$, run "greedy release" and GOTO STEP 2.

(7) Finish.

The greedy releasing algorithm tends to fill the unallocated subcarriers. It releases one 
of the bits of the most expensive subcarrier to gain power reduction in order to drive the 
process. GRA works in the opposite direction to BLA. GRA is described as follows. 

GR algorithm 

(1) Find $\{\hat{k}, \hat{n}, \hat{c}_{\hat{k},\hat{n}}\} = \arg\max_{k,n,c} P^c_{k,n} \rho_{k,n}$ $\forall c$;

(2) Set $c_{\hat{k},\hat{n}} = c_{\hat{k},\hat{n}} - 1$, $P_T = P_T - \Delta P_{\hat{k},\hat{n}}(c_{\hat{k},\hat{n}})$;

(3) Set $c = \hat{c}_{\hat{k},\hat{n}} - 1$;

(4) Finish.

The horizontal swapping algorithm (HSA) aims to smooth the bit distribution of a user.
When the subcarriers are distributed, the bit weight per subcarrier can be adjusted to reduce
power. One bit of a subcarrier may be shifted to another subcarrier of the same user if
there is a power reduction gain. Therefore, variation of the power allocation per subcarrier
is reduced and a smoother transmission is performed. HSA is described as follows.

HS algorithm 

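(1) Set $P_c = \infty$;

(1a) Find $\{\hat{k}, \hat{n}, \hat{c}_{\hat{k},\hat{n}}\} = \arg\max_{k,n,c} (P^c_{k,n} \rho_{k,n}) < P_c$ $\forall c$;

(2) Define $n \in S_{\hat{k}}$, where $\{\rho_{\hat{k},n} == 1\}$ for $\forall n$;

(3) Set $\Delta_{\hat{n}} = \max_n \Delta P_{\hat{k},\hat{n}}(c_{\hat{k},\hat{n}} - 1) - \Delta P_{\hat{k},n}(c_{\hat{k},n})$, $n \in S_{\hat{k}}$;

(4) Set $P_c = P^{\hat{c}}_{\hat{k},\hat{n}}$;

(4a) If $\Delta_{\hat{n}} > 0$, set $P_T = P_T - \Delta_{\hat{n}}$;

(4b) Set $c_{\hat{k},\hat{n}} = c_{\hat{k},\hat{n}} - 1$, $c_{\hat{k},n} = c_{\hat{k},n} + 1$, GOTO Step (1a);

(5) If $\{P_c = \min_{k,n,c} (P^c_{k,n} \rho_{k,n})\}$, finish.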

The vertical swapping algorithm (VSA) does vertical swapping for every pair of users. In
each iteration, users try to swap their subcarriers such that the power allocation is reduced.
There are different types of vertical swapping. For instance, in triple swapping, user $i$ gives
its subcarrier to user $j$ and in the same way user $j$ to user $k$ and user $k$ to user $i$. In
Koutsopoulos and Tassiulas [55], pairwise swapping is modified to cope with the adaptive
modulation case. In this case, there is more than one class, where each class is defined by
its modulation (i.e. the number of bits loaded to a subcarrier), and swapping is only within the
class. Each pair of users swap their subcarriers that belong to the same class if there is a power
reduction. In this way, adjustment of subcarriers is done across users, to try to approximate
the optimal solution. VSA is described as follows.

VS algorithm 

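(1) $\forall$ pair of users $\{i, j\}$;

(1a) Find $\partial P_{i,j}(n) = P^c_{i,n} - P^c_{j,n}$ and $\Delta^{\hat{n}} P_{i,j} = \max \partial P_{i,j}(n)$, $\forall n \in S_i$;

(1b) Find $\partial P_{j,i}(n) = P^c_{j,n} - P^c_{i,n}$ and $\Delta^{\hat{m}} P_{j,i} = \max \partial P_{j,i}(n)$, $\forall n \in S_j$;

(1c) Set $\Omega^{\hat{n},\hat{m}} P_{i,j} = \Delta^{\hat{n}} P_{i,j} + \Delta^{\hat{m}} P_{j,i}$;

(1d) Add $\Omega^{\hat{n},\hat{m}} P_{i,j}$ to the $\{\Lambda\}$ list;

(2) Select $\Omega = \max_{(i,\hat{n}),(j,\hat{m})} \Lambda$;

(3) If $\Omega > 0$, switch subcarriers and set $P_T = P_T - \Omega$, GOTO STEP (1a);

(4) If $\Omega \le 0$, finish.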

21.4.2 Resource allocation to maximize capacity 

Suppose there are no fixed requirements per symbol and the aim is to maximize capacity.
It has been shown in Viswanath et al. [60] that, for point-to-point links, a fair allocation
strategy maximizes total capacity and the throughput of each user in the long run, when
the users' channel statistics are the same. The idea underlying the proposed fair scheduling
algorithm is to exploit the multiuser diversity gain.

With a slight modification, the fair scheduling algorithm for point-to-point communication
was extended to an algorithm for point-to-multipoint communication [53]. Suppose
the user's time-varying data rate requirement $R_k(t)$ is sent by the user to the base station as
feedback of the channel condition. We treat symbol time as the time slot, so $t$ is discrete,
representing the number of symbols. We keep track of the average throughput $T_{k,n}$ of each
user for a subcarrier in a past window of length $t_c$. The scheduling algorithm will schedule a
subcarrier $\hat{n}$ to a user $\hat{k}$ according to the criterion

$$\{\hat{k}, \hat{n}\} = \arg\max_{k,n} (r_{k,n}/T_{k,n})$$

where $T_{k,n}$ can be updated using an exponentially weighted low-pass filter described in
Viswanath et al. [60]. Here, we are confronted with determining the $r_{k,n}$ values. We can
set $r_{k,n}$ to $R_k/N$, where $N$ is the number of carriers. With this setting, the peaks of the
channel for a given subcarrier can be tracked. The algorithm schedules a user to a subcarrier
when the channel quality in that subcarrier is high relative to its average condition in that
subcarrier over the time scale $t_c$. When we consider all subcarriers the fairness criterion
matches the point-to-point case as

$$\hat{k} = \arg\max_k R_k/T_k, \quad \text{where } T_k = \sum_{n=1}^{N} T_{k,n}$$

The theoretical analysis of the fairness property of the above relation for point-to-point
communication is derived in Viswanath et al. [60]. Those derivations can be applied to
point-to-multipoint communication.
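
One symbol-time step of such a scheduler might look as follows. As a simplification assumed for this sketch, each subcarrier is given independently to the user with the largest ratio of requested rate to average throughput, and the averages are then updated with an exponentially weighted filter over a window of t_c symbols; the function name and data layout are illustrative.

```python
def schedule_slot(rates, avg, t_c):
    """One symbol-time step of per-subcarrier proportional-fair scheduling.

    rates -- rates[k][n]: requested rate r_{k,n} of user k on subcarrier n
    avg   -- avg[k][n]: average throughput T_{k,n} (> 0), updated in place
    t_c   -- length of the averaging window in symbols
    Returns the list of scheduled users, one entry per subcarrier.
    """
    n_users, n_sub = len(rates), len(rates[0])
    winners = []
    for n in range(n_sub):
        # Pick the user whose rate is highest relative to its own average.
        k_best = max(range(n_users), key=lambda k: rates[k][n] / avg[k][n])
        winners.append(k_best)
        for k in range(n_users):
            served = rates[k][n] if k == k_best else 0.0
            avg[k][n] = (1.0 - 1.0 / t_c) * avg[k][n] + served / t_c
    return winners
```

A user that has not been served for a while sees its average decay, which raises its ratio and eventually wins it a subcarrier even if its absolute rate is modest; this is the mechanism that provides fairness while still riding the channel peaks.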

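21.4.2.1 Performance example

The required transmission power for $c$ bits/subcarrier at a given BER with unity channel
gain is [57]:

$$f(c, BER) = \frac{N_0}{3} \Big[ Q^{-1}\Big( \frac{BER}{4} \Big) \Big]^2 (2^c - 1)$$

where $N_0$ sets the noise power level and $Q^{-1}(x)$ is the inverse function of

$$Q(x) = \frac{1}{\sqrt{2\pi}} \int_x^{\infty} e^{-t^2/2} \, dt$$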
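
The power function can be evaluated with the standard library alone. The sketch below assumes the widely used M-QAM form, in which the required power is proportional to the squared inverse Q-function of BER/4 and to 2^c - 1, with a noise level N0 normalized to 1 by default; the function names are illustrative.

```python
from statistics import NormalDist


def q_inverse(p):
    """Inverse of the Gaussian tail function Q(x) = 1 - Phi(x)."""
    return NormalDist().inv_cdf(1.0 - p)


def required_power(c, ber, n0=1.0):
    """f(c, BER) = (N0 / 3) * [Q^{-1}(BER / 4)]^2 * (2^c - 1)."""
    return (n0 / 3.0) * q_inverse(ber / 4.0) ** 2 * (2 ** c - 1)


# Relative cost of 4 bits versus 2 bits per subcarrier at the same BER.
ratio = required_power(4, 1e-4) / required_power(2, 1e-4)
```

Since the BER term is common to both evaluations, the ratio depends only on the constellation sizes, which illustrates why each additional bit roughly doubles the required power.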

Figure 21.9 shows the average data rates per subcarrier vs total power constraint when there
are four users. Each user has a rate requirement of 192 b/symbol (maximum rate) and a
BER requirement of $10^{-4}$. The performance of the iterative approach is close to that of the
optimal one, and the difference between the suboptimal and iterative approaches decreases as
the total transmit power increases.



21.5 PREDICTIVE FLOW CONTROL AND QoS

Even if the dimensioning of network resources has been done correctly and the admission
control mechanism is good, the network may go into periods of congestion due to the
transient oscillations in the network traffic. For this reason it is necessary to develop a
mechanism to quickly reduce the congestion or pre-empt it, so as to cause the least possible
degradation of QoS to the underlying applications [61-81]. 4G networks will carry a mixture
of real-time (RT) traffic, like video or voice traffic, and nonreal-time (NRT) traffic, like data.
One approach to controlling the NRT traffic is to predict the RT traffic (at the link of
interest) at some time in the future and then, based on this prediction, control the NRT
traffic.

Figure 21.9 Spectral efficiency vs total transmission power.

Figure 21.10 Individual link-level model.

In Figure 21.10, $V_l(n)$ and $a_l(n)$ correspond to the aggregate RT traffic and NRT traffic,
respectively, arriving at a link of interest (link $l$ having capacity $\mu_l$), at time $n$. One
can then estimate $C_l(n)$, the available link capacity for NRT traffic, at some time in the
future. This information would then be used at the network level to distribute the available
link capacities to the NRT flows.

On the network level the available link capacities for the NRT flows are then distributed
to maximize throughput (or more generally some utility function), based on appropriate
fairness requirements. An example network is shown in Figure 21.11, where flows traverse
links with available capacities for the NRT flows calculated at the individual link level. In
Chapter 7, the network-level problem has been investigated in the case when the available
link capacity for NRT flows at each node is a constant. The problem remains open in the
case when the available capacity is time-varying.

Figure 21.11 Network-level model.

Figure 21.12 System diagram of predictive flow control.

21.5.1 Predictive flow control model

In this section we focus on the individual link-level problem, a single multiplexing point in 
the network which consists of a link and an associated buffer that serve both RT and NRT 
traffic. The multiplexing point in the network could be an output port of a router/switch or 
a multiplexer. The system diagram is shown in Figure 21.12. 

In the following $V(n)$ will represent the aggregate amount of RT traffic that arrives at
the queue of interest at time $n$. $V_{\max} := \sup_{n>0}\{V(n)\}$ will be assumed finite and
$V(n)$ stationary in the mean, i.e. $\bar{V} := E\{V(n)\}$. The goal is to control the NRT traffic
based on predicting the aggregate RT traffic arrival rate at the queue. $a_i(n)$ will refer to the
available link capacity for the $i$th NRT traffic computed at time $n$ based on the predicted
value of the RT traffic rate. This explicit rate information is sent back to the $i$th NRT
traffic source.

If $N$ is the number of NRT traffic sources and $n_i$, $i = 1, \ldots, N$, is the round-trip delay
between the $i$th NRT source and the destination, then $a(n) = \sum_{i=1}^{N} a_i(n - n_i)$ is
the aggregate NRT traffic arrival to the queue at time $n$. A control message is propagated from
the queue of interest to the destination and back to the source. $\hat{V}_i(n)$ will represent the
predicted value of $V(n)$ based on the history of $V$ before time $n - n_i$. We assume that the
predictor is linear. A simple example can be represented as $\hat{V}_i(n + n_i) = V(n - k)$. If
$V(z)$ is the Z-transform of $V(n) - \bar{V}$ and $\hat{V}_i(z)$ the Z-transform of
$\hat{V}_i(n) - \bar{V}$, then $\hat{V}_i(z) = z^{-n_i} H_i(z) V(z)$, where $H_i(z)$, for NRT
traffic source $i$, is a causal, stable, linear, time-invariant system [74]. It should be noted
that $H_i(z)$ will be the same for all sources with the same RTT. For example, if there is only
one NRT flow with round-trip delay 5, a possible predictor (in the time domain) could be
$\hat{V}_1(n + 5) = (1/2)V(n) + (1/2)V(n - 1)$. In this case, we will have
$H_1(z) = (1/2)(1 + z^{-1})$.

The queue length, $q(n)$, at time $n$ at the queue of interest will be determined by
$a(n)$, $V(n)$, and the service rate (link rate) of the queue $\mu$. We assume that the queue
process begins at time $n = 0$ and $q(0) = 0$. For stability, we also require that $\bar{V} < \mu$.
The feedback control scheme is as follows. We predict the aggregate RT traffic rate and use
the predicted value to compute $a_i(n)$, $1 \le i \le N$.



21.5. 1. 1 Predictive flow control algorithm for single NRT traffic model 

To start with, we assume that there is only one NRT traffic source $a_1(n)$ (or a group of
NRT traffic loops with the same round-trip delay $n_1$). So, by definition, $a(n) = a_1(n - n_1)$.
Ideally, what we want to achieve is $a(n) + V(n) = \mu$ at all times $n$. However, there are
two difficulties in achieving this. First, since we do not know $V(n)$ in advance, we need
to estimate its value through prediction, resulting in a certain possibility of error. Second,
$V(n)$ could be greater than $\mu$, but since $a(n)$ cannot be negative, the sum $a(n) + V(n)$ cannot
be made equal to $\mu$. Having in mind the possibility of prediction error and the possibility
that $V(n) > \mu$, the NRT traffic $a_1(n)$ will be controlled as

$$a_1(n) = [p\mu - \hat{V}_1(n + n_1)]^+ \qquad (21.34)$$

where $p$ is the percentage of output link capacity that we would like to utilize [$p > (\bar{V}/\mu)$]
and $[x]^+ = x$ if $x > 0$, and $[x]^+ = 0$ otherwise. Consider now the situation of perfect
prediction, and let $V(n)$ have exceeded $\mu$ for some time. During the period that $V(\cdot)$ has
exceeded $\mu$, the above equation correctly sets $a_1(\cdot)$ to zero, but even after $V(\cdot)$ is no longer
larger than $\mu$, there could still be a substantial backlog in the queue, during which time
$a_1(\cdot)$ should be set to zero. However, according to Equation (21.34), the moment $V(n)$ is
less than $\mu$, the NRT source is allowed to transmit, thus potentially causing unnecessary
congestion at the queue. So, what we will attempt to do is to keep the queue length at the
node of interest small, while maintaining a certain level of throughput given by

$$\lim_{n \to \infty} \frac{1}{n} \sum_{j=1}^{n} [a(j) + V(j)] = p\mu \qquad (21.35)$$
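
The backlog problem described above can be made concrete with a short simulation of the rule in Equation (21.34). This is a sketch under stated assumptions: prediction is perfect, so the NRT arrival reaching the queue at time $n$ equals $[p\mu - V(n)]^+$; the queue is assumed to evolve as $q(n) = [q(n-1) + a(n) + V(n) - \mu]^+$; and the function name and the sample RT rates are invented for the illustration.

```python
def pos(x):
    """[x]^+ as defined in the text."""
    return x if x > 0 else 0.0

def simulate_eq_21_34(rt, mu, p):
    """Return (a(n), q(n)) pairs when NRT arrivals follow [p*mu - V(n)]^+.

    Assumes perfect prediction and the queue recursion
    q(n) = [q(n-1) + a(n) + V(n) - mu]^+ (assumptions of this sketch).
    """
    q, trace = 0.0, []
    for v in rt:
        a = pos(p * mu - v)      # NRT traffic admitted by Equation (21.34)
        q = pos(q + a + v - mu)  # backlog left after serving mu per slot
        trace.append((a, q))
    return trace

# RT traffic exceeds the link rate for two slots, then drops back.
rt = [100.0, 300.0, 300.0, 100.0, 100.0, 100.0]
trace = simulate_eq_21_34(rt, mu=200.0, p=0.98)
for n, (a, q) in enumerate(trace):
    print(n, a, q)
```

In this example the backlog reaches 200 after the overload, and because the NRT source resumes at rate 96 immediately, the queue then drains by only 4 units per slot (196, 192, 188) instead of 100 per slot.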



For this reason we define the control algorithm [81] as follows.

Control algorithm ($N = 1$ case)

(1) Define a virtual queueing process $q_1(n)$ and set $q_1(0) = 0$.

(2) $q_1(n) = [q_1(n - 1) + \hat{V}_1(n) - p\mu]^+$. For $n < 0$, we let $V(n) = 0$.

(3) $a_1(n) = [p\mu - \hat{V}_1(n + n_1) - q_1(n + n_1 - 1)]^+$. For $n < 0$, we let $a_1(n) = 0$.

For $N$ NRT traffic sources the algorithm is modified as follows.

Control algorithm ($N > 1$ case)

(1) Set $q_i(0) = 0$, $1 \le i \le N$.

(2) $q_i(n) = [q_i(n - 1) + \hat{V}_i(n) - p\mu]^+$. For $n < 0$, we let $V(n) = 0$.

(3) $a_i(n) = [p\mu - \hat{V}_i(n + n_i) - q_i(n + n_i - 1)]^+ / N$.

(4) For $n < 0$, we let $a_i(n) = 0$.
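
The steps above can be sketched in Python as follows. This is a minimal illustration rather than the implementation from [81]: the function name `nrt_rates`, its arguments, the convention that `v_hat[m]` holds the predicted RT rate for time m, and the sample values are all assumptions made for the example.

```python
def pos(x):
    """[x]^+ as defined in the text."""
    return x if x > 0 else 0.0

def nrt_rates(v_hat, n_i, mu, p, num_sources=1):
    """Rates a_i(n) for n = 0 .. len(v_hat) - n_i - 1.

    v_hat[m] is the predicted RT rate for time m; n_i is the round-trip
    delay of source i; num_sources is N in the N > 1 case.
    """
    target = p * mu
    # Steps (1)-(2): virtual queue driven by the predicted RT rate.
    q = [0.0] * len(v_hat)
    for m in range(1, len(v_hat)):
        q[m] = pos(q[m - 1] + v_hat[m] - target)
    # Step (3): capacity left after predicted RT traffic and virtual backlog.
    rates = []
    for n in range(len(v_hat) - n_i):
        m = n + n_i
        backlog = q[m - 1] if m >= 1 else 0.0
        rates.append(pos(target - v_hat[m] - backlog) / num_sources)
    return rates

# Predicted RT rate with a two-slot overload (illustrative values).
v_hat = [100.0, 100.0, 100.0, 300.0, 300.0, 100.0, 100.0, 100.0]
rates = nrt_rates(v_hat, n_i=2, mu=200.0, p=0.98)
print(rates)
```

With these inputs the source is held at zero for four slots and then resumes at 80 rather than 96, because the virtual queue still reports a backlog of 16 when the predicted RT rate has already dropped. Passing `num_sources=2` halves every rate, as in step (3) of the $N > 1$ case.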

21.5.1.2 Performance example

Three different alternatives for the predictor are used [81]. In A1, the fixed low-pass filter
is chosen as $H_{LPF}(z) = (1/4)(1 + z^{-1} + z^{-2} + z^{-3})$. The MMSE predictor, A2, is designed as
follows. First, the low-pass filter of A1 is applied to the high-priority RT traffic. Next, a standard
minimum mean square error linear predictor $H_{MMSE}(z)$ of the form $\sum_{m \ge 0} h_m z^{-n_i - m}$
is calculated based on the low-frequency part of the RT traffic. The final MMSE predictor
is $z^{-n_i} H_i(z) = H_{LPF}(z) H_{MMSE}(z)$. Note that $H_{MMSE}(z)$ will require explicit knowledge of
the per-flow round-trip delays $n_i$. In A3 that information is approximated by
an average value $n_0$ for all flows. The prediction error is defined as $\Delta(j) = V(j) - \hat{V}_1(j)$.
The queue length at the node of interest is bounded by [79]




Under the definitions and predictive flow control algorithm defined above, if $V_{max} < \infty$
and $[(p\mu - \bar{V})/(\mu - \bar{V})] < H_1(1) \le 1$, we have [81] $q(n) - q_0(n) < 2C_1$, where $C_1$ is a
constant that does not depend on $n$.
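
As a quick check of the lower bound in this condition, the values from the performance example in this section (link capacity 200 kb/s, mean RT rate 100 kb/s, utilization 98%) can be combined with the fixed low-pass filter of A1. The arithmetic below is only a worked illustration of the inequality, not a result taken from [81].

```latex
\frac{p\mu - \bar{V}}{\mu - \bar{V}}
  = \frac{0.98 \times 200 - 100}{200 - 100}
  = 0.96,
\qquad
H_{LPF}(1) = \tfrac{1}{4}(1 + 1 + 1 + 1) = 1 .
```

Since $0.96 < 1$, a predictor with unit gain at $z = 1$ meets the lower bound with some margin in this setting.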

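For $l = n - n_0$ we define the accumulated error as

$$X_{n,l} = \sum_{j=n-l+1}^{n} \Delta(j) = \sum_{j=n_0+1}^{n} \Delta(j)$$

where $E\{X_{n,l}\} = 0$ (because the predictor is unbiased) and

$$\mathrm{Var}\{X_{n,l}\} = \mathrm{Var}\Big\{\sum_{j=n-l+1}^{n} \Delta(j)\Big\} = \sum_{j_1=n-l+1}^{n} \sum_{j_2=n-l+1}^{n} C_\Delta(j_1 - j_2) = \sum_{j_1=-l+1}^{0} \sum_{j_2=-l+1}^{0} C_\Delta(j_1 - j_2) \qquad (21.36)$$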
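
The double sum in Equation (21.36) can be evaluated directly, or regrouped by lag when $C_\Delta$ is taken to be the (symmetric) autocovariance of a stationary prediction error. The sketch below shows both forms; the function names and the geometric autocovariance `example_cov` are hypothetical and serve only to exercise the formula.

```python
def accumulated_error_variance(cov, l):
    """Var{X_{n,l}} as the double sum of cov(j1 - j2) over an l-by-l window."""
    return sum(cov(j1 - j2) for j1 in range(l) for j2 in range(l))

def accumulated_error_variance_by_lag(cov, l):
    """Same quantity grouped by lag k = |j1 - j2|; needs cov(-k) == cov(k)."""
    return l * cov(0) + 2 * sum((l - k) * cov(k) for k in range(1, l))

def example_cov(k):
    """Hypothetical error autocovariance, not taken from the source."""
    return 4.0 * 0.5 ** abs(k)

print(accumulated_error_variance(example_cov, 3))         # 22.0
print(accumulated_error_variance_by_lag(example_cov, 3))  # 22.0
```

The lag form needs only $l$ evaluations of the autocovariance instead of $l^2$, which is convenient when plotting the variance against $l$.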

For the results shown below, $V(n)$ is a generated Gaussian process which is
multitimescale-correlated with $C_V(k) = 479.599 \times 0.999^{|k|} + 161.787 \times 0.99^{|k|} +
498.033 \times 0.9^{|k|}$ and $\bar{V} = 100$ kb/s. This type of source has often been used to represent
the multiple time-scale correlation in network traffic [71, 80]. The link capacity is 200 kb/s,
and the utilization is set to 98%. The time unit is 1 ms and the unit of the queue length is
1 bit [81]. $\mathrm{Var}\{X_{n,l}\}$ is shown in Figure 21.13. We can see that, for the A1 and A3 predictors,
$\mathrm{Var}\{X_{n,l}\}$ converges to some constant when $l$ is large enough. The asymptotic variance of
A3 is also smaller than that of A1.

Figure 21.13 $\mathrm{Var}\{X_{n,l}\}$ with different predictors. (Reproduced by permission of IEEE [81].)

In Figure 21.14, we compare the above control algorithm
with the control algorithm that uses Equation (21.34). As we mentioned before, when
$\hat{V}_1(n) < p\mu$ for all $n$, the control algorithms reduce to the same linear equation. However,
when this condition is not true, the two algorithms are quite different. We can see this
difference in performance in Figure 21.14. In this simulation, there is only one NRT source
with a round-trip delay of 5 ms. The RT source and the link capacity are still the same as
in Figure 21.13. To see the difference when the condition $\hat{V}_1(n) < p\mu$ is violated, we use
the same A3 predictor in both control algorithms. In the figure, the improved control algorithm is
marked C1, and the one that uses Equation (21.34) is marked C2. The utilization is set to
$p = 98\%$. Note that when using control algorithm C2, given $p$, the utilization is not equal
to $p$. In this simulation, we set $p$ such that the measured utilization for C2 is 98%, which is
the same as in C1. From this figure, using the same predictor, we can see that the improved
control algorithm outperforms the one that uses Equation (21.34).

Figure 21.14 Tail (cumulative queue length) probabilities with different control algorithms.
(Reproduced by permission of IEEE [81].)
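
A Gaussian process with the three-term covariance $C_V(k)$ used in this example can be generated in several ways; the text does not say which one was used in [81]. One common construction, sketched here, sums three independent first-order autoregressive components, each contributing one geometric term. The function names, the seed handling and the choice of bits per 1 ms slot as the rate unit are assumptions of the sketch.

```python
import math
import random

# (variance, correlation coefficient) pairs read off C_V(k) in the text.
COMPONENTS = [(479.599, 0.999), (161.787, 0.99), (498.033, 0.9)]
MEAN_RATE = 100.0  # 100 kb/s expressed as bits per 1 ms time unit (assumed)

def generate_rt_traffic(num_samples, seed=0):
    """Sum of three stationary AR(1) Gaussian components plus the mean."""
    rng = random.Random(seed)
    # Start each component in its stationary distribution.
    state = [rng.gauss(0.0, math.sqrt(var)) for var, _ in COMPONENTS]
    samples = []
    for _ in range(num_samples):
        samples.append(MEAN_RATE + sum(state))
        # x(n) = r * x(n-1) + w(n), with Var{w} = var * (1 - r^2),
        # so the component keeps variance var and covariance var * r^|k|.
        state = [
            r * x + rng.gauss(0.0, math.sqrt(var * (1.0 - r * r)))
            for x, (var, r) in zip(state, COMPONENTS)
        ]
    return samples

def model_autocovariance(k):
    """C_V(k) as given in the text."""
    return sum(var * r ** abs(k) for var, r in COMPONENTS)

samples = generate_rt_traffic(1000)
print(len(samples), model_autocovariance(0))
```

A Gaussian model can occasionally produce negative rates; the sketch leaves such samples unclipped, and whether to clip them is a modelling choice outside what the text specifies.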